Who Am I?

Technical Experience

Downloadable Version

APrime Technology Senior Software Engineer 2022-Present

During my time at APrime, I was a consultant that focused primarily on data engineering and devops. During my time consulting, I worked with companies ranging from pre-seed funding all the way through Series B companies across industries including ETF investmenting, usage-based billing platforms, benefits reimbursal, and many others. For larger companies I worked embedded in existing engineering teams and for smaller companies I worked more greenfield suggesting and implementing best practices. Because of this, I worked with a wide-range of different technologies and languages that addressed problems such as running a a GCP to AWS cloud migration, full private VPC migration, in-place kubernetes upgrades, custom Go based migration tool, full end-to-end machine learning pipeline using Prefect and R, migration of AWS Step Functions to Dagster, deployment of Argo Rollouts and ArgoCD, and others.

New York Times Senior Software Engineer; Machine Learning Infrastructure 2021-2022

My title at the New York Times is a bit wordy, but I, essentially, worked on the machine learning platform team that was in charge of serving all recommendations to end users. This includes things like recommending articles that a user might like, generating a top-5 of articles to read for a user, and creating a newsletter to drive users to the frontpage. I focused mostly on revamping the CI/CD process during this time to reduce deploy times as well as migrating our kubernetes cluster to a service mesh to allow for shadow and canary deployments.
Our endpoints received thousands of requests per second and we had a SLA of around 100 milliseconds to return a response to the internal gateway API. Given this, all of our models were written and experimented in Python but then subsequently translated into Golang to provide the best latency possible.

Bark Data Scientist to Machine Learning Engineer 2019-2021

I worked in a number of different roles at Bark, but I started as a data scientist and worked my up to a machine learning engineer. This meant that I started off working on our recommendation engine that we used to try and get users to add extra products to their monthly shipments. But, this also spun off into projects including trying to predict an estimated time of delivery and running survival analysis on our subscribers. Over time, though, I need to work further and further back in the stack and started shifting my focus from just the model but to the whole model lifecycle (i.e. how do we deploy, monitor, and update models safely). This involved standardizing all of our model training using KubeFlow (on GCP) to deploying models with TensorFlow Serving.
Of course, there were a number of other projects that came up, as is common in any startup. I migrated our MongoDB instance to Amazon Redshift, did some causal analysis to try and determine the effectives of our interventions in preventing subscriber churn, improved our sampling weighting to better capture a stratified random sample for our NPS survey, and creating a microservice to deploy our predictions rather than running everything as a batch process.

Trendalytics Software Engineer; Machine Learning 2018-2019

We were focused here on doing lots of image segmentation, image processing, and time-series analysis to try and predict upcoming fashion trends for clients. Most of my time here was spent on improving our implementation of the Yolo algorithm we were using for image segmentation and also looking at how we can combine a bunch of different model outputs (i.e. keyword search trends, NLP text extractions, and image vectors) into one singular usable score for our clients to action upon.

Institute for Health Metrics and Evaluation Data Analyst, Machine Learning Data Specialist 2016-2018

National Human Genome Research Institute Data Analyst 2014-2016

Competencies

Cloud Computing Environments:  AWS, GCP, Azure

Container Orchestration:  Kubernetes, Helm, Kustomize, Grafana, Argo Rollouts, Istio

Machine Learning Platforms:  SageMaker, Vertex, KubeFlow, DataBricks

Infrastructure as Code:  Terraform, Terragrunt, Atlantis

CI/CD:  CircleCI, GitLab, GitHub Actions, Drone CI

Data Engineering Platforms:  Prefect, Dagster, Airflow

Languages:  Golang, Python, TypeScript, R, Stata

Data Stores & Data Streaming:  PostgreSQL, MySQL, GraphQL, MongoDB, Confluent, ElasticSearch

Education

Wageningen University: Master's Data Science for Food and Health

Haverford College: Bachelor's in Biochemistry, Minor in Art History

WhatsApp

+1-404-805-4916

Location

Arnhem, Netherlands