Staff Software Engineer

Jeremy Cohen

Infrastructure, reliability &
developer experience.

12+ years building distributed systems and developer tooling at scale. Previously infrastructure and reliability at Meta/Instagram. Currently Staff Engineer at Webflow.

Staff Software Engineer who turns complex infrastructure problems into simple, reliable solutions. I set technical direction, align teams, and build platforms and tools that let engineering teams focus on what matters.

I'm Jeremy Cohen, a Staff Software Engineer at Webflow. Hard problems are where I do my best work. Over 12+ years at Meta/Instagram, Orchard, and Webflow, I've built a track record of tackling reliability crises, scaling bottlenecks, and overcoming thorny systems challenges by finding creative paths forward.

I think at the architect level: I've defined Instagram's reliability strategy, owned Orchard's developer experience direction, and led Webflow's MongoDB sharding and native analytics product from zero to GA.

2023 - Now
Staff Software Engineer
Webflow
Led zero-downtime sharding of 50%+ of Webflow's 4 TB primary database, eliminating the critical scaling bottleneck for company-wide data growth. Tech lead for Webflow Analyze, Webflow's native analytics product. Built a 0→1 multi-service backend enabling Optimize sites in Webflow.
2021 - 2022
Staff Software Engineer, Developer Experience
Orchard Technologies
Redesigned the E2E testing platform on self-hosted AWS infrastructure, reducing CI costs by 30%. Led migration from Protractor to Cypress across ~60% of UI test coverage. Drove platform standardization across a 70+ engineer org via shared base images, IaC dev environments, and acquisition stack integration.
2013 - 2021
Staff Software Engineer
Facebook
Instagram App Reliability Tech Lead (3yr) + EM (1yr), sustaining industry-leading iOS crash rates at global scale. Built an automated crash mitigation system with fully automated resolution of severe crash classes. Built regression detection tooling responsible for pre-production crash, OOM, and stall detection on iOS. Also worked across several Facebook teams: overhauled mobile data model infrastructure via code-generated models, designed realtime GraphQL-driven update systems for Facebook Sports Stadium, and launched Trending Topics.
Backend & Systems
Python · TypeScript · Node.js Go · Java · PHP/Hack REST · GraphQL · microservices
Cloud & Infrastructure
AWS · Docker · Terraform Pulumi · Kubernetes (CKA) Cloudflare · IaC · multi-region
Data & Storage
MongoDB · PostgreSQL · Redis ClickHouse · Snowflake sharding · ETL · streaming
CI/CD & Delivery
Buildkite · CircleCI · ArgoCD trunk-based dev canary · blue-green · release orch.
2009 - 2013
B.S. Computer Science
Cornell University
Magna Cum Laude.
Jeremy CohenBuilt with Next.js