Platform Engineer
01
11M

Building infrastructure from scratch. Scaling systems to millions. Helping startups bootstrap and enterprises scale.

Available for part-time consulting (up to 20 hours/week)

About Me

I'm a Platform Engineer and Site Reliability Engineer with over a decade in cloud infrastructure. As the first SRE at Trade Republic, I helped grow the platform from 20,000 to more than a million customers—scaling Kubernetes clusters from tens of containers to thousands and the engineering organisation right alongside them.

I work across the entire infrastructure lifecycle. Kubernetes and AWS are at the core, joined by Terraform, Helm, and deep observability. I have run observability stacks handling tens of millions of active series and terabytes of logs a day, optimised ultra-low-latency trading systems, built in-house IAM and SLO tooling, and led SRE teams supporting hundreds of engineers.

Since 2024 I've worked as an independent consultant with startups and scaleups—available part-time alongside my main role. Whether you're standing up your first production environment or taming the complexity that comes with rapid growth, I bring the experience and focus to help you succeed.

Ideal Client Profile

  • Early-stage startups needing infrastructure expertise without a full-time hire
  • Mid-market SaaS companies scaling their platform infrastructure
  • Enterprise teams requiring part-time SRE/Platform support
  • Teams transitioning from manual processes to automated, scalable infrastructure

From Zero to One, One to a Million

Specialized expertise across the entire infrastructure lifecycle

0→1

Building from Scratch

I help startups and new projects bootstrap their infrastructure from the ground up. From initial architecture design to first deployments, I'm your go-to for building solid foundations.

  • Infrastructure architecture & design
  • CI/CD pipeline setup
  • Cloud platform selection & migration
  • DevOps best practices implementation
1→1M

Scaling to Millions

When your infrastructure needs to handle growth, I help you scale efficiently. From observability to automation, I ensure your systems can handle the load.

  • Observability & monitoring systems
  • Performance optimization & scaling
  • Automation & tooling development
  • Process improvement & SRE practices

Services & Expertise

Comprehensive platform engineering services tailored to your needs

🏗️

Infrastructure Architecture

Designing robust, scalable infrastructure from the ground up or optimizing existing systems.

☁️

Cloud Platform Expertise

AWS, GCP, Azure - helping you choose and implement the right cloud strategy.

🔄

CI/CD & DevOps

Building automated pipelines and implementing DevOps best practices for faster, safer deployments.

📊

Observability & Monitoring

Setting up comprehensive monitoring, logging, and alerting systems to keep your infrastructure healthy.

Scaling & Performance

Optimizing systems for growth, handling traffic spikes, and ensuring reliability at scale.

🛠️

Automation & Tooling

Building custom tools and automation to streamline operations and reduce toil.

Selected Projects

Real-world infrastructure challenges solved across the 0→1 and 1→1M spectrum

0→1

Langdock Platform Infrastructure

Built customer-management Terraform modules for AWS and GCP from an existing Azure deployment, added a Helm chart to run the Langdock stack on Kubernetes, and integrated Datadog-based observability—preparing the platform to scale from tens to hundreds of managed customer stacks.

Key Outcomes:

  • Customer-management Terraform modules for AWS and GCP from an Azure base
  • Helm chart to deploy the full stack on Kubernetes
  • Datadog observability built into the Terraform and Helm systems
  • Infrastructure prepared to scale from tens to hundreds of customer stacks

Technology Stack:

TerraformAWSGCPAzureHelmKubernetesDatadog
0→1

Langfuse Infrastructure

Designed and implemented a public Helm chart and public Terraform repositories for deploying the Langfuse LLM engineering platform across AWS, GCP, and Azure.

Key Outcomes:

  • Public Helm chart launched for easy Langfuse deployment
  • Public Terraform repositories supporting AWS, GCP, and Azure
  • Multi-cloud deployment that increased accessibility for users

Technology Stack:

HelmTerraformAWSGCPAzureKubernetes
1→1M

E-Commerce Scaleup — Fractional CTO

Led the platform engineering domain for an e-commerce scaleup as fractional CTO—owning the roadmap, consulting on architecture and design, prioritising high-impact work, and overhauling incident response and delivery.

Key Outcomes:

  • Reduced MTTR from 3 hours to 15 minutes with standard, SLO-based alerting
  • Cut average ticket lead time from 2.5 months to 2 weeks
  • Hired a technical manager for a 4-person team supporting ~50 engineers

Technology Stack:

Platform EngineeringArchitectureSLOsLeadershipHiring
0→1

Kubernetes Cluster Management API

Architected and built a Kubernetes Cluster Management API from scratch for an early-stage startup—Go backend, React frontend, and a Keycloak authentication stack with full tenant isolation and role management via RBAC.

Key Outcomes:

  • Cluster Management API designed and built end to end
  • Keycloak authentication with full tenant isolation
  • Role-based access control across tenants

Technology Stack:

KubernetesGoReactKeycloakRBAC
1→1M

In-house IAM & Observability (Fintech)

Built an in-house, tamper-proof Identity and Access Management solution with compliance monitoring and reporting, a Kubernetes Ingress-based authentication service, and a centralized SLO monitoring tool with standardized, executive-level metric reporting.

Key Outcomes:

  • Tamper-proof IAM with built-in compliance monitoring
  • Kubernetes Ingress authentication service for internal tools
  • Centralized SLO monitoring with standardized executive reporting

Technology Stack:

IAMKubernetesSLOsMonitoringCompliance

Let's Work Together

Ready to build or scale your infrastructure? Let's discuss how I can help.

Available for part-time consulting projects (up to 20 hours/week)