Platform Engineer – Backend & Reliability

Trade Republic

Seniority

Midweight

Model

Hybrid

Sector

Fintech

Salary

Undisclosed

Contract

Full-Time

About the rolePlatform Engineering is the backbone of Trade Republic's engineering velocity. Our Backend Reliability team builds and owns Trade Republic's most critical backend systems — the services that process every trade, every payment, every savings plan. This is a software engineering role first: we believe the best reliability work happens at the code level, not the runbook level.
What you'll doDesign and implement new backend services and platform components in Kotlin — high-throughput, low-latency, fault-tolerant by design.
Own reliability as a first-class engineering concern: define and drive SLOs, model failure modes during design, implement circuit breakers, bulkheads, graceful degradation, and retry strategies directly in the application layer.
Design and run load tests, stress tests, and chaos experiments against critical backend services. Translate findings into concrete architectural improvements and engineering standards.
Define best practices for service design, error handling, resiliency patterns, and safe deployment — and work directly with engineering teams to raise the bar across the platform.
Model traffic growth, identify performance bottlenecks across the application and data layer, and own the engineering work that keeps systems performant ahead of demand.
Own post-mortems for critical service failures and ensure findings close real gaps — in architecture, test coverage, observability, or deployment practices.
Define and implement structured logging, distributed tracing, and alerting standards for critical backend services.
What you'll need5+ years of hands-on backend software engineering experience, with deep ownership of production systems at scale.
Strong programming skills in Go, with experience building infrastructure automation as software rather than scripts.
Proven experience building resilient, high-throughput distributed systems — strong intuitions around backpressure, failure isolation, consistency trade-offs.
Experience with load testing, performance profiling, and capacity planning at scale.
Familiarity with event-driven architectures and distributed data systems in production.
Track record of raising engineering standards through documentation, design reviews, or internal tools.
Production experience with container orchestration and modern infrastructure.
Strong incident command and post-mortem skills.
What they offerBased in London, Berlin, or Paris with relocation support provided.
Flexible hybrid setup with 2–3 days a week in the office.

APPLY →