Senior Platform Engineer
Moss
Seniority: Senior
Model: Hybrid
Sector
Salary: Undisclosed
Contract: Full-Time
About the role
As a Senior Platform Engineer, you will join our core Platform team, which designs, builds, and maintains the infrastructure powering Moss. You will work on critical systems that must be updated without downtime, keeping our services secure, scalable, and resilient. You'll collaborate closely with product, data, and security teams, balancing planned initiatives with incident response, cloud engineering, and regular maintenance.
What you'll do
- Design, build, and operate cloud-native infrastructure (GKE, Kubernetes, networking, databases) supporting a high-availability, low-latency FinTech platform processing real-time payments across Europe.
- Own the reliability and scalability of 100+ microservices - including defining and enforcing SLOs, managing autoscaling strategies, and driving resilience patterns like circuit breakers, bulkheads, and graceful degradation.
- Lead safe, continuous deployment practices across a fully automated CD pipeline - including rollout strategies, rollback mechanisms, and deployment observability at scale.
- Drive observability across the platform - metrics, distributed tracing, and structured logging - with a focus on reducing MTTR and enabling engineers to self-serve incident diagnosis.
- Manage and evolve infrastructure-as-code (Terraform, Helm) with a no-ClickOps discipline - every change peer-reviewed, version-controlled, and auditable.
- Champion security and compliance practices, including Zero Trust architecture, Workload Identity, dynamic secrets via Vault, network policies, and audit readiness (ISO 27001, SOC 2).
- Own incident response across networking, load balancing, Kubernetes, and cloud services - and drive post-incident improvements that prevent recurrence.
- Raise the engineering bar - actively contribute to architectural decisions, review platform changes, and help grow the early-senior engineers on the team.
What you'll need
- 7+ years of total experience, with at least 4 years in platform, infrastructure, or SRE roles in a cloud-native environment.
- Deep Kubernetes expertise - scheduling internals, autoscaling (HPA/VPA/KEDA), pod lifecycle, network policies, PodDisruptionBudgets, and multi-zone topology.
- Strong grasp of microservices operational challenges at scale - service mesh, inter-service resilience patterns, connection pool management, graceful shutdown, and database migration safety in a continuous deployment model.
- Solid CI/CD experience - designing pipelines for 100+ services, immutable artefact management, Workload Identity Federation, and automated rollback.
- Hands-on observability experience - building platforms covering metrics, logs, and distributed traces, including across async boundaries (e.g. Kafka).
- Proficiency in infrastructure-as-code - Terraform and Helm as primary tools, with a strong IaC-first mindset.
- Programming proficiency in Golang and/or shell scripting for platform tooling.
- Proven troubleshooting skills across distributed systems - latency contagion, cascading failures, connection exhaustion, and autoscaling lag under traffic spikes.
Nice to have
- Experience with dynamic secrets management via HashiCorp Vault, including database credential rotation.
- Familiarity with GCP-specific primitives - Workload Identity, GKE Autopilot vs. Standard tradeoffs, Cloud Armor, VPC-native networking.
- Experience with KEDA or scheduled scaling strategies for predictable traffic spikes.
- Prior experience in a regulated FinTech or financial services environment.
What they offer
- Top-of-market compensation package, including equity.
- 20 days "work from abroad".
- 600 EUR/GBP Learning and Development Budget.

