Senior Engineer, Operational Excellence
GetYourGuide
Seniority
Senior
Model
Hybrid
Sector
Salary
Undisclosed
Contract
Full-Time
About the role
You will act as an "engineer for the engineers" — partnering with product teams to raise the bar on reliability, speed, and confidence in their systems. As a member of the Operational Excellence team, you will help GetYourGuide move toward a world of fewer interruptions and higher user trust — by preventing incidents before they happen and enabling teams to resolve them faster when they do.
What you'll do
- Drive down incident frequency, MTTD and MTTR
- Lead post-incident reviews and translate learnings into systemic improvements
- Build tooling and runbooks that enable teams to diagnose and resolve production issues faster
- Advance observability practice — metrics, logs, traces, dashboards, and alerting
- Ensure teams have meaningful SLOs and actionable alerts
- Improve change failure rate by helping teams invest in the right automated test coverage and pre-production validation
- Design and maintain paved paths for development, observability, testing, and incident response
- Work hands-on with product teams to help them improve system design, testability, and operational hygiene
What you'll need
- Deep understanding of observability tooling — we use Datadog (metrics, APM, logs, dashboards)
- Proven experience reducing MTTD, MTTR and change failure rate; DORA metrics are not just acronyms to you
- Strong coding skills in Java; comfortable reading and contributing in Go across infrastructure contexts
- Experience with Kubernetes, AWS, and service mesh technologies (Istio/Envoy)
- Solid understanding of distributed systems, networking, and container technology
- Hands-on experience with CI/CD, automated testing strategies, and build systems
- Ability to influence engineers and teams without direct authority
- Excellent written and verbal communication skills in English
Nice to have
- Led company-wide initiatives to measurably improve DORA metrics
- Driven improvements in automated testing that led to meaningful reductions in change failure rate and production incidents
- Embedded operational excellence practices into the culture of product engineering teams
- Driven meaningful cost-reduction outcomes through architectural or operational improvements
What they offer
- Annual personal growth budget and mentorship programs
- Work from anywhere in the world for 30 days per year
- Hybrid working approach with three days in-office (Mon, Tue, Thur) and two days optional at-home focus time
- Monthly transportation and fitness budget
- Discounts on GetYourGuide activities for you, friends, and family
- Health and wellness benefits

