Cloud Site Reliability Engineer (Scalability)
Scalable Capital
Seniority
Senior
Model
In-Office
Sector
Salary
Undisclosed
Contract
Full-Time
About the role
Shape the way how Scalable runs microservices in the most performant, secure and cost efficient way. The Scalability Engineering Team focuses on providing everything necessary to ensure Reliability and Scalability of services and storages while being cost efficient, including solutions for Monitoring, AutoScaling, Load Testing, Chaos Engineering and FinOps.
What you'll do
- Collaborate with cross-functional teams to identify and understand scalability requirements for our platform, both in terms of user growth and increasing data volume.
- Design and rollout Monitoring best practices in Datadog including SLI, SLO and SLAs
- Research and develop service and storage improvements by using serverless technologies and optimise our services and CICD to optimise scalability, cost and performance
- Develop and maintain internal tooling around Monitoring, Developer Portal and Load Testing
- Mentor and enable our software development teams to further foster our DevOps culture by educating them and providing reusable and unified building blocks
- Design and implement best practices around auto scaling of our infrastructure
- Run chaos engineering experiments to improve resilience of our services
What you'll need
- Multiple years of experience with AWS and infrastructure as code (preferably Terraform)
- Solid experience in Monitoring, Container Orchestration and Microservice setups is required
- Good working knowledge with Python, at least one additional general purpose programming language (Preferably Java/Kotlin or JavaScript/Node.js) and build automations tools
- Solid understanding of scalable system design principles, distributed systems, and cloud technologies
- A degree in a relevant field of study (e.g. computer science, engineering, sciences) or work experience in a role that typically requires a university degree
- Full professional proficiency in English and the ability to communicate concisely in an international English-speaking environment
Nice to have
- Experience with GitHub Actions and Jenkins would be beneficial
What they offer
- Work from offices in Munich or Berlin, or choose to work remotely within Germany
- Latest hardware and tools
- In-house knowledge sharing and Education Budget
- Flexible vacation policy and opportunity to work from abroad
- Attractive compensation package and company pension scheme
- Monthly 50% contribution for Deutschland Jobticket

