Site Reliability Engineer
Helsing
Seniority
Midweight
Model
In-Office
Sector
Salary
Undisclosed
Contract
Full-Time
About the role
This Site Reliability Engineer position involves designing, implementing, and managing on-premises Kubernetes infrastructure for high-security defense AI environments. You'll be responsible for building cloud-native platforms that enable development teams to operate services at scale while maintaining strict security requirements.
What you'll do
- Design and build on-premises, cloud-native infrastructure platforms based on Kubernetes
- Create robust observability frameworks using Grafana, Prometheus, and distributed tracing
- Architect secure, multi-tenant Kubernetes clusters with access controls and zero-trust networking
- Develop operators and controllers to automate infrastructure provisioning and compliance
- Build and maintain MLOps platforms for AI model deployment and scaling
- Collaborate with Security teams on supply chain security and runtime protection
What you'll need
- Scripting experience in Python, Go, Rust, or Bash/Shell for automation
- Experience with GitOps workflows and CI/CD automation
- Deep Kubernetes expertise including custom controllers/operators and service mesh architectures
- Hands-on experience with CNCF ecosystem tools like Helm, Argo CD, Flux, and Falco
- Expert-level knowledge of the observability stack: Grafana, Prometheus, Loki, Tempo, OpenTelemetry
- Expert understanding of networking concepts, protocols, and security
- Experience with MLOps platforms like Kubeflow or MLflow
- Proficiency with Infrastructure as Code and policy-as-code tools: Terraform, Ansible, OPA/Gatekeeper
- Deep Linux/Unix system administration and distributed systems knowledge
- High personal integrity, reliability, and attention to detail
- Willingness to relocate to Munich, London, or Berlin
Nice to have
- Experience running cloud-native workloads in air-gapped environments
- Software engineering mindset with a passion for building developer productivity tools
What they offer
- Competitive compensation and stock options
- Focus on outcomes, not time-tracking
- Relocation support
- Social and education allowances
- Regular company events across Europe
- Hands-on onboarding program

