Site Reliability Engineer

Helsing

Seniority

Midweight

Model

Hybrid

Sector

Deeptech

Salary

Undisclosed

Contract

Full-Time

This Site Reliability Engineer position involves designing, implementing, and managing on-premise Kubernetes infrastructure for high-security defense AI environments. You'll be responsible for building cloud-native platforms that enable development teams to operate services at scale while maintaining strict security requirements.
What you'll doDesign and build cloud-native infrastructure platforms on-premises with Kubernetes-based solutions
Create robust observability frameworks using Grafana, Prometheus, and distributed tracing
Architect secure, multi-tenant Kubernetes clusters with access controls and zero-trust networking
Develop operators and controllers to automate infrastructure provisioning and compliance
Build and maintain MLOps platforms for AI model deployment and scaling
Collaborate with Security teams on supply chain security and runtime protection
What you'll needScripting experience in Python, Go, Rust or Bash/Shell for automation
Experience with GitOps workflows and CI/CD automation
Deep Kubernetes expertise including custom controllers/operators and service mesh architectures
Hands-on experience with CNCF ecosystem tools like Helm, ArgoCD, Flux, and Falco
Expert-level knowledge of observability stack: Grafana, Prometheus, Loki, Tempo, OpenTelemetry
Expert understanding of networking concepts, protocols and security
Experience with MLOps platforms like Kubeflow or MLflow
Proficiency with Infrastructure as Code tools: Terraform, Ansible, OPA/Gatekeeper
Deep Linux/Unix system administration and distributed systems knowledge
High personal integrity, reliability, and attention to detail
Willingness to relocate to Munich, London, or Berlin
Nice to haveExperience running cloud-native workloads in air-gapped environments
Software engineering mindset with passion for building developer productivity tools
What they offerCompetitive compensation and stock options
Focus on outcomes, not time-tracking
Relocation support
Social and education allowances
Regular company events across Europe
Hands-on onboarding program

APPLY →

Site Reliability Engineer

What you'll do

What you'll need

Nice to have

What they offer

ABOUT HELSING

SIMILAR ROLES THIS WEEK