Job Drop BerlinYOUR WAY INTO BERLIN TECH
NewsletterLinkedIn
AboutTermsImpressumPrivacy

AI Research Engineer - ML Engineering

HHelsing
Seniority
Midweight
Model
In-Office
Sector
AI
Salary
Undisclosed
Contract
Full-Time

About the role

You'll architect and implement AI platforms and tools that enable autonomous decision-making for defence applications. Working with cross-functional teams, you'll focus on scaling distributed systems to maximize training throughput and developer velocity across high-volume data processing, reinforcement learning, and large-scale foundation models.

What you'll do

  • Extend highly integrated deep learning frameworks built on PyTorch, optimizing for efficiency and usability across diverse use cases
  • Scale infrastructure and tooling stack to support faster and larger distributed training operations
  • Design data strategy for large-scale datasets and efficient storage to ensure optimal GPU utilization

What you'll need

  • MSc or PhD in Computer Science or STEM field with focus on Machine Learning and Deep Learning
  • Strong Python software engineering skills and fluency with modern deep learning frameworks (PyTorch/JAX/TensorFlow)
  • Experience writing custom layers, loss functions, and distributed training loops
  • First-principles mindset with ability to rapidly integrate latest AI optimizations into codebases
  • Production ML pipeline debugging experience with ability to identify subtle numerical or performance issues
  • Clear communication skills and ability to build from complex theoretical concepts

Nice to have

  • Hands-on experience training models on large-scale GPU clusters with advanced parallelism strategies
  • Experience with large-scale multi-modal datasets and understanding of locality, encoding, and streaming tradeoffs
  • Proficiency with workload orchestrators like Slurm, Kubernetes, or Ray for managing concurrent training jobs
  • Low-level GPU architecture understanding including memory hierarchies and warp execution

What they offer

  • Competitive salary and stock options (ESOP)
  • Relocation support up to €2,500 and 4 weeks temporary accommodation
  • €500/£450 yearly learning allowance
  • Gym membership and mental health support
  • Enhanced parental leave: 22 weeks fully paid for primary caregivers
  • Hands-on AI-duction onboarding program to learn tech stack and ML pipelines
APPLY →