AI Engineer – Applied LLMs, Workflows & Evals

Delvo

Seniority

Midweight

Model

In-Office

Sector

AI

Salary

Undisclosed

Contract

Full-Time

About the role
Build reliable LLM workflows and agent systems for Delvo's procurement intelligence platform. You'll design production-ready AI systems with strong evaluation frameworks, moving beyond demos to systems that deliver measurable results in real enterprise environments.
What you'll do
Develop LLM workflows for retrieval, tool use, structured outputs, and multi-step reasoning
Build agent orchestration systems with tools, control flows, retries, and safety checks
Create evaluation and monitoring infrastructure including golden datasets and regression detection
Implement enterprise integrations with ERPs and data sources, backed by strong observability
Optimize cost and latency through caching, streaming, batching, and intelligent model routing
Collaborate with design and forward-deployed engineers to ship AI for real users
What you'll need
Strong proficiency in TypeScript and Python for backend and product work
Hands-on experience with LLMs including RAG, function calling, structured parsing, and guardrails
Experience with the modern AI stack: Vercel AI SDK, OpenAI/Azure, embeddings, vector stores
An evaluation and quality mindset, with the ability to define tasks, gold data, and success criteria
Systematic approach to preventing regressions and maintaining AI system quality
Nice to have
Experience with data pipelines and ERP integrations
Knowledge of the procurement domain
Experience with Langfuse or similar monitoring tools
What they offer
Competitive salary plus meaningful equity with real ownership opportunity
AI-first tooling with unlimited tokens and best-in-class development tools
Deep technical growth in agentic systems and production AI
Direct founder access in a small team environment
Office at CIC Berlin, in the Kreuzberg startup ecosystem
Path to AI/ML leadership as the company scales
