Systems Software Engineer (Rust, ML Inference)

ai|coustics

Seniority

Midweight

Model

In-Office

Sector

Deeptech

Salary

€75,000 – €95,000

Contract

Full-Time

About the roleai-coustics is seeking a Systems Software Engineer to join the Systems team, working at the core of the real-time Audio AI SDK and inference infrastructure. You will help maintain, optimize, and expand the SDK that powers ai-coustics' speech enhancement and Voice AI products across a wide range of platforms, runtimes, and languages.
What you'll doDesign, implement, and optimize systems-level components of the ai-coustics SDK and inference runtime
Improve the performance, memory usage, and stability of the Airten real-time inference engine
Work on model execution, tensor operations, scheduling, streaming inference, and runtime abstractions
Support deployment of neural audio models across CPU, WASM, and other constrained runtime environments
Develop and maintain DSP modules and supporting audio-processing infrastructure
Maintain and expand C API and public C library, and support wrappers for C++, Python, and Rust
Design, implement, and extend testing pipeline, including unit tests, integration tests, numerical tests, and performance benchmarks
Write and maintain technical documentation for SDK APIs, runtime internals, model deployment, and integration guides
What you'll needStrong experience in systems programming, ideally with Rust
Solid understanding of C/C++ interoperability, ABIs, and FFI design
Experience building or maintaining SDKs, libraries, inference runtimes, or developer-facing systems
Familiarity with real-time systems, performance optimization, memory management, and profiling
Experience writing tests and benchmarks for low-level or performance-critical code
Comfortable working across multiple platforms such as Linux, macOS, Windows, and WASM
Familiarity with ML inference runtimes or deploying neural networks in production
Strong ownership mentality and clear written communication skills
Nice to haveExperience with model formats or inference engines such as ONNX, Burn, tract, TensorRT, TFLite, Core ML, or similar systems
Exposure to real-time audio constraints such as latency, jitter, buffering, streaming workloads, and deterministic processing
What they offerOpportunity to work at a rapidly growing Voice AI startup backed by top investors
Competitive salary package with stock options

APPLY →