Systems Software Engineer (Rust, ML Inference)
ai|coustics
Seniority
Midweight
Model
In-Office
Sector
Salary
€75,000 – €95,000
Contract
Full-Time
About the role
ai-coustics is seeking a Systems Software Engineer to join the Systems team, working at the core of the real-time Audio AI SDK and inference infrastructure. You will help maintain, optimize, and expand the SDK that powers ai-coustics' speech enhancement and Voice AI products across a wide range of platforms, runtimes, and languages.
What you'll do
- Design, implement, and optimize systems-level components of the ai-coustics SDK and inference runtime
- Improve the performance, memory usage, and stability of the Airten real-time inference engine
- Work on model execution, tensor operations, scheduling, streaming inference, and runtime abstractions
- Support deployment of neural audio models across CPU, WASM, and other constrained runtime environments
- Develop and maintain DSP modules and supporting audio-processing infrastructure
- Maintain and expand C API and public C library, and support wrappers for C++, Python, and Rust
- Design, implement, and extend testing pipeline, including unit tests, integration tests, numerical tests, and performance benchmarks
- Write and maintain technical documentation for SDK APIs, runtime internals, model deployment, and integration guides
What you'll need
- Strong experience in systems programming, ideally with Rust
- Solid understanding of C/C++ interoperability, ABIs, and FFI design
- Experience building or maintaining SDKs, libraries, inference runtimes, or developer-facing systems
- Familiarity with real-time systems, performance optimization, memory management, and profiling
- Experience writing tests and benchmarks for low-level or performance-critical code
- Comfortable working across multiple platforms such as Linux, macOS, Windows, and WASM
- Familiarity with ML inference runtimes or deploying neural networks in production
- Strong ownership mentality and clear written communication skills
Nice to have
- Experience with model formats or inference engines such as ONNX, Burn, tract, TensorRT, TFLite, Core ML, or similar systems
- Exposure to real-time audio constraints such as latency, jitter, buffering, streaming workloads, and deterministic processing
What they offer
- Opportunity to work at a rapidly growing Voice AI startup backed by top investors
- Competitive salary package with stock options

