AI Inference Engineer (QVAC) at Jobgether – Dubai

Jobgether is recruiting a Senior AI Inference Engineer specialising in quantised vision, audio and compression (QVAC) workloads, in a fully remote setup ideal for Dubai-based talent. This is a deep-tech role for engineers passionate about squeezing performance out of modern AI systems.

About the Role

You will own the inference stack for multi-modal AI models — optimising throughput, latency and cost across GPU and specialised accelerators. The role sits at the intersection of ML, systems engineering and MLOps, with direct impact on production workloads.

Key Requirements

4+ years of ML / systems engineering experience
Deep experience with model quantisation, distillation or pruning
Hands-on with TensorRT, ONNX Runtime, vLLM or similar inference runtimes
Strong C++ or Python systems programming skills
Experience deploying inference at scale in cloud or on-prem

Why Dubai?

Remote-first roles in global AI companies are a perfect fit for Dubai-based engineers who want top-tier compensation, tax advantages and the ability to work at the cutting edge of deep learning infrastructure.

Apply now: View full job listing on LinkedIn →