Senior Parallel Programming Expert (CUDA/AVX)

Status

Open

Contract

Full-time
APPLY NOW

At EXXA, we are building the most cost-efficient, high-throughput AI infrastructure for large-scale, asynchronous workloads. Our mission is to balance Gen-AI demand and processing supply by leveraging idle GPUs, optimizing batch inference, and pushing AI models' inference efficiency.
If you are passionate about open-source AI, obsessed with performance, and love tackling complex technical challenges, we want to hear from you!

EXXA is hiring a Senior Parallel Programming Expert (CUDA/AVX) to co-lead the development of EXXA inference engine, focusing on batch processing and throughput rather than low-latency constraints.

Key responsibilities:
Qualifications:
Why you should join us:

🚀 Technical innovation

🌐 Remote first 💸 Competitive compensation and benefits 🙏 Backed by the best

Any questions?

Even if you don't meet all the qualifications, we encourage you to apply. Contact us if you have any questions at careers@withexxa.com.

APPLY NOW Other positions