inference Jobs

33 jobs from companies building with AI

agents 134 alignment 168 deep-learning 80 distributed-systems 188 embeddings 47 evaluation 27 fine-tuning 158 go 49 gpu 57 inference 33 infrastructure 82 llm 519 mlops 33 nlp 71 pre-training 41 pytorch 170 reinforcement-learning 93 research 163 search 174 tensorflow 65

ML Compiler Engineer

HuggingFace · Remote (Global) · $180k - $320k
ml-compiler python c++ cuda quantization optimization inference
full-time senior

Staff Software Engineer - GenAI inference

Databricks · San Francisco, California
llm gpu distributed-systems inference
full-time lead

Sr. Manager, Engineering - AI Gateway (LLM Inference)

Databricks · New York
llm inference
full-time senior

Software Engineer - GenAI inference

Databricks · San Francisco, California
gpu distributed-systems llm inference
full-time mid

Member of Technical Staff, Inference

Runway · Remote
pytorch gpu diffusion-models inference
full-time lead

Generative AI Inference Engineer

Stability AI · United States
pytorch diffusion-models deep-learning inference
full-time mid

Senior Backend Engineer, Inference Platform

Together AI · San Francisco
distributed-systems gpu llm inference
full-time senior

Machine Learning Engineer - Inference

Together AI · San Francisco
gpu llm pytorch search inference research
full-time mid

Customer Support Engineer (Inference), India

Together AI · India
fine-tuning gpu inference
full-time mid

Engineering Manager - Inference

Perplexity · San Francisco
pytorch tensorflow llm gpu inference
full-time mid

Site Reliability Engineer, Inference Infrastructure

Cohere · Toronto
distributed-systems nlp llm infrastructure inference
full-time mid

Full-Stack Software Engineer, Inference

Cohere · Toronto
inference
full-time mid

Staff Software Engineer, Inference Infrastructure

Cohere · San Francisco
distributed-systems llm nlp infrastructure inference
full-time lead

Audio Inference Engineer, Model Efficiency

Cohere · New York
pytorch tensorflow deep-learning llm inference
full-time mid

Software Engineer, Collect

Cohere · Toronto
inference
full-time mid

Inference Technical Lead, On-Device Transformers

OpenAI · San Francisco
gpu inference transformers
full-time lead

Software Engineer, Inference – AMD GPU Enablement

OpenAI · San Francisco
llm gpu inference
full-time mid

Software Engineer, Inference - Multi Modal

OpenAI · San Francisco
llm inference
full-time mid

Inference Technical Lead, Sora

OpenAI · San Francisco
research search inference
full-time lead

TL, Research Inference

OpenAI · San Francisco
distributed-systems research inference search
full-time mid

Get AI Jobs in Your Inbox

Weekly digest of the best AI developer opportunities.

Agentic API

Build with our API. Let your agents post jobs and apply automatically.

# Search jobs
curl https://aidevboard.com/api/v1/jobs?tags=llm,pytorch

# Register for API access
curl -X POST https://aidevboard.com/api/v1/register/company \
  -d '{"name":"Acme AI","email":"hire@acme.ai"}'