reinforcement-learning Jobs

93 jobs from companies building with AI

agents 134 alignment 168 deep-learning 80 distributed-systems 188 embeddings 47 evaluation 27 fine-tuning 158 go 49 gpu 57 inference 33 infrastructure 82 llm 519 mlops 33 nlp 71 pre-training 41 pytorch 170 reinforcement-learning 93 research 163 search 174 tensorflow 65

Research Engineer, Production Model Post-Training

Anthropic · San Francisco, CA | New York City, NY | Seattle, WA
alignment llm deep-learning distributed-systems reinforcement-learning fine-tuning
full-time mid

Research Engineer, Production Model Post-Training

Anthropic · Zürich, CH
reinforcement-learning distributed-systems alignment llm fine-tuning deep-learning
full-time mid

Research Engineer, Pretraining

Anthropic · London, UK
llm pytorch reinforcement-learning deep-learning alignment pre-training
full-time mid

Research Engineer, Pretraining

Anthropic · Remote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY
pre-training pytorch reinforcement-learning alignment llm deep-learning
full-time mid

Research Engineer, Machine Learning (Reinforcement Learning)

Anthropic · San Francisco, CA | New York City, NY
llm reinforcement-learning tensorflow pytorch gpu alignment
full-time mid

Research Engineer, Machine Learning (Reinforcement Learning)

Anthropic · London, UK
tensorflow gpu alignment code-generation llm reinforcement-learning
full-time mid

Research Engineer, Frontier Red Team (Autonomy)

Anthropic · San Francisco, CA
reinforcement-learning llm alignment search research
full-time mid

Research Engineer, Environment Scaling

Anthropic · Remote-Friendly (Travel Required) | San Francisco, CA
reinforcement-learning fine-tuning llm distributed-systems alignment research
full-time mid

Research Engineer, Discovery

Anthropic · San Francisco, CA
distributed-systems pytorch alignment reinforcement-learning research search
full-time mid

Research Engineer, Cybersecurity Reinforcement Learning

Anthropic · San Francisco, CA | New York City, NY
llm reinforcement-learning alignment fine-tuning research search
full-time mid

ML/Research Engineer, Safeguards

Anthropic · San Francisco, CA | New York City, NY
reinforcement-learning fine-tuning alignment search research
full-time mid

Machine Learning Systems Engineer, RL Engineering

Anthropic · San Francisco, CA | New York City, NY | Seattle, WA
llm reinforcement-learning fine-tuning distributed-systems alignment search
full-time mid

[Expression of Interest] Research Scientist / Engineer, Honesty

Anthropic · New York City, NY; San Francisco, CA
reinforcement-learning fine-tuning alignment rag llm search
full-time mid

Get AI Jobs in Your Inbox

Weekly digest of the best AI developer opportunities.

Agentic API

Build with our API. Let your agents post jobs and apply automatically.

# Search jobs
curl https://aidevboard.com/api/v1/jobs?tags=llm,pytorch

# Register for API access
curl -X POST https://aidevboard.com/api/v1/register/company \
  -d '{"name":"Acme AI","email":"hire@acme.ai"}'