reinforcement-learning Jobs

93 jobs from companies building with AI

Researcher, Synthetic RL

OpenAI · San Francisco
reinforcement-learning search research
full-time mid

Technical Lead, Safety Research

OpenAI · San Francisco
deep-learning alignment llm reinforcement-learning fine-tuning search
full-time lead

Software Engineer, Applied Evals

OpenAI · San Francisco
reinforcement-learning llm deep-learning evaluation
full-time mid

Research Engineer/Research Scientist, RL/Reasoning

OpenAI · San Francisco
reinforcement-learning search research
full-time mid

Researcher, Health AI

OpenAI · San Francisco
reinforcement-learning deep-learning llm alignment search research
full-time mid

Researcher, Safety Oversight

OpenAI · San Francisco
alignment reinforcement-learning research search
full-time mid

Researcher, Trustworthy AI

OpenAI · San Francisco
llm reinforcement-learning alignment search rust research
full-time mid

Research Engineer / Research Scientist, Post-Training

OpenAI · San Francisco
reinforcement-learning search research
full-time mid

Researcher, Robustness & Safety Training

OpenAI · San Francisco
alignment deep-learning reinforcement-learning search research
full-time mid

Staff Research Engineer, Discovery Team

Anthropic · San Francisco, CA
reinforcement-learning distributed-systems alignment research search
full-time lead

Software Engineer, Sandboxing

Anthropic · San Francisco, CA | New York City, NY
reinforcement-learning alignment distributed-systems search research
full-time mid

Senior Research Scientist, Reward Models

Anthropic · Remote-Friendly (Travel Required) | San Francisco, CA
llm fine-tuning reinforcement-learning alignment research search
full-time senior

Research Lead, Training Insights

Anthropic · Remote-Friendly (Travel Required) | San Francisco, CA | New York City, NY
pre-training alignment reinforcement-learning llm research search
full-time lead

Research Engineer, Virtual Collaborator (Cowork)

Anthropic · New York City, NY; San Francisco, CA; Seattle, WA
reinforcement-learning alignment search research
full-time mid

Research Engineer, Universes

Anthropic · Remote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY
reinforcement-learning fine-tuning distributed-systems llm alignment search
full-time mid

Research Engineer / Scientist, Alignment Science - London

Anthropic · London, UK
fine-tuning alignment llm nlp reinforcement-learning research
full-time mid

Research Engineer / Scientist, Alignment Science

Anthropic · San Francisco, CA
alignment nlp reinforcement-learning fine-tuning llm research
full-time mid

Research Engineer, Science of Scaling

Anthropic · London, UK
alignment reinforcement-learning deep-learning llm research search
full-time mid

Research Engineer, Reward Models Platform

Anthropic · Remote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY
alignment reinforcement-learning fine-tuning mlops distributed-systems research
full-time mid

Research Engineer / Research Scientist, Tokens

Anthropic · New York City, NY | Seattle, WA | San Francisco, CA
pytorch reinforcement-learning alignment research search
full-time mid

Agentic API

Build with our API. Let your agents post jobs and apply automatically.

# Search jobs
curl 'https://aidevboard.com/api/v1/jobs?tags=llm,pytorch'

# Register for API access
curl -X POST https://aidevboard.com/api/v1/register/company \
  -H 'Content-Type: application/json' \
  -d '{"name":"Acme AI","email":"hire@acme.ai"}'
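For agents that assemble queries in code, the search call above can be sketched in Python. This is a minimal sketch: the endpoint and the comma-separated `tags` parameter come from the curl example, while the helper name `build_search_url` is hypothetical, and nothing here assumes anything about the response format.

```python
from urllib.parse import urlencode

# Endpoint taken from the curl example above.
BASE_URL = "https://aidevboard.com/api/v1/jobs"


def build_search_url(tags):
    """Return the job-search URL for a list of tags (hypothetical helper)."""
    # Join tags with commas, as in the curl example; safe="," keeps the
    # commas unescaped so the URL matches the documented form.
    query = urlencode({"tags": ",".join(tags)}, safe=",")
    return f"{BASE_URL}?{query}"


if __name__ == "__main__":
    print(build_search_url(["llm", "pytorch"]))
```

The resulting URL can be passed to curl or to any HTTP client. Building it with `urlencode` rather than string concatenation means tags containing spaces or other reserved characters are escaped correctly.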