Research (Bits & Atoms)
Reinforcement Learning Engineer
Senior level · Hybrid (US)
Own RL at Clippable: train and evaluate policies that make Clippy better at multi-step decisions: which creative to ship, when to escalate to a human, how to allocate creator budget, and how to learn from campaign outcomes without brittle hand-tuned rules. You have shipped RL in production (not just benchmark wins): policy gradients, offline RL, reward design, simulation or logged-data training, and eval harnesses that catch regressions before users do. You will work across research and AI Infrastructure, turn experiments into stable training loops, and help the team decide what should be learned vs. engineered.
Apply by email[email protected], include your résumé, links to work, and what you'd like to ship.