Loading...
About the role:
We're partnering with a leading AI lab to evaluate how a frontier model performs on long-horizon protein-design problems. We're running a small, hands-on project with senior protein-design experts to author and evaluate well-scoped design problems — and to iterate live with our team to shape what good looks like.
What you'll do:
Brainstorm 2–3 candidate problem ideas (rough descriptions, ~1 page each). Examples of the shape we're looking for:
Iterate with us until we align on a task structure
Write up the chosen problem: target/inputs, allowed tools and information sources, what success looks like, and — most importantly — how you'd evaluate the model's design decisions and reasoning along the way.
Review the model's trajectories (likely 5–20 per problem) and grade design choices + reasoning quality at each step.
Critical fit criteria (please address these in your application)
Time commitment:
Compensation
Apply directly on Mercor to get started.