Machine Learning Engineer

Verita AI

Contract

Remote

Job description

Verita AI is at the forefront of next-generation artificial intelligence, emphasizing nuance, multimodal reasoning, and human judgment. We are dedicated to building high-trust data pipelines for training and evaluating models.

Our founding team comes from Mercor, Stanford, HRT, Citadel, Stripe and Yale. We partner with world-class researchers and engineers to advance experimentation, rigor, and reliability in the field of AI.

https://www.verita-ai.com/

Brief Description:

We’re hiring RL Environments Engineers to design and build MLE environments. The goal is to teach LLMs advanced concepts from modern AI/ML.

Compensation:

$50–$150 USD per hour(dependent on the expertise level and quality of take-home assignment).
$500 bonus for the take-home assignment if offered a job.

Technical Evaluation:

If you are interested in exploring this role, please complete:

Verita AI | RL Environment Engineer Technical Screening

https://form.typeform.com/to/OFHwbf2E

Requirements:

Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, or a related technical field.
Strong Python skills (engineering-quality, not notebook-only).
Production mindset including debugging, reliability, and iteration speed.
Clear understanding of LLMs and their current limitations.
Advanced English proficiency (C1/C2).
Ability to meet throughput expectations and respond quickly to feedback.
Complete an average of two tasks each week.
Turn the take-home assignment into a complete task using company tools and guidelines upon onboarding.

You may be a good fit if:

You have a deep understanding of transformer internals, training/inference of modern LLMs, or experience with inference libraries (vLLM, SGLang, etc).
Expertise in CUDA or Pallas kernel development optimizing non-trivial neural modules to specific hardware
Expert knowledge in an active DL/ML research area, with publications or public code to show for it.
You have strong fundamentals and broad research interests, you read many papers, understand them deeply and have creativity to translate them into RLVR problems
You have built complex interactive RL environments and have strong insights into open-ended RL-based learning systems

Additional Details

Time Commitment: At least 4 hours overlap to UTC-8 (PST 9am–5pm).
Process: Includes a HackerRank take-home assignment (2–5 hours average) with a one-week turnaround.
Any questions, reach out to [email protected] or [email protected]

Job Type: Contract

Pay: $50.00 - $150.00 per hour

Expected hours: No less than 20 per week

Work Location: Remote

Apply