Machine Learning Engineer
Verita AI
Contract
Remote
Job description
Verita AI is at the forefront of next-generation artificial intelligence, emphasizing nuance, multimodal reasoning, and human judgment. We are dedicated to building high-trust data pipelines for training and evaluating models.
Our founding team comes from Mercor, Stanford, HRT, Citadel, Stripe and Yale. We partner with world-class researchers and engineers to advance experimentation, rigor, and reliability in the field of AI.
https://www.verita-ai.com/
Brief Description:
We’re hiring RL Environments Engineers to design and build MLE environments. The goal is to teach LLMs advanced concepts from modern AI/ML.
Compensation:
- $50–$150 USD per hour (dependent on expertise level and the quality of the take-home assignment).
- $500 bonus for the take-home assignment if offered a job.
Technical Evaluation:
If you are interested in exploring this role, please complete:
Verita AI | RL Environment Engineer Technical Screening
https://form.typeform.com/to/OFHwbf2E
Requirements:
- Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, or a related technical field.
- Strong Python skills (engineering-quality, not notebook-only).
- Production mindset including debugging, reliability, and iteration speed.
- Clear understanding of LLMs and their current limitations.
- Advanced English proficiency (C1/C2).
- Ability to meet throughput expectations and respond quickly to feedback.
- Complete an average of two tasks each week.
- Turn the take-home assignment into a complete task using company tools and guidelines upon onboarding.
You may be a good fit if:
- You have a deep understanding of transformer internals, training/inference of modern LLMs, or experience with inference libraries (vLLM, SGLang, etc.).
- You have expertise in CUDA or Pallas kernel development, optimizing non-trivial neural modules for specific hardware.
- You have expert knowledge of an active DL/ML research area, with publications or public code to show for it.
- You have strong fundamentals and broad research interests: you read many papers, understand them deeply, and have the creativity to translate them into RLVR problems.
- You have built complex interactive RL environments and have strong insights into open-ended RL-based learning systems.
Additional Details:
- Time Commitment: At least 4 hours of overlap with 9am–5pm PST (UTC-8).
- Process: Includes a HackerRank take-home assignment (2–5 hours average) with a one-week turnaround.
- Questions: Reach out to [email protected] or [email protected].
Job Type: Contract
Pay: $50.00 - $150.00 per hour
Expected hours: No less than 20 per week
Work Location: Remote