Calling the bold.

Uncork-backed companies are hiring explorers, builders, and operators ready to help chart new territory.

Machine Learning Engineering Intern, Evals/Post-training

Groq

Software Engineering, Data Science
Palo Alto, CA, USA
USD 30-50 / hour
Posted on Sep 20, 2025
Palo Alto, CA
Internships
Hybrid
Intern
About Groq
Groq delivers fast, efficient AI inference. Our LPU-based system powers GroqCloud™, giving businesses and developers the speed and scale they need. From our Bay Area roots to our growing global presence, we are on a mission to make high performance AI compute more accessible and affordable. When real-time AI is within reach, anything is possible. Build fast.
Machine Learning Engineering Intern, Evals/Post-training
Winter 2026 (January - April) Internship - full-time
Hybrid (Palo Alto, CA)
Mission:
We’re a small, fast team behind OpenBench (open, reproducible LLM evals). We turn model behavior into measurable progress, then upstream it. You’ll work alongside people, not for people: low ceremony, quick feedback, lots of ownership. You won’t be siloed; you’ll jump across evals, post-training, infra, and (when useful) product/GTM.
Responsibilities & opportunities in this role:
  • Build and reimplement evals (accuracy, robustness, safety, latency) end-to-end.
  • Run tight SFT/DPO/RLHF-style loops; track deltas and ship models for customers.
  • Red-team models; turn quirks into metrics and provide feedback to the inference team.
  • Own scoped projects: design → implement → document → upstream.
  • Write research papers on evals you build.
  • Pitch improvements across the company when you see them, then ship.
Ideal candidates have/are:
  • Founding Engineer (grinder)
    • You unblock yourself, learn fast, and ship relentlessly - scrappy first, then clean and reproducible.
    • Signals: productionized side projects, CI’d repos, tools other people actually use.
  • Researcher (loves data and pushing the frontier)
    • You reason clearly about eval design, failure modes, and data quality; you run ablations and write tight analyses.
    • Signals: careful experiments, thoughtful write-ups, PRs to open-source projects.
  • Must-haves
    • Agentic, kind, gritty.
    • Hands-on with evals, post-training, or applied AI (not just theory).
    • Comfort getting a bit hacky while keeping results reproducible.
Why Join Us
  • Purposeful Hiring: You’re not here by accident, and neither is anyone else. Every teammate is handpicked with intention because who we build with matters.
  • Builders Wanted: You’re not just riding the rocket ship, you’re building it. Your work directly shapes the trajectory of our company.
  • Mission-Driven Work: We’re here to make a real impact. Our mission fuels everything we do.
  • Tackling Hard Problems: If easy isn’t your thing, you’re in the right place. We solve some of the most complex and exciting challenges in our space.
  • Excellence Is The Standard: High performance isn’t just encouraged, it’s the baseline. And it’s contagious.
If this sounds like you, we’d love to hear from you!
Compensation: The US pay range for our technical internships is $30-$50 per hour. The pay range for our non-technical internships is $30-$40 per hour. Compensation is determined by your location, skills, qualifications, experience, and internal benchmarks. This range is specific to roles in the United States; compensation for candidates outside the United States will depend on the local market.
Groq is an Equal Opportunity Employer. We are committed to creating an inclusive environment for all employees and applicants. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, sex (including gender identity, sexual orientation, and pregnancy), age, disability, genetic information, protected veteran status, or any other characteristic protected by applicable law.
Groq complies with all applicable federal, state, and local laws governing nondiscrimination in employment. We do not tolerate discrimination or harassment based on any protected characteristic.
Groq is committed to working with and providing reasonable accommodations to qualified individuals with physical or mental disabilities. If you require a reasonable accommodation to complete an application or to participate in the hiring process, please contact us at talent@groq.com. This contact is for accommodation requests only, which will be considered on a case-by-case basis.
All offers of employment are contingent upon verification of the applicant’s identity and employment authorization in accordance with federal law.
Groq encourages people with criminal record histories to apply for employment, and values diverse experiences, including prior contact with the criminal legal system. To that end, Groq welcomes such applicants in accordance with the California Fair Chance Act, Los Angeles City Fair Chance Act Ordinance, Los Angeles County Fair Chance Act Ordinance, and San Francisco Fair Chance Act Ordinance. Philadelphia applicants can review information pertaining to Philadelphia’s Fair Criminal Record Screening Standards Ordinance here: https://www.phila.gov/documents/fair-chance-hiring-law-poster.
Req ID: INTERN-18-1