Calling the bold.

Calling the bold.

Uncork-backed companies are hiring explorers, builders, and operators ready to help chart new territory.

Uncork-backed companies are hiring explorers, builders, and operators ready to help chart new territory.

Staff Infrastructure Engineer

Groq

Groq

Other Engineering
Palo Alto, CA, USA
USD 132,100-279,800 / year + Equity
Posted on Sep 17, 2025
Staff Infrastructure Engineer
Palo Alto, CA
Compute, Storage & Eng Infra
Remote
Full-time
About Groq
Groq delivers fast, efficient AI inference. Our LPU-based system powers GroqCloud™, giving businesses and developers the speed and scale they need. From our Bay Area roots to our growing global presence, we are on a mission to make high performance AI compute more accessible and affordable. When real-time AI is within reach, anything is possible. Build fast.
Staff Infrastructure Engineer
Mission:
At Groq, we are building a custom cloud from the ground up - one data center at a time. Our Compute Storage team owns the systems that turn racks of bare metal into production-ready Kubernetes clusters powering the next generation of AI workloads.
We are looking for a Staff Infrastructure Engineer to help us scale this effort. This is a hands-on role focused on fully automating deployment and lifecycle management of the Groq Cloud server fleet. You will work closely with DC, network and platform teams to define and develop tools and automation that enable seamless deployment and management of Groq compute nodes and storage clusters. We're looking for someone passionate about infrastructure who enjoys debugging close to the metal. If you're eager to grow your skills in deploying, scaling, and optimizing bare metal to support complex distributed HPC in the expanding inference market – we would love to talk.
Responsibilities & Opportunities in this Role:
  • Develop robust, scalable automation solutions (Go, Python, Bash) to streamline and standardize deployment workflows across global data center environments.
  • Be part of large cross-functional collaboration with data center operations, networking, and platform teams, ensuring infrastructure is fully integrated and production-ready.
  • Develop automation to ensure all production machines and clusters consistently meet optimal health standards in a timely manner.
  • Define best practices and standards for infrastructure-as-code and configuration management using Git, Flux, Terraform, and related tools.
  • Set technical direction and maintain high-quality system documentation, operational runbooks, and internal tooling that improve the resilience, repeatability, and observability of the infrastructure stack.
Ideal candidates have/are:
  • Experience with deploying and supporting Linux / Kubernetes systems at scale.
  • Familiarity with infrastructure-as-code and Git-based workflows (e.g., Terraform, Flux, Kustomize).
  • Ability to write and maintain basic tooling in common modern languages such as Go and Python.
  • Understanding of networking fundamentals (IPAM, VLANs, DHCP, DNS).
  • Working knowledge of storage concepts (block vs object, NFS, RAID, etc.).
  • Strong sense of ownership and a willingness to work through ambiguity.
Nice to Have:
  • Experience provisioning physical machines in a data center environment.
  • Exposure to Talos Linux, Kubernetes bootstrapping, or Kubernetes platform engineering.
  • Previous collaboration with facilities, hardware, or network teams in an operational role.
Attributes of a Groqster:
  • Humility - Egos are checked at the door
  • Collaborative & Team Savvy - We make up the smartest person in the room, together
  • Growth & Giver Mindset - Learn it all versus know it all, we share knowledge generously
  • Curious & Innovative - Take a creative approach to projects, problems, and design
  • Passion, Grit, & Boldness - no limit thinking, fueling informed risk taking
Compensation: At Groq, a competitive base salary is part of our comprehensive compensation package, which includes equity and benefits. For this role, the base salary range is $132,100 to $279,800 determined by your location, skills, qualifications, experience and internal benchmarks. This range is specific to roles in the United States, compensation for candidates outside the USA will be dependent on the local market.
#LI-Remote
Groq is an Equal Opportunity Employer. We are committed to creating an inclusive environment for all employees and applicants. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, sex (including gender identity, sexual orientation, and pregnancy), age, disability, genetic information, protected veteran status, or any other characteristic protected by applicable law.
Groq complies with all applicable federal, state, and local laws governing nondiscrimination in employment. We do not tolerate discrimination or harassment based on any protected characteristic.
Groq is committed to working with and providing reasonable accommodations to qualified individuals with physical or mental disabilities. If you require a reasonable accommodation to complete an application or to participate in the hiring process, please contact us at talent@groq.com. This contact is for accommodation requests only, which will be considered on a case-by-case basis.
All offers of employment are contingent upon verification of the applicant’s identity and employment authorization in accordance with federal law.
Groq encourages people with criminal record histories to apply for employment, and values diverse experiences, including prior contact with the criminal legal system. To that end, Groq welcomes such applicants in accordance with the California Fair Chance Act, Los Angeles City Fair Chance Act Ordinance, Los Angeles County Fair Chance Act Ordinance, and San Francisco Fair Chance Act Ordinance. Philadelphia applicants can review information pertaining to Philadelphia’s Fair Criminal Record Screening Standards Ordinance here: https://www.phila.gov/documents/fair-chance-hiring-law-poster.
Req ID: 515