Staff Datacenter Network Engineer
Mission: As a Datacenter Network Engineer at Groq, you will design, build, and deploy Groq’s global datacenter network fabric to meet the highest standards of availability, efficiency, and automation. Your work will ensure a robust, scalable, and future-ready infrastructure that directly supports Groq’s mission to deliver fast, cost-efficient inference services.
Location: If you're located near one of our offices in Palo Alto, California, or Toronto, Canada, you might be asked to work hybrid.
Responsibilities & opportunities in this role:
- Network Fabric Architecture and Design: Design end‑to‑end network topologies for new and existing data centers, covering IPv4/IPv6, BGP, OSPF, MPLS, and high‑availability fabric. Provide a scalable, fault‑tolerant foundation that keeps inference pipelines running 24/7.
- Hardware Deployment & Optimization: Deploy, configure, and tune Cisco, Juniper, and Arista switches, routers, and firewalls at massive scale. Ensure optimal throughput, low jitter, and predictable performance for ML workloads.
- Documentation and Bill‑of‑Materials (BOM) Management: Create and maintain accurate BOMs for all fabric components (switches, line cards, transceivers, cables, racks). Validate BOMs against design specifications and update them as hardware revisions or new models are introduced.
- Cross‑Functional Collaboration: Partner with Data Center Engineering teams (power, cabling, floor layouts) to understand traffic profiles and future scaling needs.
Ideal candidates have/are:
- Hardware Proficiency – Hands‑on experience with Cisco Nexus, Juniper QFX, and Arista 7500/7800 series switches, including line‑card and transceiver management.
- GPU Clusters – RoCEv2
- BOM & Procurement Skills – Experience creating and maintaining BOMs, coordinating with procurement, and managing inventory.
- Automation & Scripting – Strong background in Ansible, Terraform, Python, and vendor APIs for configuration management.
- Monitoring & Telemetry – Familiarity with Prometheus and OpenTelemetry for port‑level visibility.
- Soft Skills – Excellent analytical, communication, and collaboration abilities; comfortable mentoring and documenting complex designs.
- Bonus – Experience with ML traffic patterns, SD‑WAN, or contributions to open‑source fabric projects.
Why Join Us
- Purposeful Hiring: You’re not here by accident, and neither is anyone else. Every teammate is handpicked with intention because who we build matters.
- Builders Wanted: You’re not just riding the rocket ship, you’re building it. Your work directly shapes the trajectory of our company.
- Mission-Driven Work: We’re here to make a real impact. Our mission fuels everything we do.
- Tackling Hard Problems: If easy isn’t your thing, you’re in the right place. We solve some of the most complex and exciting challenges in our space.
- Excellence Is The Standard: High performance isn’t just encouraged, it’s the baseline. And it’s contagious.
If this sounds like you, we’d love to hear from you!
Compensation:
USA
At Groq, a competitive base salary is part of our comprehensive compensation package, which includes equity and benefits. For this role, the base salary range is $203,200 to $239,100, determined by your location, skills, qualifications, experience, and internal benchmarks. This range is specific to roles in the United States; compensation for candidates outside the USA will depend on the local market.