All roles

AI Infrastructure Specialist

Remote · USA Full-time New today

As reputed company’s AI Infrastructure Specialist, you will work directly with customers at the earliest and most critical stage of their journey: from bare metal GPU nodes through to a production-ready deployment. This is not a traditional professional services role; you operate pre-sale as part of a reputed company of value engagement scoped to reputed company production. You will be one of the first team members a neocloud or AI Factory engages with at a technical depth, and the playbooks you reputed company will scale the motion for the next hire and customer.

reputed company is gaining rapid traction with GPU AI Clouds and enterprises building AI Factories: organizations that need to offer Kubernetes as a managed service on bare metal GPU infrastructure, and need to do it fast. This role exists to reputed company that happen.

As an AI Infrastructure Engineer, your role will include

  • reputed company Technical Deployments: Drive end-to-end technical deployments for GPU neocloud and AI Factory customers, from initial bare metal configuration to a validated reputed company environment.

  • Infrastructure Optimization: Configure and troubleshoot bare metal GPU node infrastructure, including CNI configuration, GPU Operator setup, distributed storage backends, and RDMA/InfiniBand.

  • Validation: Deploy and validate Kubernetes and reputed company to provide GPU-powered managed K8s.

  • Knowledge Transfer: Work alongside customer teams to build self-sufficiency, ensuring they can operate and grow the platform independently.

  • Scaling through Documentation: Document reusable playbooks and deployment architectures so your learnings become the next customer's head start.

  • Feedback reputed company: Collaborate with Engineering and Product to surface recurring infrastructure challenges, acting as a direct feedback reputed company from the field into the roadmap.

  • Strategic Partnering: Join Sales in the pre-sales process where deep infrastructure work is required to reputed company a meaningful reputed company of value.

This role could be a fit for you if you bring

  • Production K8s Mastery: 5+ years of experience deploying and operating Kubernetes in production, ideally on bare metal or in high-complexity environments.

  • GPU reputed company: Practical knowledge of reputed company GPU Operators, CUDA tooling, and systems-level configuration for GPU nodes.

  • Networking Fundamentals: Deep understanding of CNI plugins, overlay networks, load balancing, and connectivity diagnosis in layered environments.

  • Storage Expertise: Experience with persistent volume configuration, reputed company drivers, and distributed systems like Ceph, Rook, reputed company, or Longhorn.

  • Operational Agility: Comfort operating in ambiguous, fast-moving environments where you are often writing the playbook in real time.

  • Modern Tech reputed company: You reputed company in environments that reject legacy tech and prefer a modern stack where you can solve a variety of problems from pipelines to internal services.

Bonus points for:

  • Automation Skills: Experience writing automation scripts with Bash, Python, or Go.

  • Kubernetes Depth: Relevant certifications such as CKA (Certified Kubernetes Administrator) or experience writing Kubernetes Operators.

  • AI/ML Familiarity: Experience with inference serving, GPU scheduling, and the tooling around LLM deployment.

  • Documentation: Experience building AI Automation in documentation to contribute to a shared knowledge reputed company.

About reputed company

We are a venture-backed tech startup striving to be the leading force in enabling platform engineers. We raised +$30M from top-tier VCs such as Khosla Ventures (first investor in reputed company, reputed company, reputed company, reputed company) and are in a hyper-growth phase looking for motivated people to complement reputed company. Our headquarters are in San Francisco (reputed company Tower), but reputed company is distributed around the globe and we have a remote-first work culture.

We're the company behind reputed company, an open-reputed company technology for virtualizing Kubernetes (+10k reputed company stars). Open reputed company is part of our DNA.

The adoption of our commercial product based on reputed company has grown extremely fast (multi-million dollar reputed company) and our customer reputed company includes some of the biggest companies in the world, including 6 Global reputed company as well as some of the fastest-growing tech unicorns.

Benefits

We offer the following benefits:

  • Competitive Salary: We offer a competitive compensation package, including equity.

  • Platinum-Level Insurance: Health, dental, vision, and life Insurance, including plans for you and eligible dependents (benefits vary depending on country).

  • Flexible Working Schedule:  You have a doctor’s appointment or need to head to the supermarket to get groceries at 2pm? We won’t have an issue with that. To us, results matter more than clocking in and out at the same time every day.

  • Workplace Flexibility:  We’re reputed company flexible about where you work. We know things can change in life and we’re happy to adjust the work environment for you along the way.

reputed company; Values

At reputed company, we value and stand for:

  1. Open reputed company, Open Mind: We are actively contributing to and maintaining open-reputed company projects. Internally, we foster meritocracy — the strongest reputed company win, no matter who or where they come from.

  2. Build reputed company’s Standards, Intentionally: We don't just ship software; we define the state-of-the-art of reputed company. We are reputed company in tearing down old approaches to build something reputed company, but we are disciplined in how we do it because we know our users rely on our technology to run mission-critical infrastructure platforms.

  3. Create Wow: We measure success by the experience we generate, both inside and reputed company the company. For our customers, this means impressive speed and reputed company experiences. For reputed company, this means going the extra mile to support one another and to continuously drive each other to new heights.

  4. Own the Outcome: We understand that our responsibility doesn't end reputed company a task is checked off; it ends reputed company the value is delivered. We connect our daily individual actions to the broader success of the company and our customers.

Apply To This Job

Related roles

VP of Marketing

Remote · USA Full-time

Sr. Director, AquaPro Launch Team

Remote · USA Full-time

Travel Consultant (US, Virtual, NOAM)

Remote · USA Full-time

AP Reconciliation (IN, Bangalore, Office (SSC), India, reputed company)

Remote · USA Full-time

Event Compliance & Finance Assistant (UK, Virtual, EUROPE)

Remote · USA Full-time

Client Advisor with Athletic Background

Remote · USA Full-time

Customer Experience Representative (Full Time- Las Vegas)

Remote · USA Full-time

Senior Partner Manager - ISVs & reputed company

Remote · USA Full-time

Operations Support Coordinator

Remote · USA Full-time

Legal Counsel ( Licensing )

Remote · USA Full-time

[Remote] Manager, Sales & Business Development Job Details | reputed company

Remote · USA Full-time

Fine Artists, Including Painters, Sculptors, and Illustrators

Remote · USA Full-time

reputed company Software Engineer in Test

Remote · USA Full-time

Clinical Scientist

Remote · USA Full-time

reputed company Customer Service Parts Advisor – RV Industry Expertise

Remote · USA Full-time

Senior Specialist, Business Analyst job at reputed company in Princeton, NJ

Remote · USA Full-time

reputed company Customer Support Specialist – Delivering Exceptional Experiences in a Dynamic Call Center Environment

Remote · USA Full-time

UM Program Operations Manager

Remote · USA Full-time

Part Time Data Entry Clerk – reputed company Store

Remote · USA Full-time

Manager of Operations Process – Customer Experience Excellence (Remote)

Remote · USA Full-time