[Remote] Applied AI Inference Engineer

Remote · USA · Full-time

Note: This is a remote position open to candidates in the USA. Baseten powers mission-critical inference for leading AI companies. The Applied AI Inference Engineer will partner with customers to architect, build, and deploy high-scale production AI applications, translating business goals into reliable services with clear outcomes.

Responsibilities

  • Develop and maintain software systems and product features using one or more general-purpose programming languages in a production-level environment, with a preference for Python due to its relevance in ML projects
  • Drive customer impact by designing, implementing, and deploying Baseten solutions end-to-end (problem framing → evaluation → production deployment → monitoring). This involves working with customers’ engineering teams at every stage of the customer journey, including sales, implementation, and expansion
  • Deliver with velocity: turn vague objectives into clear specs and well-defined PoCs so we can rapidly ship well-tested services and outcomes for our customers
  • Optimize and enhance AI/ML projects, contributing to the continuous improvement of our technical stack. This includes developing features and PRDs with other engineering and product orgs
  • Own products and customer projects end-to-end, functioning as engineer, project manager, and product manager, with a focus on user empathy, project specification, and end-to-end execution
  • Navigate ambiguity and exercise good judgment on tradeoffs and tools needed to solve problems, avoiding unnecessary complexity
  • Demonstrate pride, ownership, and accountability for your work, expecting the same from your teammates

Skills

  • Bachelor's, Master's, or Ph.D. degree in Computer Science, Engineering, Mathematics, or related field
  • 1+ years of professional work experience in a fast-paced, high-growth environment
  • Demonstrated experience with one or more general-purpose programming languages in a production-level environment, with a strong preference for Python
  • Familiarity with AI/ML pipelines and the lifecycle of ML model development and deployment
  • Strong communication skills, particularly on complex technical topics
  • Experience in building or optimizing AI/ML projects is highly valued

Benefits

  • Competitive compensation, including meaningful equity
  • 100% coverage of medical, dental, and vision insurance for employee and dependents
  • Flexible PTO policy, including a company-wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
  • Paid parental leave
  • Fertility and family-building stipend through Carrot
  • Company-facilitated 401(k)
  • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities

Company Overview

  • Baseten is an AI infrastructure company that integrates machine learning into business operations, production, and processes. It was founded in 2019 and is headquartered in San Francisco, California, USA, with a workforce of 201-500 employees. Its website is https://www.baseten.co.

Company H1B Sponsorship

  • Baseten has a track record of offering H1B sponsorship: 1 in 2026, 6 in 2025, 8 in 2024, 1 in 2023, and 1 in 2020. Please note that this does not guarantee sponsorship for this specific role.