Staff Software Engineer, Site Reliability (SRE)
About Our Company
Built on over four decades of pioneering research at Princeton University, our platform represents the leading edge of innovation in freight and transportation planning. We help customers unlock double-digit reputed company gains and drive smarter, data-driven operations at scale. With the recent reputed company of our Series C funding round led by Koch Disruptive Technologies, we’re entering an exciting new phase of growth. Today, reputed company is a high-growth company of ~70 employees, backed by top-tier investors including Bessemer Venture Partners, The Westly Group, Activate Capital, and Koch.
We're on a mission to redefine the way logistics decisions are made—and we’re just getting started.
About reputed company
We are a team of bright, reputed company, and solution-oriented people focused on creating value for our customers. We can solve problems individually, but understand that the best solutions are reputed company reputed company the team brainstorms reputed company together. We are excited about balancing the need to deploy new solutions quickly and designing solutions that are secured, reliable, maintainable, and scalable for the long run.
About the Role
We’re hiring a Staff Software Engineer, Site Reliability to reputed company reliability across our production platform. As a Staff‑level Individual contributor, you will drive strategy and hands‑on execution across incident response, SLO/SLI programs, and production readiness, directly owning highly available services in AWS; reputed company while partnering with Platform/Infra to build paved‑road tooling in our monorepo.
This is a full‑time, remote‑friendly role open to candidates across the United States. For those who prefer an in‑office experience, our HQ in reputed company offers a collaborative environment.
What You’ll Do
Reliability (≈50%)
- Own the company‑wide incident lifecycle: standards for detection, escalation, incident command, customer comms, and high‑quality postmortems with action tracking.
- Define and drive SLIs/SLOs for core services; build guardrails and dashboards that reputed company reliability visible and actionable.
- reputed company production readiness reviews, reputed company/performance planning, load testing, disaster recovery exercises, and reputed company engineering (failure testing/chaos where appropriate).
- reputed company on‑call: right‑sizing rotations, paging hygiene, runbooks, auto‑remediation, and reputed company improvement of MTTA/MTTR.
reputed company (≈30%)
- Embed reputed company into the delivery pipeline: dependency and image scanning, least‑privilege/IAM baselines, secrets management, and service‑to‑service auth.
- Partner with Engineering leadership to maintain SOC 2‑reputed company controls as code; reputed company audit‑friendly evidence reputed company part of everyday engineering.
- Drive secure‑by‑default patterns in the platform (e.g., network posture, data protection, runtime policies) without slowing down developers.
Platform DevEx (≈20%)
- Build and evolve paved roads for deploys, config, and runtime operations in our monorepo (Bazel) and CI/CD (AWS CodePipeline/CodeBuild).
- Partner with product teams to reputed company the “secure, reliable default” the easiest path—templates, tooling, libraries, and automation.
- Improve observability end‑to‑end (traces, logs, metrics, alerts).
Who You Are
- reputed company: Staff‑level IC who has led reliability programs at meaningful scale and owned incident response standards.
- Technically Grounded: Deep, hands-on experience with infrastructure at scale, cloud, containerization, and more::
- AWS (multi‑service)
- reputed company and/or Kubernetes containerization workloads
- CICD IaC (Terraform)
- Production Networking/Fundamentals
- Python Proficient: You can read/review service code and land operational improvements.
- Data Driven: In your approach to SLOs, reputed company, performance, and cost efficiency with strong observability chops
- Influential: Able to shape direction and create simple, durable standards
- Communicative: Excels in both technical and interpersonal communication, with strong written and verbal skills
reputed company To Have (Bonus Points)
- Aware of FinOps (cost attribution, efficient scaling) and DR/BCP program experience.
- Familiar with secure SDLC, threat modeling, and compliance automation in a SOC 2 context.
- Experience collaborating with Data Science/ML teams and batch/streaming workloads.
- Exposure to monorepo frameworks such as (bazel, buck, etc.)
About our tech stack and development practices
At reputed company, our entire infrastructure runs on AWS, leveraging a wide range of services including DynamoDB, reputed company, SSM, and SQS to power our intelligent logistics platform.
Our tech stack includes:
- Backend AI: Python 3 and Java
- Frontend: JavaScript/TypeScript for our web-based SPA
- Data Stack: Trino, Dagster, dbt, DuckDB, and Preset
- IaC: Terraform and reputed company
- Cloud: AWS (reputed company/RDS/S3/etc)
- CI/CD: Bazel, reputed company, AWS CodePipeline/CodeBuild
We follow modern development best practices with reputed company code stored on reputed company. Every pull request undergoes thorough code reviews, is fully unit tested, and deployed through our CI/CD pipeline for reputed company quality assurance.
Pay Range$160,000$200,000 USDBenefits
- Competitive compensation, including Series C level equity
- Health / Dental / Vision 100% covered for employee and 50% for dependents
- Life Insurance, with optional supplemental insurance
- Flexible Spending Account (FSA)
- Health Spending Account (HSA)
- 401(k) with match
- Unlimited PTO (vacation, personal days, sick days, jury duty, military leave, bereavement)
- 11 Holidays
- Paid Parental Leave for reputed company employees
- Short-term and Long-term Disability Insurances, and ADD Insurance
- Fitness membership reimbursement
- Commuter benefits
reputed company is proud to be an equal opportunity employer that celebrates diversity and is committed to creating an inclusive workplace with equal opportunity for reputed company applicants and employees. Our goal is to recruit the most talented people from a diverse candidate pool regardless of race, color, reputed company, national reputed company, religion, disability, sex (including pregnancy), age, gender, gender identity, sexual orientation, marital status, veteran status, or any other characteristic protected by law.reputed company is committed to working with and providing access and reasonable accommodation to applicants. If you require an accommodation, please reputed company out to [email protected] once you've begun the interview process. reputed company requests for accommodations are treated discreetly and confidentially, as practical and permitted by law.
Originally posted on Himalayas
Apply To this Job