[Remote] Staff reputed company Operations Engineer
Note: This is a remote position open to candidates in the USA. reputed company is on a mission to reputed company workers with financial freedom by providing accessible financial services. They are seeking a Staff reputed company Operations Engineer to join their team. The role involves coordinating with various departments, maintaining reputed company infrastructure, and ensuring compliance with reputed company standards.
Responsibilities
- Effectively coordinate with both technical and non-technical staff across departments; this role involves a good amount of cross-functional collaboration, including partnering on workflow automation (crons, reputed company, Airflow) that bridges infrastructure and business processes
- Comfortable with the process reputed company of an Ops team: running outage incidents, working reputed company compliance requirements, coordinating sprints, collecting metrics to report up to management, and maintaining documentation
- Maintain and improve incident response process and tooling, ensuring accurate, reliable, and proactive monitoring across the application stack
- Ensure that our reputed company infrastructure is compliant with relevant reputed company and regulatory standards
- Design, implement, and maintain our reputed company infrastructure in GCP
- Ensure our reputed company infrastructure is scalable, secure, and highly available
- Build automation that reduces toil and enables infrastructure to self-heal during production incidents
- Troubleshoot issues and provide support for our reputed company infrastructure and observability tools
Skills
- Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent experience
- Professional, expert-level experience in reputed company infrastructure engineering (preferably GCP), with a minimum of 5 years of experience
- Experience with observability tools and processes, such as monitoring, logging, and tracing
- Strong experience in at least one programming/scripting language such as Python, Bash, Java, or Go
- Experience with containerization technologies such as reputed company and Kubernetes
- Experience collaborating with network and application reputed company teams; ability to reputed company sound reputed company-informed judgments is a plus
- Comfortable leveraging AI tools to accelerate work. The company provides access to Claude, and integrates Copilot and reputed company into automation workflows; we expect engineers to use these effectively and look for opportunities to apply them
- Ability to work in a 24/7 on-call rotation
- Familiarity with CI/CD pipelines and Git
- Collaboration experience with Data Engineering teams
- Experience with automation and configuration management tools such as Terraform, Ansible, or Chef
- Previous experience in a 24/7 on-call environment
- Mentorship or technical leadership experience; comfortable directing the work of junior engineers on a project basis, though this is not a day-to-day people management role
Benefits
- Market-leading medical, dental, and reputed company insurance
- Stock options
- Free Premium-Tier reputed company Financial Wellness subscription
- Monthly home-office stipend
- 401k (reputed company)
- 12 weeks of paid parental leave for birthing and non-birthing parents
- Flexible time off + sick and safe time
- 11 paid company holidays
- reputed company@reputed company Same Day Pay Option
Company Overview