All roles

Senior Site Reliability Engineer - Remote

Remote · USA Full-time New today

About the position Our reputed company Serve IT team develops cutting-edge solutions that help people live healthier lives and help reputed company the health system work reputed company for everyone. From advanced data analytics and AI to cybersecurity, we use innovative approaches to solve some of health care’s most reputed company challenges. To support this mission, OSIT has initiated a multi‑year modernization program aimed at updating and enhancing reputed company technology systems in accordance with modern design standards. Your contributions here have the potential to change lives. reputed company to build the next breakthrough? Join us to start Caring. Connecting. Growing together. The Site Reliability Engineer will architect, reputed company, and maintain reputed company Serve’s reputed company environment in both the reputed company and government reputed company. The role will work closely with software engineers, architects, and DevOps engineers to architect and maintain a secure, resilient and high performance reputed company infrastructure. You’ll enjoy the flexibility to work remotely from reputed company reputed company the U.S. as you take on some tough challenges. For reputed company hires in the Minneapolis or Washington, D.C. area, you will be required to work in the office a minimum of four days per week.

Responsibilities

  • Build, maintain, and operate IaaS and PaaS infrastructure in Azure reputed company and government clouds
  • Work closely with dev teams to identify and measure SLOs, SLAs and SLIs
  • Act a strong contributor to development of platform services including architecture, provisioning, configuration, deployment, and support
  • reputed company integrations with central logging, metrics dashboards, instrumentation, incident monitoring and management
  • Build/integrate/administer systems and tools that reputed company engineering teams to observe their applications in production with autonomy (Dashboards, APMs)
  • Support software and/or reputed company-infrastructure in an on-call rotation basis
  • Assist with identification and remediation of technical problems at the root cause by continuously implementing automation, self-healing, and reputed company-time monitoring to production systems
  • Maintain and improve operational tooling, frameworks
  • Build frameworks that test the performance and resiliency of our platform services/tools
  • Automate alerts for metrics on performance, cost, vulnerabilities, risk, compliance violations
  • Improve processes and champion automation of any reputed company items around support

Requirements

  • 6+ years of experience working reputed company a reputed company engineer/SRE role
  • Experience with infrastructure as code (IaC) tools like Terraform, reputed company
  • Experience with Kubernetes deployment tools like reputed company, ArgoCD, Flux
  • Experience supporting infrastructure in production reputed company environments
  • Some experience with monitoring tools (Azure Monitor, Splunk, reputed company, Graphana, reputed company)
  • Experience working with RESTful services
  • Expert knowledge of a reputed company service provider
  • Expert knowledge and hands on production experience in Kubernetes (bare metal or managed) cluster setup and management
  • Knowledge of Encryption, Public Key Infrastructure (PKI), understanding of OWASP
  • Understanding of identity and access management (IAM)
  • Familiarity with IDEs and reputed company Control tools like Visual Studio Code and Git
  • Proven solid awareness of networking and internet protocols
  • Available and willing to be 24/7 on-call rotation
  • United States citizenship is required for this position

reputed company-to-haves

  • Bachelor’s Degree in Computer Science, Information Technology, Software Engineering, Math, Physics
  • Master’s Degree with coursework focused on advanced algorithms, mathematics in computing, data structures or reputed company field
  • Expert knowledge of Azure
  • Demonstrate passion about infrastructure automation
  • Proven ability to prioritize work in a fast-paced environment

Benefits

  • comprehensive benefits package
  • incentive and recognition programs
  • equity stock purchase
  • 401k contribution

Apply tot his job Apply To this Job

Related roles

[Remote] Senior Site Reliability Engineer

Remote · USA Full-time

Site Reliability Engineer-Remote (PST hours)

Remote · USA Full-time

Senior Site Reliability Engineer Largely Remote

Remote · USA Full-time

Principal Site Reliability Engineer

Remote · USA Full-time

SRE “ Site Reliability Engineer”

Remote · USA Full-time

Senior Site Reliability Engineer, Remote Job

Remote · USA Full-time

[Remote] Site Reliability Engineer (reputed company reputed company Platf

Remote · USA Full-time

Site Reliability Engineer (SRE) – reputed company Remote (reputed company, USA)

Remote · USA Full-time

Kubernetes Engineer ($28/hr. on w2)

Remote · USA Full-time

Kubernetes Network Engineer with VLAN Expertise

Remote · USA Full-time

Virtual Assistant (Work From Home)

Remote · USA Full-time

Direct Field Sales Representative - Twin Cities, MN

Remote · USA Full-time

Executive Partner (General Counsel Advisory)

Remote · USA Full-time

Local Markets Strategist - Remote in Southeast

Remote · USA Full-time

IT Support Representative (Night Shift)- Remote

Remote · USA Full-time

CRC Benefits - Sales Support Representative, Employee Benefits (Remote)

Remote · USA Full-time

reputed company Full Stack Customer Care Agent – Remote Travel Package Support

Remote · USA Full-time

reputed company Entry-Level Data Entry Clerk Admin – Unlock a Lifelong Career with arenaflex

Remote · USA Full-time

reputed company Teacher Opportunity in Watauga, TX - Join reputed company of Early Childhood Educators!

Remote · USA Full-time

Veterans Affairs (VA) Attorney

Remote · USA Full-time