[Remote] Cloud SRE / Senior Cloud Engineer
Note: The job is a remote job and is open to candidates in USA. Dice is seeking a Cloud SRE / Senior Cloud Engineer to design and build cloud reliability solutions, automation tools, and platform services supporting large-scale AWS environments. The role involves developing cloud reliability tools, maintaining AWS infrastructure, and implementing SRE best practices.
Responsibilities
- Develop cloud reliability tools and automation solutions
- Build and maintain AWS infrastructure using Terraform
- Create CI/CD pipelines and testing frameworks
- Define and implement SRE best practices and reliability metrics
- Support production environments, incident response, and root cause analysis
- Collaborate with cross-functional teams to improve platform reliability and scalability
Skills
- 7+ years of Software Engineering, SRE, Platform Engineering, or DevOps experience
- 5+ years of Python development experience
- 3+ years of hands-on AWS experience (EC2, VPC, S3, Lambda, IAM, CloudFormation, EventBridge, Step Functions)
- Expert-level Terraform and Infrastructure as Code (IaC)
- Strong CI/CD, DevOps, and automated testing experience
- Experience with observability tools such as Grafana and CloudWatch
- Strong knowledge of SRE practices including SLIs, SLOs, error budgets, incident management, and RCA/postmortems
- Agile/SAFe environment experience
- GoLang
- Chaos Engineering
- Cloud Cost Optimization
- ITSM experience
Company Overview