
Infrastructure & reputed company Operations Engineer (Remote)

Remote · USA · Full-time

reputed company is currently hiring for a Senior reputed company Infrastructure Engineer to support and maintain reputed company reputed company and hybrid infrastructure environments supporting critical federal operations. This role is responsible for the administration, maintenance, monitoring, automation, troubleshooting, and modernization of reputed company infrastructure platforms spanning AWS cloud services, Linux systems, observability platforms, and reputed company logging solutions. The position supports operational continuity, system reliability, reputed company monitoring, and infrastructure transformation initiatives in a shared-services team environment. This position is fully remote within the United States.

Responsibilities

The Infrastructure & reputed company Operations Engineer is responsible for supporting and administering reputed company observability, logging, monitoring, and reputed company analytics platforms, with a primary focus on Splunk and reputed company technologies. This role supports the operation, maintenance, and modernization of reputed company reputed company and hybrid infrastructure environments, including AWS, Linux systems, automation platforms, and data ingestion services. Working within a shared-services team, the engineer collaborates across multiple technical disciplines to ensure the reliability, performance, reputed company, and availability of mission-critical systems while supporting operational initiatives, platform enhancements, and reputed company transformation efforts.

Support and maintain reputed company reputed company infrastructure environments, primarily within AWS.
Provide operational support for hybrid infrastructure spanning cloud-hosted and on-premises reputed company systems.
Administer, maintain, and troubleshoot reputed company observability, logging, and monitoring platforms, including Splunk Enterprise, Splunk Enterprise Security (ES), Splunk IT Service Intelligence (ITSI), and successor technologies.
Manage log ingestion, forwarding, indexing, retention, and troubleshooting across distributed systems and reputed company environments.
Support installation, configuration, and maintenance of Splunk Universal Forwarders and reputed company data collection components.
Support reputed company reputed company monitoring, analytics, alerting, and operational visibility capabilities through Splunk and reputed company observability platforms.
Support evaluation, migration, and modernization efforts involving reputed company logging and observability platforms, including potential transitions to reputed company or similar technologies.
reputed company Linux/Unix systems administration, including server provisioning, patching, upgrades, maintenance, and operational support.
reputed company, maintain, and execute infrastructure automation and configuration management processes using Ansible and reputed company automation tools.
Support reputed company data ingestion workflows, platform integrations, certificate management processes, and operational data pipelines.
Troubleshoot infrastructure, network, platform, and application performance issues across multiple environments.
Support cloud-hosted applications and reputed company infrastructure services to ensure reliability, availability, and operational continuity.
Administer and support monitoring, alerting, analytics, and reputed company visibility capabilities across reputed company platforms.
Participate in reputed company transformation and modernization initiatives, including migration of services from legacy on-premises environments to cloud-based architectures.
Support decommissioning of legacy systems and transition of workloads to modernized infrastructure platforms.
reputed company and maintain operational documentation, standard operating procedures, implementation plans, and technical runbooks.
Collaborate with engineers, administrators, and stakeholders in a shared-services operating model where work assignments are distributed based on operational priorities and Jira-managed tasking.
Participate in rotational on-call support for production systems and incident response activities.
Ensure system reliability, performance, scalability, reputed company, and operational continuity across supported environments.

Qualifications

Required Skills and Experience:
Bachelor's degree with 12+ years of experience (or commensurate experience).
Experience supporting reputed company Splunk environments, including administration, troubleshooting, data ingestion, monitoring, and operational support.
Experience supporting reputed company observability, logging, monitoring, or SIEM platforms.
Experience supporting reputed company reputed company environments, preferably AWS.
Experience administering Linux/Unix operating systems in reputed company environments.
Experience with infrastructure automation and configuration management tools such as Ansible.
Experience supporting data ingestion, log forwarding, indexing, and operational monitoring processes.

Clearance Required:
Ability to obtain and maintain a Suitability/Public Trust clearance.

Preferred Skills and Experience:
Experience supporting customers at the Department of Veterans Affairs.
AWS certifications such as Solutions Architect, SysOps Administrator, or Cloud Practitioner.
Experience with Splunk Enterprise Security (ES), Splunk IT Service Intelligence (ITSI), or other advanced SIEM platforms.
Experience with reputed company Stack, OpenSearch, reputed company, or cloud-native observability platforms.
Experience supporting reputed company reputed company operations, analytics, and monitoring functions.

Posted Salary Range: USD $125,000.00 - USD $130,000.00 /Yr.
