Back to Jobs

Observability Engineer

Optum (UnitedHealth Group) Minnetonka, MN, US
Posted 4 months, 1 week ago
Deadline: Not specified
Full Time Senior Engineering Remote

For those who want to invent the future of health care, here’s your opportunity. We’re going beyond basic care to health programs integrated across the entire continuum of care. Join us to start Caring. Connecting. Growing together. 

 

OptumServe Enterprise Monitoring team is looking for an Observability Engineer. The team is responsible for enterprise infrastructure, application, and network monitoring for on-prem, hybrid, and various Clouds. The selected candidate will be joining a team of skilled engineers with a broad background in enterprise monitoring and Observability.  As an Observability Engineer, this role is focused on maintaining the reliability, scalability and availability of our Log management solution as well as our Metrics and Observability platform which heavily uses automation (terraform, Ansible and scripts), this role requires maintaining performance KPI of our solutions and defining their SLOs.

 

Application Deadline: This will be posted for a minimum of 2 business days or until a sufficient candidate pool has been collected. Job posting may come down early due to volume of applicants.

 

 

Requirements

2+ years of experience working directly with monitoring tools as either an Admin, SME or as an Architect, preferably with Dynatrace and/or ELK
2+ years of experience with Dynatrace (managed, cloud as well as offline, with full scope of best practices and setup as it relates to Active gate, cloud, on-prem and custom with workflows), or with Elastic on-prem and cloud with best practices around the platform
1+ years of experience with designing data pipelines using filebeat, Logstash and/or fluentbit/fluentd
1+ years of AI expertise as it relates to Observability to reduce the amount of work, and make our products more reliable and resilient
1+ years of experience writing scripts in languages like Python and (Bash or powershell) to automate tasks
1+ years of experience working with Linux OS
United States Citizenship
If you are offered this position, you will be required to provide extensive personal information to obtain and maintain a suitability or determination of eligibility for a Confidential/Secret or Top Secret security clearance as a condition of your employment

OptumCare is a drug-free workplace. Candidates are required to pass a drug test before beginning employment

Responsibilities

Maintain and deploy monitoring and alerting
Design, configuration and maintenance of log aggregation solution at a large scale
Set up and manage ingestion pipelines and data transformations
Have the mindset of “automate any task”
Monitoring and Alerting: Build and maintain robust monitoring systems using tools like Elk, Dynatrace, Prometheus, OTEL and Grafana to detect potential issues early and trigger alerts for timely response
Maintain associated documentation as it applies to our audit and certification requirements
Participate in troubleshooting, capacity planning, and performance analysis activities
Research new monitoring requirements and in many cases write code for that
Medium to expert level in setting up AI rules for tools like DavisAI (Dynatrace) and/or Elastic GenAI
Solid expertise in setting up monitoring policies/rules/templates; and writing scripts to accomplish monitoring requirements

Company Size
1000+ employees
Employment Type
Full Time
Work Mode
Remote (Minnetonka, MN, US)
Apply Externally
Notice: You are about to leave RemoteWok and apply on an external site.
The application process will continue on the employer's website.
View Company Profile