Senior Site Reliability Engineer (SRE)
Collective Health
This job is no longer accepting applications
See open jobs at Collective Health.See open jobs similar to "Senior Site Reliability Engineer (SRE)" RRE.We all depend on healthcare throughout our lifetimes, for ourselves, and our families and friends, but it is notoriously difficult to navigate and understand. As an industry that comprises 20% of the US economy we think healthcare should work better for all of us. At Collective Health we believe it’s time for a new day in healthcare where as members we are informed and empowered to make the right care choices when the decisions are urgent and critical.
Infrastructure reliability is critical to Collective Health and its customers, as it’s the foundation on which our healthcare software and services are built. The Site Reliability Engineering team at Collective Health ensures the reliability, availability and performance of our deployed cloud footprint.
As a Senior Site Reliability Engineer on the team, you will play a key role in ensuring our deployed cloud footprint is highly stable and performant and meeting the expectations of external customers and users and internal stakeholders.
What you'll do:
- Establish service level indicators and data-driven objectives, and develop SRE standards and processes to uphold and improve uptime, latency, and system health.
- Define and execute initiatives to continuously improve our deployed cloud footprint in areas such as observability / monitoring, risk detection and mitigation, disaster recovery, cost optimization, and related areas.
- Collaborate across engineering and other stakeholders to ensure that key stability and maintainability requirements are understood and maintained.
- Create automation in areas such as monitoring, alerting, deployment, and others to enable scale and efficiency.
- Be part of the SRE on-call rotation, including responsibility for incident response.
- Implement best practices around incident management and root cause analysis while being part of on-call rotations.
- Provide mentorship to junior site reliability engineers on best practices.
To be successful in this role, you'll need:
- Bachelor's degree in Computer Science, Management Information Systems, or equivalent practical experience.
- 4+ years of experience in site reliability engineering focused on maintaining production-grade cloud infrastructure.
- Familiarity with a wide range of cloud-based infrastructure technologies, such as those used in container orchestration, data orchestration, business middleware, security, and governance. This includes AWS (S3, EC2, RDS, more), Kubernetes, Docker, Kafka, Jenkins, and Grafana.
- Demonstrated track record in effectively analyzing and troubleshooting large-scale distributed systems.
- Systematic problem-solving approach, coupled with effective communication skills and a sense of drive.
Pay Transparency Statement
This is a hybrid position based out of our offices: San Francisco, CA, Plano, TX, or Lehi, UT. Hybrid employees are expected to be in the office three days per week (Plano, TX) or two days per week (all other locations). #LI-hybrid
The actual pay rate offered within the range will depend on factors including geographic location, qualifications, experience, and internal equity. In addition to the salary, you will be eligible for stock options and benefits like health insurance, 401k, and paid time off. Learn more about our benefits at https://jobs.collectivehealth.com/#benefits.
About Collective Health
Collective Health is the leading health benefits platform that brings together medical, dental, vision, pharmacy, and program partners into an integrated solution that better enables employees and their families to understand, navigate, and pay for healthcare. By reducing the administrative lift of delivering health benefits, providing an intuitive member experience, and helping control costs and improve outcomes, the company guides employees toward healthier lives and companies toward healthier bottom lines.
We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. Collective Health is committed to providing support to candidates who require reasonable accommodation during the interview process. If you need assistance, please contact recruiting-accommodations@collectivehealth.com.
Privacy Notice
For more information about why we need your data and how we use it, please see our privacy policy: https://collectivehealth.com/privacy-policy/.
This job is no longer accepting applications
See open jobs at Collective Health.See open jobs similar to "Senior Site Reliability Engineer (SRE)" RRE.