Lead, Site Reliability Engineer

Oak Street Health Inc. Chicago, IL 60602

Posted 1 week ago

Company: Oak Street Health

Title: Lead Engineer, Site Reliability

Location: Chicago or Remote

Role Description:

As a Lead Engineer, Site Reliability (SRE), you will lead the design, implementation, and maintenance of highly available and scalable systems. You will draw on your extensive experience to drive best practices, mentor junior team members, and collaborate with cross-functional teams to ensure the reliability and performance of our infrastructure.

Key Responsibilities:

  • Lead the design, implementation, and maintenance of scalable and reliable systems and applications.

  • Provide technical leadership and guidance to the SRE team, including mentoring junior engineers and fostering a culture of continuous learning and improvement.

  • Collaborate with development, operations, and other cross-functional teams to define and implement SRE best practices, standards, and processes.

  • Architect and implement monitoring and observability solutions using Grafana and other tools to ensure proactive detection and resolution of issues.

  • Oversee Azure infrastructure management, including resource provisioning, configuration, and optimization.

  • Develop and execute comprehensive performance and load testing strategies to identify and address bottlenecks and optimize system performance.

  • Drive automation efforts by developing and maintaining scripts and tools using PowerShell, JavaScript, and other scripting languages.

  • Implement systems integration solutions to enable seamless communication and interoperability between different systems and services.

  • Lead incident response efforts during outages or incidents, ensuring timely resolution and minimizing downtime.

  • Document architectures, processes, and procedures to facilitate knowledge sharing and ensure system reliability.

Qualifications:

  • Bachelor's degree in Computer Science, Computer Engineering, or a relevant technical field, or equivalent practical experience.

  • 5+ years of experience working in a Site Reliability Engineering or similar role.

  • Extensive experience with Grafana, Azure administration, or similar observability tools and cloud platforms.

  • Strong expertise in performance and load testing methodologies and tools.

  • Proficiency in scripting languages such as PowerShell and JavaScript.

  • Demonstrated leadership skills with the ability to lead and mentor a team effectively.

  • Excellent communication and collaboration skills, with the ability to work effectively with cross-functional teams.

  • Proven track record of designing and implementing scalable and reliable systems.

  • Ability to thrive in a fast-paced environment and effectively prioritize multiple tasks.

Preferred Qualifications:

  • Master's degree in Computer Science, Engineering, or a related field.

  • Certification in Azure administration or related fields.

  • Experience with containerization technologies such as Docker and Kubernetes.

  • Familiarity with CI/CD pipelines and DevOps principles.

  • Deep understanding of networking concepts and protocols.

What does being 'Oaky' look like?

  • Radiating positive energy

  • Assuming good intentions

  • Creating an unmatched patient experience

  • Driving clinical excellence

  • Taking ownership and delivering results

  • Being relentlessly determined

Why Oak Street Health?

Oak Street Health is on a mission to 'Rebuild healthcare as it should be', providing personalized primary care for older adults on Medicare, with the goal of keeping patients healthy and living life to the fullest. Our innovative care model is centered right in our patients' communities and focused on the quality of care over volume of services. We're an organization on the move! With over 150 locations and an ambitious growth trajectory, Oak Street Health is attracting and cultivating team members who embody 'Oaky' values and passion for our mission.

Oak Street Health Benefits:

  • Mission-focused career impacting change and measurably improving health outcomes for Medicare patients

  • Paid vacation, sick time, and investment/retirement 401K match options

  • Health insurance, vision, and dental benefits

  • Opportunities for leadership development and continuing education stipends

  • New centers and flexible work environments

  • Opportunities for high levels of responsibility and rapid advancement

Oak Street Health is an equal opportunity employer. We embrace diversity and encourage all interested readers to apply.

Learn more at www.oakstreethealth.com/diversity-equity-and-inclusion-at-oak-street-health

