Manager, Reliability Engineer (Remote)

The Hartford, Columbus, OH 43216

Posted 1 week ago

Manager, Reliability Engineer - IE07LE

We're determined to make a difference and are proud to be an insurance company that goes well beyond coverages and policies. Working here means having every opportunity to achieve your goals - and to help others accomplish theirs, too. Join our team as we help shape the future.

Claims, Contact Center & Digital Enablement is seeking a Manager of Reliability Engineering for its Claims portfolio. This manager will serve as Technical Product Owner, driving the shared vision, OKRs, and backlog priorities for the Claims RE Squad. As Technical Product Owner, the candidate will support and prioritize the Squad's adoption and application of engineering and RE best practices. Successful candidates will have experience driving cloud transformation initiatives and sustaining top-quartile operating standards by leveraging leading market practices, along with proven delivery in cloud-centric operating models that use Site Reliability Engineering (SRE) practices across large enterprises. Preferred candidates will possess advanced AWS cloud engineering and governance skills. This position requires a strong technical understanding of complex IT environments and evolving technologies.

Responsibilities:

  • Manage a team of Reliability Engineers through hiring, performance management, coaching and development. May include managing vendor partner relationships to create value for the organization.

  • Guide the use of best-in-class software engineering standards, tools, and design practices to enable highly available and performant customer-facing applications. Lead adoption of metrics of overall application health: availability, performance, monitoring, alerting, quality, currency and resiliency.

  • Serve as key liaison between the architecture and software engineering teams to influence the technical strategy for the organization, keeping in mind its cross-functional impacts, integration across the organization, and architecture rationalization.

  • Function as the go-to technical expert for the applications and infrastructure supported, requiring depth and breadth of knowledge in technologies, applications, integration, interfaces and business domain.

  • Develop effective tooling, alerts, and response mechanisms to identify and address reliability and security risks leveraging automation to support problem prevention, detection, mitigation, and resolution.

  • Enhance the velocity of the SDLC by engineering the appropriate solutions to increase delivery speed while adhering to technology standards for sustained reliability.

  • Progressively implement preventative controls and drive increased automation and self-healing capabilities. Continue to improve cost efficiency baselines.

  • Promote and implement innovative solutions.

  • Champion the migration of applications to open source platforms, PaaS, containers, serverless, event-based designs, and other cloud technology standards for cloud-enablement and platform agility.

  • Drive simplification across the stack, ensuring that all technical designs can be operated effectively and cost-efficiently without adding operational complexity.

  • Drive inner- and open-sourcing practices to accelerate the development of self-service enterprise capabilities.

  • Set up scalable SDLC environments using COTS, PaaS, and SaaS products that serve data, application, and infrastructure pipeline needs.

  • Build solutions that promote the migration of applications to open source platforms, PaaS, containers, and other cloud technology standards for cloud-enablement and platform agility.

  • Ensure operational excellence. Independently drive the triaging and service restoration of all high-impact incidents in order to minimize the mean time to service restoration and impact to the business. Demonstrate end-to-end ownership.

  • Partner with infrastructure teams to design and implement intelligent automation and orchestration systems, enhanced monitoring/alerting capabilities and rapid service restoration processes. Take proactive measures to prevent high-impact incidents.

  • Achieve and maintain the technical business continuity of Hartford and third-party assets that support customer-facing functions. Accountable for improving the IT application and infrastructure resiliency.

  • Govern the overall D&A platform ecosystem, with a focus on processes and solutions for data masking (PII management) and data lifecycle management.

Qualifications:

  • 8-12+ years of relevant technical experience in the financial services industry

  • Bachelor's degree in Information Technology Management, or equivalent work experience

  • A minimum of 3 years of demonstrated leadership, including the ability to influence senior management and critical stakeholders

  • Must have led a production engineering team or SRE function, with responsibilities including supporting and maintaining customer-facing SLOs or KPIs, or have owned the full development lifecycle for developer experience ecosystem tools

  • Must have demonstrated a strong track record of achieving business plan objectives

  • Must have exceptional communication skills (written, oral, presentation and facilitation)

  • Understanding of robotics and artificial intelligence to improve services

  • Experience in strategy development to achieve business objectives

  • Hands-on application development and production support is a plus

  • Ability to develop, manage and communicate frameworks (e.g., Cloud Security Alliance)

  • Solid understanding of technologies that support the services offered for cloud applications

  • Firm analytical and problem-solving skills

Compensation

The listed annualized base pay range is primarily based on analysis of similar positions in the external market. Actual base pay could vary and may be above or below the listed range based on factors including but not limited to performance, proficiency and demonstration of competencies required for the role. The base pay is just one component of The Hartford's total compensation package for employees. Other rewards may include short-term or annual bonuses, long-term incentives, and on-the-spot recognition. The annualized base pay range for this role is:

$136,400 - $204,600

Equal Opportunity Employer/Females/Minorities/Veterans/Disability/Sexual Orientation/Gender Identity or Expression/Religion/Age


