Lead Site Reliability Engineer

Time Warner Cable Charlotte , NC 28201

Posted 2 months ago

Posting Job Description


This position is responsible for leading design, development and implementation efforts of cloud based technologies. In this role, you will use your development and operations knowledge to identify and prioritize issues, find solutions to common problems and mentor and support junior staff to help support our Cloud infrastructure enterprise wide. This includes working with our entire engineering organization and Enterprise Architecture.


Actively and consistently supports all efforts to simplify and enhance the customer experience.

  • Take ownership and accountability of the Product/site reliability.

  • Assist in analyzing code for reliability issues, components, and infrastructure and system level problems.

  • Work with architects, teach leads, test leads and stakeholders to identify points of failure.

  • Define and lead Blue-Green deployment approach to enable zero-downtime deployment.

  • Lead and improve the tooling and automation of our infrastructure to minimize manual work, increase performance, and decrease the frequency and severity of incidents.

  • Lead technical hands on implementations for our Cloud service offerings.

  • Define the type of alert requirements, exceptions and messages to be monitored that will trigger the alerts and recovery.

  • Establish best practices for system logging, monitoring, health checks, and recovery.

  • Define approach for scale up and scale down and ensure Infrastructure provisioning scripts and automation meet required implementation.

  • Work with QA lead, Tech leads, architects to ensure test automation, security testing is integrated with our Cloud solutions and pipeline.

  • Lead or assist with Root Cause Analyses (RCAs).

  • Provide critical input into the selection, configuration, and implementation of new and existing technology solutions.

  • Demonstrate high ownership and ability to drive issues to resolution.

  • Highly organized and have the ability to juggle many tasks without losing sight to the highest priority items.

  • Perform other duties as requested.


Required Skills/Abilities and Knowledge

Ability to read, write, speak and understand English

  • Advanced experienced with the VMWare suite of products

  • Advanced experienced with managing both physical and Virtual infrastructure

  • Advanced experienced with multiple operating systems (e.g. Windows and Linux)

  • Hands-on experience in one or more of cloud computing services (e.g. AWS, Microsoft Azure, Google Cloud Platforms, IBM, etc.)

  • Advanced experience implementing a variety of cloud service models (e.g. Private, Public, Multi-Cloud)

  • Proficient scripting in one or more languages (e.g. Python, Shell, PowerShell, Ansible or Perl)

  • Advanced experience with CI/CD tools (Puppet, Ansible, Jenkins)

  • Advanced experience managing monitoring and alerting tools

  • Prior experience working in an Agile environment

  • Familiar with containerized workloads (e.g. Kubernetes, Openshift, TKGI)

  • Advanced experienced with firewalls, routing and load balancing

  • Skilled in troubleshooting methodologies

  • Must have excellent written and oral communications, including technical documents, and process documents.

  • Requires attention to detail and excellent organizational skills

  • Ability to contribute independently as well as be a team player

  • Advanced experience managing small projects

  • Self-starter, ability to manage tasks with little supervision

Required Education

Bachelor's degree in Computer Science or related field, or equivalent experience

Required Related Work Experience and Number of Years

Network experience

  • 5+ yrs

System Administration experience

  • 5+ yrs


  • 5+ yrs

Container Services

  • 2+ yrs


  • 3+ yrs

Preferred Related Work Experience and Number of Years

rVMware System Administration experience

  • 8+ yrs

TKGI Enterprise Pivotal Container Services

  • 2+ yrs

VMware NSX-T

  • 2+ yrs.

vROPs, Log Insight, vRNI, vRIL

  • 3+ yrs

Cisco networking

  • 3+ yrs

Firewall configuration management

  • 3+ yrs

Load Balancer configuration management

  • 3+ yrs

CI/CD experience in a customer facing, production environment

  • 1+ yrs

Experience as a Site-Reliability/DevOps System Engineer

  • 3+ yrs

AWS or other cloud computing platforms

  • 1+ yrs


Office Environment

On Call support, on a rotation basis

icon no score

See how you match
to the job

Find your dream job anywhere
with the LiveCareer app.
Mobile App Icon
Download the
LiveCareer app and find
your dream job anywhere
App Store Icon Google Play Icon

Boost your job search productivity with our
free Chrome Extension!

lc_apply_tool GET EXTENSION

Similar Jobs

Want to see jobs matched to your resume? Upload One Now! Remove
Lead Site Reliability Engineer

Wells Fargo

Posted 2 weeks ago

VIEW JOBS 6/10/2022 12:00:00 AM 2022-09-08T00:00 About this role: Wells Fargo is seeking a Lead Systems Operations Engineer... In this role, you will: * Lead in high level technical concepts spanning technology and business * This role maintains the health and efficiency of all application infrastructure, provides professional hands-on implementation and technical advice to ensure the successful implementation of IT Infra. Management products. * Provide support of application platforms at the enterprise level which includes automating platform build, patches, vulnerabilities, change management, incident management, monitoring and continued platform improvement * Collaborate with peers, technology partners and vendors to identify, automate, and roll out patches; resolve issues and achieve goals * Support and manage APIs and Microservices using Java, Spring Boot etc. * Create, organize and implement API support processes * Support and deploy API policies using Apigee Edge Platform * Perform cloud security and certificate management * Develop a long range plan designed to resolve problems and prevent them from recurring * Direct the daily risk and control flow of operations, focusing on policies, procedures and work standards to ensure success Required Qualifications, US: * 7+ years of Technology Infrastructure Engineering and Solutions experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education * 7+ years of Red Hat Linux or UNIX experience * 3+ years of scripting and automation using Python, Shell, or similar scripting languages * 3+ years of supporting application platforms * 2+ years of experience using one or more of the following automation tools: Ansible, Puppet, Chef, GitHub, Jenkins, Kubernetes, or similar tools * 1+ years of experience supporting APIs or API Management Platforms like Apigee, IBM, MuleSoft, etc. Desired Qualifications: * Experience with Apigee, OAuth admin, and JWT * Experience creating, organizing and implementing API support processes * Support and deploy API policies using Apigee Edge Platform * Understanding of OpenAPI Specifications in JSON or YAML format, including JWT authorizations * Knowledge of authentication, authorization of services via OAuth 2 * Support and manage APIs and Microservices using Java, Spring Boot etc. * Experience using POSTMAN for making management API Calls. * Knowledge and understanding of AppDynamics, Splunk and Kafka Job Expectations: * Flexibility to work in a 24/7 environment, including weekends and holidays * Ability to travel up to 10% of the time * Willingness to work on-site at stated location on the job opening * Ability to provide and work from a home office @RWF22 We Value Diversity At Wells Fargo, we believe in diversity, equity and inclusion in the workplace; accordingly, we welcome applications for employment from all qualified candidates, regardless of race, color, gender, national origin, religion, age, sexual orientation, gender identity, gender expression, genetic information, individuals with disabilities, pregnancy, marital status, status as a protected veteran or any other status protected by applicable law. Employees support our focus on building strong customer relationships balanced with a strong risk mitigating and compliance-driven culture which firmly establishes those disciplines as critical to the success of our customers and company. They are accountable for execution of all applicable risk programs (Credit, Market, Financial Crimes, Operational, Regulatory Compliance), which includes effectively following and adhering to applicable Wells Fargo policies and procedures, appropriately fulfilling risk and compliance obligations, timely and effective escalation and remediation of issues, and making sound risk decisions. There is emphasis on proactive monitoring, governance, risk identification and escalation, as well as making sound risk decisions commensurate with the business unit's risk appetite and all risk and compliance program requirements. Candidates applying to job openings posted in US: All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran. Candidates applying to job openings posted in Canada: Applications for employment are encouraged from all qualified candidates, including women, persons with disabilities, aboriginal peoples and visible minorities. Accommodation for applicants with disabilities is available upon request in connection with the recruitment process. Wells Fargo Charlotte NC

Lead Site Reliability Engineer

Time Warner Cable