Posting Job Description
This position is responsible for leading the design, development, and implementation of cloud-based technologies. In this role, you will use your development and operations knowledge to identify and prioritize issues, solve common problems, and mentor junior staff in support of our Cloud infrastructure enterprise-wide. This includes working with our entire engineering organization and Enterprise Architecture.
MAJOR DUTIES AND RESPONSIBILITIES
Actively and consistently support all efforts to simplify and enhance the customer experience.
Take ownership of and accountability for product and site reliability.
Assist in analyzing code, components, and infrastructure for reliability issues and system-level problems.
Work with architects, tech leads, test leads, and stakeholders to identify points of failure.
Define and lead a blue-green deployment approach to enable zero-downtime deployments.
Lead and improve the tooling and automation of our infrastructure to minimize manual work, increase performance, and decrease the frequency and severity of incidents.
Lead hands-on technical implementations for our Cloud service offerings.
Define alert requirements, including the exceptions and messages to be monitored that will trigger alerts and recovery.
Establish best practices for system logging, monitoring, health checks, and recovery.
Define the approach for scaling up and down, and ensure infrastructure provisioning scripts and automation meet implementation requirements.
Work with QA leads, tech leads, and architects to ensure test automation and security testing are integrated with our Cloud solutions and pipeline.
Lead or assist with Root Cause Analyses (RCAs).
Provide critical input into the selection, configuration, and implementation of new and existing technology solutions.
Demonstrate high ownership and ability to drive issues to resolution.
Stay highly organized and juggle many tasks without losing sight of the highest-priority items.
Perform other duties as requested.
Required Skills/Abilities and Knowledge
Ability to read, write, speak and understand English
Advanced experience with the VMware suite of products
Advanced experience managing both physical and virtual infrastructure
Advanced experience with multiple operating systems (e.g. Windows and Linux)
Hands-on experience with one or more cloud computing services (e.g. AWS, Microsoft Azure, Google Cloud Platform, IBM)
Advanced experience implementing a variety of cloud deployment models (e.g. Private, Public, Multi-Cloud)
Proficiency in scripting with one or more languages or tools (e.g. Python, Shell, PowerShell, Ansible or Perl)
Advanced experience with CI/CD tools (Puppet, Ansible, Jenkins)
Advanced experience managing monitoring and alerting tools
Prior experience working in an Agile environment
Familiarity with containerized workloads (e.g. Kubernetes, OpenShift, TKGI)
Advanced experience with firewalls, routing and load balancing
Skilled in troubleshooting methodologies
Excellent written and oral communication skills, including the ability to write technical and process documents
Requires attention to detail and excellent organizational skills
Ability to contribute independently as well as part of a team
Advanced experience managing small projects
Self-starter with the ability to manage tasks with little supervision
Bachelor's degree in Computer Science or related field, or equivalent experience
Required Related Work Experience and Number of Years
System Administration experience
Preferred Related Work Experience and Number of Years
VMware System Administration experience
TKGI (formerly Enterprise Pivotal Container Service)
vROps, Log Insight, vRNI, vRLI
Firewall configuration management
Load Balancer configuration management
CI/CD experience in a customer-facing production environment
Experience as a Site Reliability/DevOps Systems Engineer
AWS or other cloud computing platforms
On-call support on a rotating basis
Time Warner Cable