Senior Manager - Site Reliability Engineering, Cloud Operations

Vmware, Inc. Colorado Springs , CO 80928

Posted 3 days ago

As a Senior Manager in CPE Cloud Operations, you will be working with and managing a team of seasoned systems, network, security, database, automation, and storage specialist responsible for complex operational issues and tasks found in our global cloud environment. This groups of people is challenged with solving operational issue using code and are adopting a strong DevOps mindset.

This team builds and operates one of (if not) the largest VMware private clouds in the world and is responsible for full end to end lifecycle of the service. With scale and high demand it is essential that the SRE team manages infrastructure efficiently leveraging practices for configuration management, Infrastructure as Code, efficient auto-remediation, etc

Success in this role requires strong experience managing engineering focused teams, an aptitude for distributed systems and attention to minute details. You need to have well developed systems and code-level troubleshooting abilities. You are expected to analyze complex system behaviors or performance problems, and be able to trace issues across multiple systems.

Responsibilities:

  • Directly Manage a team of senior level specialized

  • Manage initiative across a globally distributed workforce

  • Operate in a dynamic cloud services environment

  • Develop automation, mature processes, and design tools to improve cloud lifecycle

  • Participate in troubleshooting, capacity analysis and planning, and performance analysis

  • Work with internal engineering, product management, and other strategic teams to create and articulate VMware's vision for the software defined data center, including global cloud infrastructure architecture, network, systems and storage, virtualization design, cloud operating models and tools frameworks, and other supporting technologies and processes

Basic Qualifications:

  • Minimum of 7 years of experience managing and developing a highly technical team within large (>$1 billion) companies

  • Familiarity with cloud-based computing services like AWS, Rackspace Cloud, Azure, etc.

  • Hands-on operational experience in a critical production service environment

  • Multiple years of experience with the following technologies: Systems Administration (Linux/Windows), Networking (LAN, WAN), Storage, and Virtualization

  • Proven technical troubleshooting and performance tuning experience

  • Ability to attract, motivate, and retain top talent

  • Excellent verbal and written communication skills

  • Excellent teamwork and leadership skills

Preferred Qualifications:

  • Thorough understanding of cloud service delivery infrastructure ecosystem, operational processes, and orchestration models

  • Experience in creating and governing reference architecture processes and artifact portfolios that clearly link business and technical requirements to the strategic architectures, solution designs and technology strategies being used to satisfy them

  • Ability to articulate strategy at CxO levels

  • Experience with writing scripts and tools to diagnose and address issues (Python, Ruby, Ansible)

  • Experience with integration tools like Jenkins or stackstorm

  • Relevant technical certifications (MCSE, CCNA, VCP, etc.)

icon no score

See how you match
to the job

Find your dream job anywhere
with the LiveCareer app.
Mobile App Icon
Download the
LiveCareer app and find
your dream job anywhere
App Store Icon Google Play Icon
lc_ad

Boost your job search productivity with our
free Chrome Extension!

lc_apply_tool GET EXTENSION

Similar Jobs

Want to see jobs matched to your resume? Upload One Now! Remove
Senior Manager Site Reliability Engineering Cloud Operations

Dell Inc

Posted 2 days ago

VIEW JOBS 3/16/2019 12:00:00 AM 2019-06-14T00:00 Job ID R1903301-1 Date posted Mar. 15, 2019 As a Senior Manager in CPE Cloud Operations, you will be working with and managing a team of seasoned systems, network, security, database, automation, and storage specialist responsible for complex operational issues and tasks found in our global cloud environment. This groups of people is challenged with solving operational issue using code and are adopting a strong DevOps mindset. This team builds and operates one of (if not) the largest VMware private clouds in the world and is responsible for full end to end lifecycle of the service. With scale and high demand it is essential that the SRE team manages infrastructure efficiently leveraging practices for configuration management, Infrastructure as Code, efficient auto-remediation, etc Success in this role requires strong experience managing engineering focused teams, an aptitude for distributed systems and attention to minute details. You need to have well developed systems and code-level troubleshooting abilities. You are expected to analyze complex system behaviors or performance problems, and be able to trace issues across multiple systems. Responsibilities: * Directly Manage a team of senior level specialized * Manage initiative across a globally distributed workforce * Operate in a dynamic cloud services environment * Develop automation, mature processes, and design tools to improve cloud lifecycle * Participate in troubleshooting, capacity analysis and planning, and performance analysis * Work with internal engineering, product management, and other strategic teams to create and articulate VMware's vision for the software defined data center, including global cloud infrastructure architecture, network, systems and storage, virtualization design, cloud operating models and tools frameworks, and other supporting technologies and processes Basic Qualifications: * Minimum of 7 years of experience managing and developing a highly technical team within large (>$1 billion) companies * Familiarity with cloud-based computing services like AWS, Rackspace Cloud, Azure, etc. * Hands-on operational experience in a critical production service environment * Multiple years of experience with the following technologies: Systems Administration (Linux/Windows), Networking (LAN, WAN), Storage, and Virtualization * Proven technical troubleshooting and performance tuning experience * Ability to attract, motivate, and retain top talent * Excellent verbal and written communication skills * Excellent teamwork and leadership skills Preferred Qualifications: * Thorough understanding of cloud service delivery infrastructure ecosystem, operational processes, and orchestration models * Experience in creating and governing reference architecture processes and artifact portfolios that clearly link business and technical requirements to the strategic architectures, solution designs and technology strategies being used to satisfy them * Ability to articulate strategy at CxO levels * Experience with writing scripts and tools to diagnose and address issues (Python, Ruby, Ansible) * Experience with integration tools like Jenkins or stackstorm * Relevant technical certifications (MCSE, CCNA, VCP, etc.) Dell Inc Colorado Springs CO

Senior Manager - Site Reliability Engineering, Cloud Operations

Vmware, Inc.