Systems Reliability Engineer

Disney Glendale , AZ 85301

Posted 6 months ago

Job Description:

The Walt Disney Company (TWDC) has an immediate opening for a DevOps oriented Systems Reliability Engineer

(SRE). The qualified candidate will have a keen interest in DevOps/SRE philosophies and an extensive background in

implementing systems utilizing DevOps/SRE practices. The position is hands-on; therefore, the successful candidate

will have strong implementation, system diagnostic and leadership skills. This is a high-visibility position with a great

deal of growth opportunity in DevOps/SRE so apply today!

The SRE will help create, build and deliver new technologies or platforms. This will include consultation, designing,

building, and supporting development pipelines, automating infrastructure and operations, creating telemetry for

monitoring, engineering high reliability and reinforcing best practices to secure our company and guest data.

Job Type

Full Time

Alternate Location-State/Region

WA

Segment

The Walt Disney Company (Corporate)

Category

Technology

Basic Qualifications

Work Experience

  • 2-3 Years with degree or 4-5 years without degree

Technical Requirements

  • Familiarity in one or more programming languages (e.g. Go, Python, Ruby, Node, Java, others alike)
  • Networking skills and protocols (e.g. HTTP, TLS, SSH, DNS)
  • Software Development Continuous Integration (CI) Pipeline knowledge (e.g. Jenkins,Gitlab CI)
  • Experience with Containers (e.g. Docker) and Container Orchestration technologies (Kubernetes, Mesos, DockerEE)
  • Experience with Source Control Management systems (e.g. Git)
  • Expertise in public and private cloud hosting services (AWS, Azure)
  • Familiarity with configuration management (e.g. Terraform, Chef)
  • Systems administration skills on Linux platforms

Communication and Leadership Requirements

  • Curiosity/ability to stay abreast of emerging system development trends

  • Ability to adapt in fast-paced and changing environment

  • Experience with Service Management (e.g. ITIL, using ServiceNow for Incident, Change, Problem and Config Management)

Business

The Walt Disney Company (Corporate)

Required Education

Bachelor of Science degree in computer science or related field and 2-3 years of related work experience or 4-5 years of related work experience in lieu of a degree

Postal Code
91201

Responsibilities

Design: Leading project/planning efforts, architectural design, engineering, attending meetings w/ various teams.

Build: Implementing, integrating and configuring solutions, tools, infrastructure and systems.

Run: Troubleshoot, Conduct Post Mortems, Provide Level 2 & 3 Maintenance and Support

Alternate Country / Region

US

Job Description

The Walt Disney Company (TWDC) has an immediate opening for a DevOps oriented Systems Reliability Engineer

(SRE). The qualified candidate will have a keen interest in DevOps/SRE philosophies and an extensive background in

implementing systems utilizing DevOps/SRE practices. The position is hands-on; therefore, the successful candidate

will have strong implementation, system diagnostic and leadership skills. This is a high-visibility position with a great

deal of growth opportunity in DevOps/SRE so apply today!

The SRE will help create, build and deliver new technologies or platforms. This will include consultation, designing,

building, and supporting development pipelines, automating infrastructure and operations, creating telemetry for

monitoring, engineering high reliability and reinforcing best practices to secure our company and guest data.

Basic Qualifications

Work Experience

  • 2-3 Years with degree or 4-5 years without degree

Technical Requirements

  • Familiarity in one or more programming languages (e.g. Go, Python, Ruby, Node, Java, others alike)
  • Networking skills and protocols (e.g. HTTP, TLS, SSH, DNS)
  • Software Development Continuous Integration (CI) Pipeline knowledge (e.g. Jenkins,Gitlab CI)
  • Experience with Containers (e.g. Docker) and Container Orchestration technologies (Kubernetes, Mesos, DockerEE)
  • Experience with Source Control Management systems (e.g. Git)
  • Expertise in public and private cloud hosting services (AWS, Azure)
  • Familiarity with configuration management (e.g. Terraform, Chef)
  • Systems administration skills on Linux platforms

Communication and Leadership Requirements

  • Curiosity/ability to stay abreast of emerging system development trends

  • Ability to adapt in fast-paced and changing environment

  • Experience with Service Management (e.g. ITIL, using ServiceNow for Incident, Change, Problem and Config Management)

Required Education

Bachelor of Science degree in computer science or related field and 2-3 years of related work experience or 4-5 years of related work experience in lieu of a degree

Responsibilities

Design: Leading project/planning efforts, architectural design, engineering, attending meetings w/ various teams.

Build: Implementing, integrating and configuring solutions, tools, infrastructure and systems.

Run: Troubleshoot, Conduct Post Mortems, Provide Level 2 & 3 Maintenance and Support


icon no score

See how you match
to the job

Find your dream job anywhere
with the LiveCareer app.
Mobile App Icon
Download the
LiveCareer app and find
your dream job anywhere
App Store Icon Google Play Icon
lc_ad

Boost your job search productivity with our
free Chrome Extension!

lc_apply_tool GET EXTENSION

Similar Jobs

Want to see jobs matched to your resume? Upload One Now! Remove
Sr Systems Reliability Engineer

Walt Disney Co.

Posted 6 days ago

VIEW JOBS 10/11/2019 12:00:00 AM 2020-01-09T00:00 Job Description Do you want to be part of a team that creates magic for millions of guests across all of the Disney brands? Behind the scenes, the Enterprise Technology team helps deliver magical digital & physical experiences leveraging the latest technology; and our teams provide expert engineering services in cloud, automation, and systems reliability engineering to support the innovation and operation of The Walt Disney Company. We are passionate about ensuring our systems provide the best guest experience! To be successful in this role, you will protect, operate, and continuously improve the automation and systems that run Disney's experiences, products, & services with a focus on availability, latency, automation, & cross-company collaboration while embracing a DevOps culture. Teams are located in Seattle, Burbank, Orlando, Bristol and New York. This position is ideally in Burbank. Job Type Full Time Segment The Walt Disney Company (Corporate) Category Technology Basic Qualifications * Proficient, collaborative, & experienced in building reliable, scalable, micro-service-oriented systems * Passionate and curious about ways to leverage technology while continually learning * Ability to identify root-cause sources of instability in a high-traffic, large-scale distributed system * Configuration management and orchestration (e.g. Chef, Terraform, Cloud Formation) * One or more languages in your skillset (e.g. GO, Python, Java, Ruby) * Containerization (e.g. Docker, Kubernetes, Mesos, Elastic Container Service) * Skilled in Cloud/PaaS Environments (e.g. AWS, Google Cloud Compute) * Thorough knowledge of continuous integration tools (e.g. Jenkins) * UNIX/Linux administration, troubleshooting, performance tuning, & security * 5 years of experience in technical operations or systems engineering * 3+ years' operating complex, large-scale Enterprise guest-facing Applicati3+ years' operating complex, large-scale Enterprise guest-facing Applications or web sites * Experience with AWS, Google or similar cloud computing environments. * Experience working in an Agile development environment * Experience with F5 load balancing helpful * UNIX/LINUX and some Windows server experience, including expertise in system installation, configuration, administration, troubleshooting, performance tuning, preventative maintenance, capacity planning, monitoring, and security procedures * Web (IIS, Apache) and Java application (Tomcat, Jboss, etc) server expertise including installation, administration, configuration, troubleshooting, performance tuning, preventative maintenance, capacity planning, monitoring, and security procedures * Experience in at least one relevant scripting or programming languages (Ruby, Perl, Python, Shell, etc.) * Experience with Automation platforms (Chef, cfengine, puppet, etc.) * Web (IIS, Apache) and Java application (Tomcat, Jboss, etc) server expertise including installation, administration, configuration, troubleshooting, performance tuning, preventative maintenance, capacity planning, monitoring, and security procedures * Experience in at least one relevant scripting or programming languages (Ruby, Perl, Python, Shell, etc.) * Experience with Automation platforms (Chef, cfengine, puppet, etc.) * Understanding of internet standards such as HTTP, DNS, FTP, • * SSH, HTML, XML, JDBC, SNMP and other protocols * Understanding of high availability hardware and database systems design and implementation including cluster management, redundancy and failover testing * Knowledge of storage systems (SAN, NAS, RAID Array, etc) * Experience hardening and maintaining secure systems (Safe Harbor or PCI experience a plus!) * Network hardware architecting experience with load balancing equipment, switches, routers, and network troubleshooting * Ability to produce system documentation, including writing requirements, operational specifications, system architecture, test plans and as-built documentation, all with attention to detail * Ability to work within formal processes that assure a quality product and operational integrity * Experience working with ITIL and Service Management best practices is a plus. Business The Walt Disney Company (Corporate) Required Education * Bachelor's degree or equivalent experience in technical operations or software engineering Postal Code 91201 Preferred Education * Bachelor's degree in computer science or related field preferred Responsibilities * Code, and deploy systems, new technologies, and best practices in the cloud using self-healing, infrastructure-as-code, security, and automation patterns * Develop useful telemetry, alerts, and response to identify and address reliability risks * Participate in on-call rotation with other engineering teams * Identify, experiment, & evangelize new technologies, ideas, and best practices across the broader engineering community * Collaborate and provide technical leadership within and across teams Job Description Do you want to be part of a team that creates magic for millions of guests across all of the Disney brands? Behind the scenes, the Enterprise Technology team helps deliver magical digital & physical experiences leveraging the latest technology; and our teams provide expert engineering services in cloud, automation, and systems reliability engineering to support the innovation and operation of The Walt Disney Company. We are passionate about ensuring our systems provide the best guest experience! To be successful in this role, you will protect, operate, and continuously improve the automation and systems that run Disney's experiences, products, & services with a focus on availability, latency, automation, & cross-company collaboration while embracing a DevOps culture. Teams are located in Seattle, Burbank, Orlando, Bristol and New York. This position is ideally in Burbank. Basic Qualifications * Proficient, collaborative, & experienced in building reliable, scalable, micro-service-oriented systems * Passionate and curious about ways to leverage technology while continually learning * Ability to identify root-cause sources of instability in a high-traffic, large-scale distributed system * Configuration management and orchestration (e.g. Chef, Terraform, Cloud Formation) * One or more languages in your skillset (e.g. GO, Python, Java, Ruby) * Containerization (e.g. Docker, Kubernetes, Mesos, Elastic Container Service) * Skilled in Cloud/PaaS Environments (e.g. AWS, Google Cloud Compute) * Thorough knowledge of continuous integration tools (e.g. Jenkins) * UNIX/Linux administration, troubleshooting, performance tuning, & security * 5 years of experience in technical operations or systems engineering * 3+ years' operating complex, large-scale Enterprise guest-facing Applicati3+ years' operating complex, large-scale Enterprise guest-facing Applications or web sites * Experience with AWS, Google or similar cloud computing environments. * Experience working in an Agile development environment * Experience with F5 load balancing helpful * UNIX/LINUX and some Windows server experience, including expertise in system installation, configuration, administration, troubleshooting, performance tuning, preventative maintenance, capacity planning, monitoring, and security procedures * Web (IIS, Apache) and Java application (Tomcat, Jboss, etc) server expertise including installation, administration, configuration, troubleshooting, performance tuning, preventative maintenance, capacity planning, monitoring, and security procedures * Experience in at least one relevant scripting or programming languages (Ruby, Perl, Python, Shell, etc.) * Experience with Automation platforms (Chef, cfengine, puppet, etc.) * Web (IIS, Apache) and Java application (Tomcat, Jboss, etc) server expertise including installation, administration, configuration, troubleshooting, performance tuning, preventative maintenance, capacity planning, monitoring, and security procedures * Experience in at least one relevant scripting or programming languages (Ruby, Perl, Python, Shell, etc.) * Experience with Automation platforms (Chef, cfengine, puppet, etc.) * Understanding of internet standards such as HTTP, DNS, FTP, • * SSH, HTML, XML, JDBC, SNMP and other protocols * Understanding of high availability hardware and database systems design and implementation including cluster management, redundancy and failover testing * Knowledge of storage systems (SAN, NAS, RAID Array, etc) * Experience hardening and maintaining secure systems (Safe Harbor or PCI experience a plus!) * Network hardware architecting experience with load balancing equipment, switches, routers, and network troubleshooting * Ability to produce system documentation, including writing requirements, operational specifications, system architecture, test plans and as-built documentation, all with attention to detail * Ability to work within formal processes that assure a quality product and operational integrity * Experience working with ITIL and Service Management best practices is a plus. Required Education * Bachelor's degree or equivalent experience in technical operations or software engineering Preferred Education * Bachelor's degree in computer science or related field preferred Responsibilities * Code, and deploy systems, new technologies, and best practices in the cloud using self-healing, infrastructure-as-code, security, and automation patterns * Develop useful telemetry, alerts, and response to identify and address reliability risks * Participate in on-call rotation with other engineering teams * Identify, experiment, & evangelize new technologies, ideas, and best practices across the broader engineering community * Collaborate and provide technical leadership within and across teams Walt Disney Co. Glendale AZ

Systems Reliability Engineer

Disney