Sr. Site Reliability Engineer

Adobe Systems Incorporated San Jose , CA 95111

Posted 2 months ago

Our Company

Changing the world through digital experiences is what Adobe's all about. We give everyone-from emerging artists to global brands-everything they need to design and deliver exceptional digital experiences! We're passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen.

We're on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity. We realize that new ideas can come from everywhere in the organization, and we know the next big idea could be yours!

The Opportunity

You will be a member of the Site Reliability Engineering team in Dynamic Media. We are looking for a Senior Site Reliability Engineer with a software engineering background who is passionate about developing software to help scale monitoring, alerting, provisioning and configuration management. We are a multi-cloud environment, are security-focused and are helping customers succeed. This individual should be self-motivated and have a drive for quality.

What you'll Do

  • Will use troubleshooting, monitoring and reporting tools to analyze the root cause of serious and impactful technical issues and to build stable and sustainable solutions and improvements.

  • Work closely with customer care, internal escalation teams, product management, and engineering to seek solutions for customers and drive ownership of tasks toward completion.

  • Drive and improve the whole lifecycle of operational readiness - from inception and design, through deployment, operation and refinement.

  • Develop tools, operational enhancements and automated solutions that enable self-service configuration changes, speed deployments and improve monitoring in support of business-critical customer facing SaaS applications and environments.

  • Assist our software engineering team to ensure proper monitoring and metrics are being built into the applications before going to production.

  • Participate in an on-call rotation.

What you need to succeed

  • Bachelor's Degree in Computer Science or equivalent and 5 years of relevant work experience.

  • Full Stack troubleshooting experience including networking, operating system (CentOS, Windows), Tomcat, AVI, Mongo and Oracle.

  • Experience leveraging monitoring tools such as Splunk, New Relic, Prometheus, Grafana and Nagios for troubleshooting.

  • Experience with AWS and/or Azure stack - particularly in the areas of networking, VMs, databases and load balancing.

  • Excellent information management practices, such as detailed documentation, usage of wikis and other collaboration tools.

  • Ability to scope project work, estimate effort and then break down work into sub-tasks.

  • Experience developing applications in one or more of the following: Python, Nodejs, or Java.

  • Strong comprehension of continuous integration and continuous deployment methodologies.

  • Excellent written and verbal communication skills, demonstrating the ability to effectively convey technical information to both technical and non-technical audiences.

icon no score

See how you match
to the job

Find your dream job anywhere
with the LiveCareer app.
Mobile App Icon
Download the
LiveCareer app and find
your dream job anywhere
App Store Icon Google Play Icon

Boost your job search productivity with our
free Chrome Extension!

lc_apply_tool GET EXTENSION

Similar Jobs

Want to see jobs matched to your resume? Upload One Now! Remove
Sr Site Reliability Engineer

Motion Recruitment

Posted 3 weeks ago

VIEW JOBS 9/23/2021 12:00:00 AM 2021-12-22T00:00 Salary Min: $100/Hr Max: $150/Hr Title: Sr. Site Reliability Engineer Job Description We are working with a leader in the image sharing and social media service space. The company is looking to hire some very senior Site Reliability Engineer contractors to join their Star Caching team and the Hbase storage team. The role will focus on working on Hbase and Hadoop. This role is based in San Francisco, has the option to be fully remote, and is a year long contract with the possibility to be extended to 18 months. The role will require experience with Puppet, AWS, and Python. Required Skills & Experience * Puppet, or other CM. * AWS * Python - strong scripting, not CS fundamentals though. Not algorithms * Nice to have - Big Data with Hadoop/HBase Desired Skills & Experience * Big Data with Hadoop/HBase The Offer * Competitive Salary: Up to $280K/year, DOE You will receive the following benefits: * Medical, Dental, Vision Insurance * 401(k) with 3% matching * 15 days of PTO * 10% bonus * Stock Options Applicants must be currently authorized to work in the United States on a full-time basis now and in the future. Motion Recruitment San Jose CA

Sr. Site Reliability Engineer

Adobe Systems Incorporated