Site Reliability Engineer

Adobe Systems Incorporated, San Jose, CA 95111

Posted 2 months ago

Our Company

Changing the world through digital experiences is what Adobe's all about. We give everyone, from emerging artists to global brands, everything they need to design and deliver exceptional digital experiences! We're passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen.

We're on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity. We realize that new ideas can come from everywhere in the organization, and we know the next big idea could be yours!

The Opportunity:

Adobe Analytics is THE industry-leading customer intelligence solution for brands to better build relationships with their customers. To deliver this worldwide solution, hundreds of engineers across dozens of teams and all geographies work together to ensure a performant, reliable, agile, responsive and secure Software as a Service platform. And as Site Reliability Engineers we're right in the thick of it!

As members of the Site Reliability Engineering practice we continuously evolve the operational foundation for myriad microservices and products with infrastructure ranging across public and private cloud, from bare metal to virtualization to containerization. We're passionate about innovation to resolve friction on the service-infrastructure interface. We standardize; we automate; we pull; we commit; we deliver. Our work is central to critical performance for THE data & insights engine for OUR customers about THEIR customers.

What will make you successful:

Be focused on high-value and differentiating software solutions to meet multifaceted technical and market shifts. Demonstrate an aptitude for weighing the use of existing technology offerings with measured customization against building from scratch or re-architecting.

Possess excellent troubleshooting skills. Have an appetite for signal production & capture, dynamic decisioning and automated response flows. Find optimizations and other efficiencies to scale the application & environment.

Tackle issues across the entire stack: hardware, network, software and application. Have a solid understanding of operational principles, such as capacity planning, monitoring and incident handling. Participate in on-call duties.

Qualifications & skills:

  • 7 years of relevant work experience

  • Bachelor's or Master's degree in Computer Science, Information Systems or a related field

  • Proven programming skills in two or more of: PowerShell, Python, Java, Bash, Perl, PHP, C

  • Knowledge of HPE, Dell & Cisco hardware

  • 4+ years of Microsoft Windows systems administration experience (Server 2016, SCOM, SCCM, SCVMM)

  • Experience troubleshooting and deploying applications on Linux

  • Knowledge of networking concepts including TCP/IP, HTTP, DNS, SMTP, SSL, SSH

  • Ability to prioritize tasks and work independently with strong organizational, documentation and communication skills

Bonus skills:

  • Configuration management tools such as Salt and Ansible

  • Server virtualization technologies

  • Experience with Hadoop, HDFS, Kafka, Spark, Docker, Mesos, Kubernetes, ZooKeeper, Cassandra, AWS or Azure

Adobe believes in hiring the very best. We are known for our vibrant, dynamic and exciting workplace where personal and professional fulfillment and company success go hand in hand. We take pride in creating exceptional work experiences, encouraging innovation and being involved with our employees, customers and communities. We invite you to discover what makes Adobe a place where exceptional people thrive.

