Site Reliability Engineering Manager

Solekai Systems Corp Detroit, MI 48222

Posted 2 months ago

Are you ready to step up to the New and take your technology expertise to the next level?

Join Accenture and help transform leading organizations and communities around the world. The sheer scale of our capabilities and client engagements and the way we collaborate, operate and deliver value provides an unparalleled opportunity to grow and advance. Choose Accenture and make delivering innovative work part of your extraordinary career.

People in our Client Delivery & Operations career track drive delivery and capability excellence through the design, development and/or delivery of a solution, service, capability or offering. They grow into delivery-focused roles, and can progress within their current role, laterally or upward.

As part of our practice, you will lead technology innovation for our clients through robust delivery of world-class solutions. You will build better software better! There will never be a typical day, and that's why people love it here. The opportunities to make a difference within exciting client initiatives are unlimited in the ever-changing technology landscape. You will be part of a growing network of highly collaborative technology experts taking on today's biggest, most complex business challenges. We will nurture your talent in an inclusive culture that values diversity. Come grow your career in technology at Accenture!

The Performance Engineering practice within Accenture Technology is focused on optimizing the performance and scalability of enterprise applications through the combination of:

  • Testing: Analyzing, planning and executing production-like simulations across mobile and web solutions to identify & remediate performance problems, prevent production outages, and guarantee predictable performance.

  • Diagnostics & Monitoring: Instrumenting the complete application architecture to capture real-user and system performance data, provide insight into the root cause of all application bottlenecks, and enable real-time visibility that reduces risk exposure.

  • Performance Analytics: Measuring the relationship between end-to-end performance, user behavior, and business goals to maximize the digital business, improve business KPIs, and increase client retention.

  • Business Optimization: Empowering digital businesses with contextual intelligence to visualize, quantify and maximize the business value of performance to improve the quality & performance of the business, increase customer satisfaction, and protect brand reputation.

As a Site Reliability Engineering Manager, some of your key responsibilities may include:

  • Defining the strategy for enabling performance diagnostics and monitoring through the use of an Application Performance Management (APM) tool, other monitoring tools, and diagnostic techniques.

  • Identifying, evaluating, and recommending monitoring tools and diagnostic techniques relevant to the application architecture, including assessing gaps in as-is monitoring tool capabilities and recommending tools to augment or replace them.

  • Interacting with client and/or Accenture development, operations, and infrastructure resources to recommend solutions to remediate performance issues

  • Participating in re-architecture, redesign, and refactoring decisions to satisfy performance requirements

  • Developing dashboards and reports to provide ongoing visibility into the performance of client applications

  • Contributing learnings and experiences to the Accenture Performance Engineering community

  • 80-100% travel requirement, typically M-TH.

Similar Jobs

Site Reliability Engineer San Diego

TuSimple

Posted 3 weeks ago

Come join a higher calling! (Relocation Assistance to San Diego Provided)

TuSimple is a global Artificial Intelligence Technology Company. We are the epicenter of the Autonomous Vehicle Universe. Our breakthroughs are multiple generations ahead of anyone in the world. While inventing the framework of Autonomous Driving, our current fleet of autonomous trucks is helping communities receive much-needed supplies and medical equipment around the clock. Our people are some of the most talented engineers and contributors who are leaving behind a historic legacy.

TuSimple was founded half a decade ago with the goal of bringing the top minds in the world together to achieve the dream of a driverless truck solution. With a foundation in computer vision, algorithms, mapping, and Artificial Intelligence, TuSimple is working to create the first GLOBALLY commercially viable autonomous truck driving platform!

Job Description

Site Reliability Engineering (SRE) is a discipline that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems.

The Site Reliability Engineer will be part of a team working on a variety of software engineering tasks to create and maintain scalable solutions and reliable software systems for our autonomous truck platform. You will have an opportunity to impact backend services such as fleet monitoring, machine learning, and continuous integration, among others.

Responsibilities

  • Collaborate with others to define and execute on our SRE vision

  • Continuously improve our process for detecting, responding to, and learning from production incidents

  • Provide visibility into availability and performance metrics

  • Reclaim engineering time by replacing traditionally laborious human tasks with automation of infrastructure, configuration, and delivery

  • Optimize tools and processes

  • Understand, deploy, and provide technical support for infrastructure systems

  • Engage in system design and development from the perspective of SRE

  • Identify and mitigate real and potential system problems and issues

  • Ensure and improve security, stability, and scalability by creating new code and scripting

  • Monitor and maintain enterprise-level data centers

  • Handle and debug complex server and service-related issues

  • Help service owners identify and instrument Service Level Objectives and design alerts that follow best practices

  • Build tools and mechanisms that enable engineers to deploy and test their services in production

  • Facilitate blameless postmortems and drive effective action items

  • Help teams automate tedious tasks and enable them to quickly launch new services

  • Work with service owners to take a proactive approach to designing tests, observing results, and creating fixes for complex failure scenarios

Qualifications

  • 2-5+ years of experience as an SRE, Production or Systems Engineer

  • A strong foundation in Linux Systems Engineering and Automation

  • Fluency in one or more programming languages such as Go, Python or Java

  • Deep understanding of cloud-based (e.g. AWS, Azure, etc.) services and APIs

  • 1-2 years of experience with container-based architecture, such as Docker and Kubernetes

  • Experience with streaming and database technologies such as Postgres, Kafka, Cassandra, ElasticSearch, etc.

  • Ability to debug complex problems across the whole stack

  • Proficiency with secure configuration management

  • M.S. or B.S. in Computer Engineering and/or Computer Science

  • Strong communication skills and the ability to work across technical teams

  • A passion and habit for measuring key data points

  • Deep knowledge of networking and the OSI model

Perks

  • Relocation Assistance Available

  • Work Visa Sponsorship Available

  • Competitive salary and benefits

  • Bonus/paid vacations/insurance

  • Daily breakfast, lunch, and dinner

  • Full kitchen with unlimited snacks and fruits

  • Medical, Vision, and Dental insurance plan

  • Company 401(K) program

  • Company-paid life insurance

TuSimple is an Equal Opportunity Employer. This company does not discriminate in employment and personnel practices on the basis of race, sex, age, handicap, religion, national origin or any other basis prohibited by applicable law. Hiring, transferring and promotion practices are performed without regard to the above-listed items.
