Site Reliability Engineer

Solekai Systems Corp, Miami, FL 33196

Posted 2 months ago

Are you ready to step up to the New and take your technology expertise to the next level?

Join Accenture and help transform leading organizations and communities around the world. The sheer scale of our capabilities and client engagements and the way we collaborate, operate and deliver value provides an unparalleled opportunity to grow and advance. Choose Accenture and make delivering innovative work part of your extraordinary career.

People in our Client Delivery & Operations career track drive delivery and capability excellence through the design, development and/or delivery of a solution, service, capability or offering. They grow into delivery-focused roles, and can progress within their current role, laterally or upward.

As part of our practice, you will lead technology innovation for our clients through robust delivery of world-class solutions. You will build better software better! There will never be a typical day, and that's why people love it here. The opportunities to make a difference within exciting client initiatives are unlimited in the ever-changing technology landscape. You will be part of a growing network of highly collaborative technology experts taking on today's biggest, most complex business challenges. We will nurture your talent in an inclusive culture that values diversity. Come grow your career in technology at Accenture!

The Performance Engineering practice within Accenture Technology is focused on optimizing the performance and scalability of enterprise applications through the combination of:

  • Testing: Analyzing, planning and executing production-like simulations across mobile and web solutions to identify & remediate performance problems, prevent production outages, and guarantee predictable performance.

  • Diagnostics & Monitoring: Instrumenting the complete application architecture to capture real user and system performance data, provide insight into the root cause of application bottlenecks, and enable real-time visibility to reduce risk exposure.

  • Performance Analytics: Measuring the relationship between end-to-end performance, user behavior, and business goals to maximize the digital business, improve business KPIs, and increase client retention.

  • Business Optimization: Empowering digital businesses with contextual intelligence to visualize, quantify and maximize the business value of performance to improve the quality & performance of the business, increase customer satisfaction, and protect brand reputation.

As a Site Reliability Engineer, some of your key responsibilities may include:

  • Maintain responsibility for the design, deployment, and maintenance of production-scale systems.

  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.

  • Use automation to streamline the provisioning, management, and monitoring of applications and services using multiple scripting languages, Java, and infrastructure-as-code.

  • Facilitate blameless Incident Retrospectives to understand root causes, communicate learnings, determine remediation and make us better and closer as a team.

  • Coordinate with development and platform teams to design and implement zero-downtime deployment approaches, real-time logging, alerting, and monitoring solutions, and code instrumentation.

  • Introduce chaos engineering concepts that promote experimentation in production to identify systemic weaknesses while increasing service resiliency.

  • Coordinate with the solution architect to design a highly available solution that meets availability and reliability objectives, and reduce manual activities through automation where feasible.

  • Identify, evaluate, and recommend monitoring tools and diagnostic techniques relevant to the application architecture. Assess gaps in as-is monitoring tool capabilities and recommend tools to augment or replace them.

  • Instrument applications to enable performance diagnostics and monitoring.

  • Collaborate with developers to promote reliability engineering during all phases of the SDLC, so that performance issues are detected and corrected earlier in the lifecycle.

  • Monitor application performance during performance tests or production usage, using APM and other monitoring tools to isolate the fault domain, dive deep into application code, and identify the root cause of performance issues.

  • Interact with client and/or Accenture development, operations, and infrastructure resources to recommend solutions that remediate performance issues.

  • Participate in re-architecture, redesign, and refactoring decisions to satisfy performance requirements.

  • Develop dashboards and reports to provide ongoing visibility into the performance of client applications.

  • Contribute learnings and experiences to the Accenture Performance Engineering community.

  • Travel 80-100% of the time, typically Monday through Thursday.
