Site Reliability Engineer

Solekai Systems Corp Saint Louis , MO 63150

Posted 2 months ago

Are you ready to step up to the New and take your technology expertise to the next level?

Join Accenture and help transform leading organizations and communities around the world. The sheer scale of our capabilities and client engagements and the way we collaborate, operate and deliver value provides an unparalleled opportunity to grow and advance. Choose Accenture and make delivering innovative work part of your extraordinary career.

People in our Client Delivery & Operations career track drive delivery and capability excellence through the design, development and/or delivery of a solution, service, capability or offering. They grow into delivery-focused roles, and can progress within their current role, laterally or upward.

As part of our practice, you will lead technology innovation for our clients through robust delivery of world-class solutions. You will build better software better! There will never be a typical day and that's why people love it here. The opportunities to make a difference within exciting client initiatives are unlimited in the ever-changing technology landscape. You will be part of a growing network of technology experts who are highly collaborative taking on today's biggest, most complex business challenges. We will nurture your talent in an inclusive culture that values diversity. Come grow your career in technology at Accenture!

The Performance Engineering practice within Accenture Technology is focused on optimizing the performance and scalability of enterprise applications through the combination of:

  • Testing: Analyzing, planning and executing production-like simulations across mobile and web solutions to identify & remediate performance problems, prevent production outages, and guarantee predictable performance.

  • Diagnostics & Monitoring: Instrumenting the complete application architecture to provide real user and system performance data to provide insight into the root cause of all application bottlenecks, enable real time visibility to reduce risk exposure.

  • Performance Analytics: Measuring the relationship between end-to-end performance, user behavior, and business goals to maximize the digital business, improve business KPIs, and increase client retention.

  • Business Optimization: Empowering digital businesses with contextual intelligence to visualize, quantify and maximize the business value of performance to improve the quality & performance of the business, increase customer satisfaction, and protect brand reputation.

As a Site Reliability Engineer, some of your key responsibilities may include:

  • Maintain responsibility for the design, deployment, and maintenance of production-scale systems.

  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.

  • Use automation to streamline the provisioning, management, and monitoring of applications and services using multiple scripting languages, Java, and infrastructure-as-code.

  • Facilitate blameless Incident Retrospectives to understand root causes, communicate learnings, determine remediation and make us better and closer as a team.

  • Coordinate with development and platform teams to design and implement zero-downtime deployment approaches, real-time logging, alerting, and monitoring solutions, and code instrumentation.

  • Introduce chaos engineering concepts that promote experimentation in production to identify systemic weaknesses while increasing service resiliency

  • Coordinate with the solution architect to design a highly available solution that meets availability and reliability objectives and to reduce manual activities using automation, when feasible.

  • Identifying, evaluating, and recommending monitoring tools and diagnostic techniques relevant to the application architecture. Assess gaps in as-is monitoring tool capabilities and recommend tools to augment or replace.

  • Instrumenting applications to enable performance diagnostics and monitoring

  • Collaborating with developers to promote the concept of reliability engineering during all phases of the SDLC to detect and correct performance issues earlier in the lifecycle

  • Monitoring application performance during performance tests or production usage through the use of APM and other monitoring tools to isolate the fault domain, dive deep into application code, and identify root cause of performance issues.

  • Interacting with client and/or Accenture development, operations, and infrastructure resources to recommend solutions to remediate performance issues

  • Participating in re-architecture, redesign, and refactoring decisions to satisfy performance requirements

  • Developing dashboards and reports to provide ongoing visibility into the performance of client applications

  • Contributing learnings and experiences to the Accenture Performance Engineering community

  • Requirement of 80-100% travel, typically M-TH

icon no score

See how you match
to the job

Find your dream job anywhere
with the LiveCareer app.
Mobile App Icon
Download the
LiveCareer app and find
your dream job anywhere
App Store Icon Google Play Icon
lc_ad

Boost your job search productivity with our
free Chrome Extension!

lc_apply_tool GET EXTENSION

Similar Jobs

Want to see jobs matched to your resume? Upload One Now! Remove
Site Reliability Engineer

Equifax

Posted 2 days ago

VIEW JOBS 6/5/2020 12:00:00 AM 2020-09-03T00:00 Do you want to work with the best people, using the best tools, working on the best projects, for the best customers? Site Reliability Engineering (SRE) is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles. SRE is also an engineering approach to building and running production systems - we engineer solutions to operational problems. As SREs are responsible for overall system operation, we use a breadth of tools and approaches to solve a broad set of problems. Practices such as limiting time spent on operational work, blameless postmortems, proactive identification, and prevention of potential outages. Our SRE culture of diversity, intellectual curiosity, problem solving and openness is key to its success. Equifax brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big, and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to build an environment that provides the support and mentorship needed to learn, grow and take pride in our work. What You'll Do * You will engage in and improve the software development lifecycle - from inception and design, through development, deployment, operation and refinement * You will influence and design infrastructure, architecture, standards and methods for large-scale systems * You will support services prior to production via infrastructure design, software platform development, load testing, capacity planning and launch reviews * You will maintain services during deployment and in production by measuring and monitoring key performance and service level indicators including availability, latency, and overall system health * You will automate system scalability and continually work to improve system resiliency, performance and efficiency * You will practice sustainable incident response as part of an on-call rotation and through blameless postmortems * You will remediate tasks within corrective action plan via sustainable, preventative, and automated measures whenever possible Must Haves * BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent job experience required * 2 to 5+ years of experience developing and/or administering software in public cloud * Experience in monitoring infrastructure and application uptime and availability to ensure functional and performance objectives. * Experience in languages such as Python, Ruby, Bash, Java, Go, Perl, JavaScript and/or node.js * Demonstrable cross-functional knowledge with systems, storage, networking, security and databases * System administration skills, including automation and orchestration of Linux/Windows using Chef, Puppet, Ansible, Salt Stack and/or containers (Docker, Kubernetes, etc.) * Proficiency with continuous integration and continuous delivery tooling and practices * Strong analytical and troubleshooting skills Extra Points For Any Of The Following: * You have expertise designing, analyzing and troubleshooting large-scale distributed systems. * You take a system problem-solving approach, coupled with strong communication skills and a sense of ownership and drive * You have experience managing Infrastructure as code via tools such as Terraform or CloudFormation * You are passionate for automation with a desire to eliminate toil whenever possible * You've built software or maintained systems in a highly secure, regulated or compliant industry * You thrive in and have experience and passion for working within a DevOps culture and as part of a team Why Equifax? Equifax is a global data, analytics, and technology company and believes knowledge drives progress. We are there at life-defining moments when people are applying for college, interviewing for a new job, buying a home, purchasing a car, or even starting a small business. The products and insights we provide help people all across the world make better decisions, and we are proud of the role we play in peoples' lives. We power the decisions that move people forward by helping people live their financial best. Won't you join us? Regardless of location or role, the individual and collective work of our people makes a difference in our business. We are looking for individuals who can help us disrupt the marketplace. You will do this by delivering leading-edge technology to build and deliver unparalleled customized insights that enrich both the performance of businesses and the lives of consumers. We will give you the opportunity to drive innovation and automation across the enterprise. This will include tool and process integrations across all business units within Equifax globally. And, if community involvement and social responsibility are important to you, join us and help create shared value in our global communities! Corporate Citizenship is an integral part of our company. We know through experience that community involvement will increase the quality of your career with us and simply makes the world a better place. If this sounds like somewhere you want to work, don't delay, apply today. We're looking for you! Success Attributes of the Technology organization. Does this describe you? * Accountability * Bravery * Curiosity * Collaboration * Think and act differently * Trust * Ownership (build it, own it, run it) * Decide-Execute-Ship We offer * Competitive pay for performance * 401k matching, along with the works: comprehensive healthcare packages, schedule flexibility, collaborative working spaces, work from home opportunities, paid time off, and organizational growth potential * Room for you to do things your way * Grow at your own pace through online courses at Learning @ Equifax * And yes, we also have perks such as team activities, volunteer opportunities, awesome parties, great coffee and in some locations, ping pong tables and food trucks * Not yet Cloud certified? We've got you covered with paid sponsorship, resources and training We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. * Please note: This posting represents multiple roles across various teams, including a range of responsibilities and experience levels.* By applying to this position your application is automatically submitted to the following locations: Example: Atlanta, Alpharetta, Dublin, Chile, etc. To speak to us about this role in more detail apply online.#L1-MD2 Primary Location: USA-St. Louis-2330 Ball Function: Function - Tech Engineering and Service Ops Schedule: Full time Equifax Saint Louis MO

Site Reliability Engineer

Solekai Systems Corp