Site Reliability Engineer

Solekai Systems Corp Dallas , TX 75201

Posted 2 months ago

Are you ready to step up to the New and take your technology expertise to the next level?

Join Accenture and help transform leading organizations and communities around the world. The sheer scale of our capabilities and client engagements and the way we collaborate, operate and deliver value provides an unparalleled opportunity to grow and advance. Choose Accenture and make delivering innovative work part of your extraordinary career.

People in our Client Delivery & Operations career track drive delivery and capability excellence through the design, development and/or delivery of a solution, service, capability or offering. They grow into delivery-focused roles, and can progress within their current role, laterally or upward.

As part of our practice, you will lead technology innovation for our clients through robust delivery of world-class solutions. You will build better software better! There will never be a typical day and that's why people love it here. The opportunities to make a difference within exciting client initiatives are unlimited in the ever-changing technology landscape. You will be part of a growing network of technology experts who are highly collaborative taking on today's biggest, most complex business challenges. We will nurture your talent in an inclusive culture that values diversity. Come grow your career in technology at Accenture!

The Performance Engineering practice within Accenture Technology is focused on optimizing the performance and scalability of enterprise applications through the combination of:

  • Testing: Analyzing, planning and executing production-like simulations across mobile and web solutions to identify & remediate performance problems, prevent production outages, and guarantee predictable performance.

  • Diagnostics & Monitoring: Instrumenting the complete application architecture to provide real user and system performance data to provide insight into the root cause of all application bottlenecks, enable real time visibility to reduce risk exposure.

  • Performance Analytics: Measuring the relationship between end-to-end performance, user behavior, and business goals to maximize the digital business, improve business KPIs, and increase client retention.

  • Business Optimization: Empowering digital businesses with contextual intelligence to visualize, quantify and maximize the business value of performance to improve the quality & performance of the business, increase customer satisfaction, and protect brand reputation.

As a Site Reliability Engineer, some of your key responsibilities may include:

  • Maintain responsibility for the design, deployment, and maintenance of production-scale systems.

  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.

  • Use automation to streamline the provisioning, management, and monitoring of applications and services using multiple scripting languages, Java, and infrastructure-as-code.

  • Facilitate blameless Incident Retrospectives to understand root causes, communicate learnings, determine remediation and make us better and closer as a team.

  • Coordinate with development and platform teams to design and implement zero-downtime deployment approaches, real-time logging, alerting, and monitoring solutions, and code instrumentation.

  • Introduce chaos engineering concepts that promote experimentation in production to identify systemic weaknesses while increasing service resiliency

  • Coordinate with the solution architect to design a highly available solution that meets availability and reliability objectives and to reduce manual activities using automation, when feasible.

  • Identifying, evaluating, and recommending monitoring tools and diagnostic techniques relevant to the application architecture. Assess gaps in as-is monitoring tool capabilities and recommend tools to augment or replace.

  • Instrumenting applications to enable performance diagnostics and monitoring

  • Collaborating with developers to promote the concept of reliability engineering during all phases of the SDLC to detect and correct performance issues earlier in the lifecycle

  • Monitoring application performance during performance tests or production usage through the use of APM and other monitoring tools to isolate the fault domain, dive deep into application code, and identify root cause of performance issues.

  • Interacting with client and/or Accenture development, operations, and infrastructure resources to recommend solutions to remediate performance issues

  • Participating in re-architecture, redesign, and refactoring decisions to satisfy performance requirements

  • Developing dashboards and reports to provide ongoing visibility into the performance of client applications

  • Contributing learnings and experiences to the Accenture Performance Engineering community

  • Requirement of 80-100% travel, typically M-TH

icon no score

See how you match
to the job

Find your dream job anywhere
with the LiveCareer app.
Mobile App Icon
Download the
LiveCareer app and find
your dream job anywhere
App Store Icon Google Play Icon

Boost your job search productivity with our
free Chrome Extension!

lc_apply_tool GET EXTENSION

Similar Jobs

Want to see jobs matched to your resume? Upload One Now! Remove
Site Reliability Engineer

Iron Mountain Incorporated

Posted 2 months ago

VIEW JOBS 4/10/2020 12:00:00 AM 2020-07-09T00:00 At Iron Mountain we protect what our customers value most, from the everyday to the extraordinary, while helping them bridge the physical and digital world. Our people have the opportunity to bring their creativity to a workplace that thrives on change. Here, you will be part of a team that doesn't just embrace what's exceptional. It creates exceptional. As Iron Mountain continues its digital transformation, we are forming a new site reliability team, directly supporting a specific customer's digital imaging solutions, while maintaining close ties to similar teams within the Enterprise Information Technology organization. This team is responsible for availability, latency, performance, efficiency, change management, monitoring, security, emergency response, and capacity planning. The Iron Mountain site reliability engineers create a bridge between development and operations by applying a software engineering mindset to system administration topics. The ideal candidate for this role is a software and systems savvy engineer that is comfortable with designing, building, and integrating solutions across technical and business capability domains with cost and strategic implications. Solutions may consist of proven or unproven technologies or multiple implementation technologies at once within domains that experience rapid change. Our _Site Reliability Engineer_ would need advanced knowledge of infrastructure, scripting skills, and engineering disciplines. This role requires equal parts development and operations with a software engineer mindset. Using technical and operational skills, this role will increase application reliability at scale. What you will do.... + Build software to help operations and support teams + Write clean, high-performance, and well tested, infrastructure code with a focus on reusability and automation + Develop monitoring, define SLAs, SLOs and error budgets for mission critical platforms while helping to coordinate product launches and reliability exercises + Collaborate and contribute with other enterprise teams on Iron Mountain's digital transformation journey, including the impact on infrastructure, networks and security + Work closely with Architects and provide support to senior staff, ensuring designs align with the technological and business directions across the company + Support IT deployments with involvement Platform as a Service (PaaS), Software as a Service (SaaS), or Infrastructure as a Service (IaaS) + Manage central platforms as a service for growth and scale + Implement enhancements to the company's digital and data infrastructure, supporting internal customer's operational needs. + Take part in on-call rotations + Document previous "tribal knowledge" and eliminate tech debt What you bring to Iron Mountain... + Bachelor's Degree or equivalent and 3+ years of relevant work experience + 1 - 2 years with provisioning automation + Experience with Operating System, troubleshooting and coding/scripting using high-level languages + Experience with infrastructure systems that support enterprise data science and analytics capabilities, including streaming and real-time analytics + Deep understanding of common scripting languages (Powershell, Python, Bash, Go). + Experience working with virtualization platforms (VMware, Nutanix, GCP, Azure, AWS) + Experience with current pipeline tools (Terraform, Ansible, Jenkins, Packer, Git) + Involvement in some on-premise to cloud migration or Application Modernization efforts + Experience managing a full application stack with high availability requirements + Full Stack troubleshooting experience including networking, operating system (Windows Server, RHEL/CentOS), HA Proxy, Nginx, RDBMS is preferred + Experience leveraging monitoring tools to meet contracted SLAs with SLO and SLI responsibilities. + Strong written and verbal communication skills + Able to thrive in a collaborative and cross-functional environment Category: Information Technology (IT) Iron Mountain Incorporated, founded in 1951, is the global leader for storage and information management services. Trusted by more than 225,000 organizations around the world in approximately 50 countries, Iron Mountain stores and protects billions of valued assets, including critical business information, highly sensitive data, and cultural and historical artifacts. Providing solutions that include information management, digital transformation, secure storage, secure destruction, as well as data centers, cloud services, and art storage and logistics, Iron Mountain helps customers lower cost and risk, comply with regulations, recover from disaster, and enable a digital way of working. Our Cores Value at s at and Code of Ethics are our north star. They provide a solid base for how we do business and behave every day, so each one of us can experience exceptional. If you have a physical or mental disability that requires special accommodations, please let us know by sending an email to . See the Supplement to learn more about Equal Employment Opportunity. Requisition: J0019244 Iron Mountain Incorporated Dallas TX

Site Reliability Engineer

Solekai Systems Corp