We are leading with bold innovations by embracing risk with a design-driven and iterative approach. We are transforming our IT Operations on a global basis and looking for someone with a strong scripting, operations, and software development background. Unlike anywhere else in the industry, we are creating roles and teams that combine deep software knowledge with operations to drive unmatched service reliability.
Our mission is to deliver services that matter and achieve and sustain operational excellence. You will be at the heart of fullfilling our mission by bringing your DevOps experience to the table to own and help our vision of engineering reliability end to end. You will design and implement continuous improvement of the management, design, and function of our operational environments to achieve speed and reliability to enable business agility and happy users.
You will be part of our technology organization and have a great opportunity to work across various parts of Deloitte, including our development teams and other stakeholders to drive reliabilty upstream in the application lifecycle and across our operational environments.
Technical expertise is critical in order to imagine and drive technical improvements across our database, networking, and infrastructure teams, and to partner with our application teams, implementing more robust and performant applications for our internal solutions and business solutions (Tax, Audit, Consulting, Finance and Advisory Services).
You should be someone excited with the challenge of bringing new thinking to operations and is passionate about imaginging and implementing improvements and relentlessly pursues excellence, is a deep and broad technical expert, and can build trusting relationships across teams.
It's a new and exciting role to drive our organization further in world class operations.
Role Specific Responsibilities
Ensure user visible uptime and quality, providing operational and development expertise in making our systems fail rarely, and automatically fix when they do fail
Minimize manual involvement by designing & implementing continuous improvements that create an operating environment, including the development of new tools, dynamically monitoring, alerting, & automated self-healing & recovery
Participate in architecture and design reviews to provide recommended improvements to the development teams to improve the reliability and performance of applications
Identify and/or analyze problems relating to mission critical services and implement automation to prevent problem recurrence; with the goal of automating response to all non-exceptional service conditions.
Engage in application performance analysis and system tuning, and capacity planning
Perform root cause analysis to identify & implement continuous improvements
Capable of presenting analyses and recommendations to leadership or discussing the technical merits of solutions with engineers and architects.
Own the day-to-day health, uptime, monitoring, and reliability of services and server infrastructure
Practice Agile and Scrum methodologies
Education: Bachelor's Degree, preferably in engineering or related
Years of Experience: 7+
Strong experience with SharePoint platform operations and/or software engineering
Strong experience with PowerShell scripting
Strong experience with TFS 2010+, VSTS, or similar ALM tool
Experience with Git or similar development repo
Working knowledge of Office 365 and Azure, particularly SharePoint Online and Azure Storage
Experience developing in a software development language (e.g., preferably C#/C++)
Experience and knowledge of database technologies, particularly MS SQL
Knowledge of virtualization and its benefits for improving reliability
Strong experience with instrumentation, monitoring, alerting, and responding relative to performance and availability of applications
Capable of technical deep dives into infrastructure, databases, and application, specifically in operating and supporting high-performance, highly available services and infrastructure
Experience in designing for failure, including disaster recovery and business continuity planning
Experience operating and supporting mission-critical applications (e.g. incident and outage management)
Passionate for making things better and driving action with a sense of urgency
Creative and innovative, taking a thorough, design-driven approach
Experience problem solving issues on globally distributed systems and critical product service environments
Knows what is possible using latest networking, infrastructure, database, and application technologies to driving automation and reliability improvements
Brings new thinking to challenge existing technology and processes
Excellent at building relationships across teams
Firm sense of accountability and ownership
Desire to understand our businesses and users
Understanding of the concepts and principles behind DevOps, SRE, CICD, Agile, Lean, etc.
Use of DevOps tools to deliver and operate end-user services a plus (e.g., Chef, New Relic, Puppet, etc.)
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, or protected veteran status, or any other legally protected basis, in accordance with applicable law.
Disclaimer: Nothing in this job description/posting shall constitute an offer or promise of employment. If you are not reviewing this job posting on our Careers' site (jobs2.deloitte.com) or one of our approved job boards we cannot guarantee the validity of this posting. For a list of our current postings, please visit us at jobs2.deloitte.com
Requisition code: DE19USAGTS004AJ1204
Deloitte & Touche L.L.P.