By trade we are a technology company, but if you ask anyone that works here, they'll tell you we are a people company. As the industry leader in Accounts Payable (AP) Automation, AvidXchange strives to provide an innovative and collaborative work environment. We do that through focusing on our people, our culture, and ensuring we run our business in a way that enables every employee to achieve their fullest potential and help us create a world class company. Our employees live by our core values, including "Innovate to Change the Game," "Passion About Customer Success," "Win as a Team," and "Have a Blast." Whether you live in Charlotte and can enjoy our corporate campus at the AvidXchange Music Factory, or you live across the country, AvidXchange has locations waiting for you. We are on a mission to create something different at AvidXchange. Love where you work. Live Avidly.
The Principal Site Reliability Engineer is responsible for providing continuous feedback of site health, reliability, availability, and user experience for all AvidXchange core products.The Principal Site Reliability Engineer will also be the technical leader for helping transform and then maintain existing SaaS Operational processes and practices into those of true Site Reliability Engineering.Meaningful and relevant real-time measurements for production environments will be collected, aggregated, analyzed, and ultimately provided as a feedback loop to the business, including Software Engineering and Product, to provide insight and visibility into product performance and activity.The Principal Site Reliability Engineer will provide user experience analysis to internal business partners, executive leadership and product / software engineering teams to help drive changes to increase customer satisfaction, product availability and reliability.In addition to monitoring and insight, a heavy focus will be placed on automation opportunities and automating operational processes to maintain 99.9% availability of AvidXchange core products.
Define and execute strategy to transform existing SaaS Operational processes and practices into those of true Site Reliability Engineering (defining and implementing SLOs aligned to application domains, downtime budgets, error budgets, etc).This includes cross-functional, technical leadership to communicate and coordinate strategy across Operations and Software Engineering.Once implemented, tune and maintain the Site Reliability Engineering strategies and processes.
Define strategy and tools for measuring core product health in production (with opportunities to extend those capabilities all the way back through the entire DevOps pipeline)
Define strategy and methodology for calculating system availability SLAs across AvidXchange products
Define strategy for measuring and testing of site reliability using chaos-monkey based methodologies
Define tool consolidation strategy to optimize spend versus value for our end to end monitoring platform
Define strategy, standardize technologies, and establish patterns for rapid and continuous development and application of automated solutions to address reliability issues and automate manual tasks
Define strategy for the DevOps Principal of 'Feedback" by creating user experience measures for all AvidXchange products
Work with the Software DevOps team to define strategy for DevOps CICD continuous performance testing, monitoring, and reliability strategy using Visual Studio Team Services and other cloud-based tools
Work with the Software DevOps and Performance Engineering teams to define strategy for DevOps CICD performance and monitoring quality gates within the delivery pipeline
Define methodology to measure core product availability across Azure and AvidXchange Cloud using HTTP endpoint testing and synthetic user testing
Maintain automated site availability reporting and data platform
Present usability, reliability, incident, and user experience of AvidXchange products to executive leadership on a weekly basis
Define and report SLOs / SLAs for 99.9% availability to executive leadership and business partners
Influence product delivery teams to implement usability and reliability enhancements leading to improved user experience index scores and improved availability
Provide detailed analysis and troubleshooting for systems outages providing feedback to product / software engineering
Areas of Impact:
Work results influence all AvidXchange products over the next 1 to 5 years
Sets day-today objectives and delivers job responsibilities for self, ops teams, and product teams
A minimum of five (5) to eight (8) years of experience is typically required to perform at expectation.
Bachelor's degree in Computer Science or Information Technology is preferred
Relevant Certifications strongly preferred
6 years or more of Experience with Dynatrace AppMon, Dynatrace SaaS or competing products
Measure site availability using synthetic testing platforms such as Panopta or Gomez
Understanding of web hosting infrastructure and high availability architecture
Experience measuring and monitoring .NET applications, SQL Servers/Database, and Serverless cloud resources or equivalent Java-based experience
Execute queries on Microsoft SQL Server databases defined by existing standard operating procedures.
Using Advanced SQL Server 2014+ including stored procedures, indexes, and functions
Troubleshoot solutions with service oriented or micro service architectures
PowerShell or Linux scripting for creating automated routines for ensuring site availability
Development/coding experience and skills for writing custom automation solutions
Experience working in an Agile software development environment (Scrum / Kanban)
Knowledge and skills surrounding Public Cloud architectures (Azure experience highly desired)
Strong technical leadership and interpersonal skills.
Dependable, motivated and quick learner
Performs analysis of complex systems and presents findings
Defines strategies that impacts all AvidXchange products
Provides consultation to teams throughout AvidXchange
Able to rapidly comprehend the functions and capabilities of new technologies.
Works collaboratively and openly seeks and shares information across the enterprise.
Industry and Enterprise level thinking.
AvidXchange is an equal opportunity employer. AvidXchange is committed to equal employment opportunity in accordance with applicable federal, state and local laws. AvidXchange will not discriminate against
applicants for employment on any legally recognized basis. This includes, but is not limited to: veteran status, race, color, religion, sex, sexual orientation, gender identity, gender expression, national origin, age
and physical or mental disability.
Job FamilySoftware Engineering
1210 AvidXChange Lane, Charlotte, NC 28206, USA