Principal Site Reliability Engineer

Veracode, Burlington, MA 01803

Posted 3 weeks ago

Veracode is seeking an enthusiastic, motivated engineer with deep AWS knowledge and the ability to keep up with a high-performing team. This is a chance to be on the leading edge of our evolution to the cloud in a fast-paced environment.

As a member of our SRE team, you will migrate existing applications into AWS while seeking efficiencies in deployment, monitoring, and cost. You will also serve as a subject matter expert and consultant to other teams developing applications in the cloud.

The ideal candidate will be a self-starter who can work with minimal supervision, covering a wide range of duties in a high-energy DevOps environment.

  • BS in Computer Science or equivalent work experience

  • 3+ years' experience architecting and automating AWS infrastructure

  • 3+ years' experience automating deployments in AWS

  • Experience in an Agile environment

  • Strong written and verbal communication

  • Experience with logging and monitoring tools like Sumo Logic, Splunk, or ELK preferred

  • Experience with Docker and ECS or Kubernetes preferred

  • Experience with infrastructure-as-code tools like Terraform or CloudFormation preferred

  • Experience with configuration management tools like Ansible or Puppet preferred

  • Experience with metrics collection and aggregation solutions preferred

  • Proficient in one or more programming languages, preferably Python, and in contemporary engineering practices

  • Working knowledge of software-defined networking in the cloud and its automation

      o Routing

      o Direct Connect

  • Influence, design and create new architectures, standards and methods for enterprise systems

  • Collaborate with, learn from, and mentor teammates

Similar Jobs

Site Reliability Engineer

Nuance Communications, Inc

Posted 1 week ago

Company Overview

Nuance Communications, Inc. is the pioneer and leader in conversational AI innovations that bring intelligence to everyday work and life. The company delivers solutions that understand, analyze and respond to human language, amplifying human intelligence. With decades of domain and artificial intelligence expertise, Nuance works with thousands of organizations – in healthcare, telecommunications, automotive, financial services, retail, and more – to create stronger relationships and better experiences for their customers.

Join our Healthcare team...caring for clinicians the way they care for patients. Beyond words. We create technology that lets clinicians capture and document care quickly and easily so they can focus their attention on their patients.

Job Summary

Nuance Communications is looking for a talented individual with a proven track record to take up an exciting role as a Site Reliability Engineer within its HealthCare HIM Research and Development Unit. Our customers count on our applications and services to be fast, reliable and secure. The candidate will work with a talented group of cross-functional individuals to plan, design and build various tools, systems and infrastructure that enable continuous integration, testing, monitoring, elasticity and delivery of products and solutions on our hosted cloud platform.

Responsibilities:

  • Use process and best practices to ensure the platform and applications are stable and performant.

  • Keep the customer-facing applications and services always available.

  • Proactively identify hurdles to stability and implement self-healing and resiliency initiatives.

  • Build and maintain tools that will help with day-to-day activities and orchestration of our cloud environments.

  • Work to automate detection and resolution of recurring issues in the production environment.

  • Participate in the Incident and Problem Management processes and assist the teams in ensuring proper RCAs are documented and follow-ups are delivered.

  • Communicate with software engineers, QA engineers, product management and operations staff on a daily basis, sharing ideas, status on ongoing work, and prioritizing future work.

  • Implement Infrastructure as a service and Infrastructure as code practices wherever applicable.

  • Stay up-to-date on relevant technologies internally as well as externally, plug into user groups, understand trends and opportunities to ensure we are using the best possible techniques and tools.

  • Perform tasks related to securing the products, tools, and processes that you are responsible for, and keeping them secure.

Qualifications

Number of Years of Work Experience: 2

Required Skills:

  • Experience working as a Site Reliability Engineer or a similar role operating a highly scalable and distributed platform.

  • Experience with operating large-scale production systems in the Cloud (AWS, GCP, Azure).

  • Experience with monitoring tools like Nagios, New Relic, Prometheus etc.

  • Experience working in a Linux and Windows environment.

  • Experience with scripting languages like Python, Perl, PowerShell etc.

  • Experience with SQL or equivalent language.

  • Experience using source control systems such as Git.

  • Experience with configuration management systems like Chef, Puppet, Ansible or Salt etc.

  • Experience working in an Agile environment.

Preferred Skills:

  • Must be action oriented, capable of multitasking well based on priorities.

  • Ability to build, use and configure metrics collection, reporting and alerting systems.

  • Experience developing, deploying and integrating monitoring solutions.

  • Experience with immutable infrastructure.

  • Experience with containerization, Docker, Kubernetes, Mesos.

  • Experience with continuous integration systems like Jenkins and Azure DevOps.

  • Familiarity with and enthusiasm for software engineering best practices such as testing, continuous integration and continuous delivery.

  • Strong understanding of cloud concepts and Infrastructure as Code.

  • Excellent verbal and written communication and interpersonal skills.

  • Experience with the Atlassian Tools such as JIRA/Confluence.

Education: B.S. in Electrical or Computer Engineering, Computer Science or relevant work experience

Additional Information

Nuance offers a compelling and rewarding work environment. We offer market competitive salaries, bonus, equity, benefits, meaningful growth and development opportunities and a casual yet technically challenging work environment. Join our dynamic, entrepreneurial team and become part of our continuing success.

Nuance Communications, Inc. is an equal opportunity employer. We evaluate qualified applicants without regard to race, age, color, religion, sex, national origin, disability, veteran status, gender identity, sexual orientation and other legally protected characteristics. The EEO is the Law poster and its supplement is available here. If you need a reasonable accommodation because of a disability for any part of the employment process, please call 781-565-5086 – Human Resources Department and let us know the nature of your request and your contact information.

Nuance Communications, Inc., Burlington, MA
