Senior Devops/Site Reliability Engineer

Syrinx Boston , MA 02108

Posted Yesterday

We are hiring a Sr. Site Reliability Engineer who will work with the software engineers to build reliable, high capacity and high-performance infrastructure in support of our mission to reimagine learning for millions of students worldwide. If you know AWS services inside out, have solid networking experience, and you like engineering solutions to solve site reliability and operations problems, you will thrive in this position. The position will be located at our Boston, MA facility.

Essential Accountabilities:

  • Hands-on design, analysis and troubleshooting of highly-distributed large-scale production systems;
  • Ownership of reliability, uptime, capacity, and performance analysis thereof
  • Ensuring the repeatability, traceability, and transparency of our infrastructure automation
  • Identifying highest-impact opportunities to optimize existing systems
  • System design consulting for teams seeking to leverage or improve their production infrastructure
  • Anticipate, build and plan capacity for upcoming product/feature launches


Required Skills:

  • Mastery of AWS services (IAM, EC2, S3, EBS/EFS, ELB/ALB, AutoScaling, RDS and replication techniques, VPC, Subnets, Elastic IP, Route53, CloudWatch, CloudFront, Lambda, CloudFormation, ECS, SNS, ElastiCache);
  • Expertise in container/container-fleet-orchestration technologies (like Docker, Kubernetes, AWS ECS);
  • Expertise in designing and manage escalation response plans from monitoring, react, respond, remediate and retrospect in culturally aligned (proactive, customer focused, collaborative, data-driven and AUTOMATED) ways;
  • Mastery of infrastructure build and configuration automation technologies (like Terraform, Ansible, Puppet, CodeDeploy, Chef);
  • Strong skills in reading, understanding and writing code in at least two of: Javascript, Python, PHP, Go, or Ruby;
  • Strong network engineering skills;
  • Cloud and container native Linux administration/build/management skills (AWS AMIs, Packer, etc.);
  • Significant experience troubleshooting concurrent and distributed system interactions;
  • Expertise with continuous-deployment software development lifecycles in the Cloud (CI/CD);
  • Cloud database operations and deployment experience (RDS MySQL/Postgres/Aurora), caching operations & deployments (Memcache, Redis);
  • Expertise with Lean/Agile deployment processes (ZDT: Blue/Green, Canary, DNS strategies);
  • Familiarity with site and infrastructure monitoring systems (CloudWatch, Datadog, New Relic, Sumologic, Thousand Eyes);
  • Strong problem solving, root cause analysis and systems engineering skills;
  • Good presentation and communication skills;
  • Expertise with SDLC branching, SCM, and code deployment systems (Git/Gitflow, Jenkins, CircleCI, etc.);
  • BS Degree in Computer Science (or related technical field and/or equivalent industry experience).
icon no score

See how you match
to the job

Find your dream job anywhere
with the LiveCareer app.
Mobile App Icon
Download the
LiveCareer app and find
your dream job anywhere
App Store Icon Google Play Icon
lc_ad

Boost your job search productivity with our
free Chrome Extension!

lc_apply_tool GET EXTENSION

Similar Jobs

Want to see jobs matched to your resume? Upload One Now! Remove
Senior Site Reliability Engineer Devops

Draftkings

Posted 6 days ago

VIEW JOBS 12/2/2019 12:00:00 AM 2020-03-01T00:00 We're reimagining sports and technology. DraftKings is bringing sports fans closer to the games they love and becoming an essential part of their experience in the process. An industry pioneer since our founding in 2012, we believe we can continue to define what it means to be a technology company in sports entertainment. We love what we do and we think you will too. Building the possibilities. We're growing rapidly. As a Senior Site Reliability Engineer, you will help us continue running our applications smoothly as our business scales. DraftKings solves some of the most interesting challenges in the tech industry, and when you join our team, you'll have the opportunity to see your ideas and solutions directly impact our products. What you'll do as a Senior Site Reliability Engineer: * Create self-provisioning infrastructure using tools like Chef, Terraform, and Docker. * Define key metrics and SLAs around new web services being created to support our rapid traffic growth. * You will design and implement monitoring and alerting strategies to enforce application SLAs * Create platform-as-a-service environments where entire subsets of our architecture can be created and destroyed cleanly and reliably. * You will also foster a continuous deployment ecosystem that will allow DraftKings to operate at a massive scale. What skills you will use: * You will have 3+ years with cloud environments and provisioning automation * Deep understanding of common scripting languages (Ruby, Python, Bash). Powershell is a plus. * Experience working with at least one object-oriented language (Java, .C#, etc.) * Working knowledge of networking and web concepts and ability to debug issues down to the packets. * Experience with distributed systems and the challenges with operating them as they scale. Who are we a good fit for? We love working with talented people but more than that, we seek out compassionate co-workers with a collaborative spirit. Our work moves quickly and we're great at coming together to find creative solutions to some of tech's most interesting problems. If that sounds good to you, join us. Apply now As a technology company at our core, DraftKings believes that the best innovation comes from diverse perspectives, thoughts, beliefs, ideas, and experiences. We consistently push boundaries and challenge the conventional to ensure our culture and products reflect the expectations of our employees, and the customers we serve. We're proud to believe that your gender, race, nationality, religion, sexual orientation, status as a protected veteran, or status as an individual with a disability should have nothing to do with our hiring practices. We'll never discriminate against anyone's background or creed. If you're good at what you do, we want you to do it at DraftKings. Draftkings Boston MA

Senior Devops/Site Reliability Engineer

Syrinx