Senior Site Reliability Engineer

Apptio Bellevue , WA 98009

Posted 2 weeks ago


You are an engineer based in the US (West Coast) passionate about observability, automation, and reliability. Your team can count on you to deliver creative and inventive solutions to hard problems.

You are comfortable working with developers, senior leadership, and non-technical individuals to help provide value to the broader organization. You take opportunities to fix problems, mentor your peers, and step outside your comfort zone to develop your skillset.


The platform and site reliability engineering team at Apptio is responsible for building our Kubernetes platform in AWS and driving the adoption of SRE across our engineering teams.

Our SREs work closely with our product teams to raise the quality and reliability bar for our products and services and provide feedback and guidance for the adoption of cloud technologies in all stages of the development cycle.

In a typical day on this role, you will interact with Golang, Kubernetes, Puppet, GitlabCI, ArgoCD, Docker, Confluence, Jira, Slack, and AWS. If you don't know all these tools, don't worry, we are not expecting that you know them all. We understand that technology evolves quickly these days.

We are looking to hire someone with these skills

  • Experience in a large-scale, distributed Linux/Unix environment

  • Experience with any high-level programming languages (e.g., Golang, Ruby, Python)

  • Knowledge of configuration management tools (e.g., Puppet)

  • Knowledge of Infrastructure as code (e.g, Terraform, Cloudformation)

  • Familiarity with RESTful systems and their APIs

  • Familiarity with cloud providers such as AWS, Azure, or Google Cloud Platform

We'd be delighted to hire someone who:

  • Has experience collaborating with team members in a mostly-remote team

  • Has implemented workflows with tools like Kubernetes and Prometheus

  • Has worked as an SRE embedded in product teams

  • Mentoring peers and sharing skills

  • Great communication skills

  • Familiarity with observability

Apptio benefits include Company-Paid employee health, dental, vision, life, and disability insurance, and generous contributions to a health savings account. We also offer participation in a flexible spending account, 401k, and other voluntary programs.

The company

Apptio is the business management system of record for hybrid IT. We transform the way IT runs its business and makes decisions. With our cloud-based applications, IT leaders handle, plan, and optimize their technology investments across on-premises and cloud. With Apptio, IT leaders become strategic partners to the business by demonstrating the value of IT investments, accelerate innovation, and shift their technology investments from running the business to digital innovation. Hundreds of customers choose Apptio as their business system of record for hybrid IT. For more information, please visit

We are an equal opportunity employer and value diversity at our company. We do not discriminate by race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

icon no score

See how you match
to the job

Find your dream job anywhere
with the LiveCareer app.
Mobile App Icon
Download the
LiveCareer app and find
your dream job anywhere
App Store Icon Google Play Icon

Boost your job search productivity with our
free Chrome Extension!

lc_apply_tool GET EXTENSION

Similar Jobs

Want to see jobs matched to your resume? Upload One Now! Remove
Site Reliability Engineer Principal Networking Cloud Architect


Posted 2 weeks ago

VIEW JOBS 3/18/2020 12:00:00 AM 2020-06-16T00:00 At Okta our motto is "Always On", and nowhere do we embrace that more than in Technical Operations. We strive to build the most reliable and performant systems on the planet through the skillful use of automation. If you like to be challenged and enjoy seeing your networking designs run at scale with automation, testing and tuning then we would love to hear from you. The ideal candidate is someone who exemplifies the ethics of, "If you have to do something more than once, automate it," and who can rapidly self-educate on new concepts and tools. For this role we are looking for an experienced Network Architect who has a passion for designing complex cloud-based network infrastructure in AWS that can scale and grow with the company. You will work on: * Designing and building Okta's production infrastructure with a focus on networking and security at scale * Promoting and applying best practices for building scalable and reliable network services across engineering * As a subject matter expert work with our team at Amazon Web Services * Developing and maintaining technical documentation, network diagrams, runbooks, and procedures * Supporting a 24x7 online environment as part of an on-call rotation You are an ideal candidate if you: * Have experience automating and deploying large scale network production services in AWS (VPC, VPC peering, ALB/NLB, EC2, IAM, KMS) or similar experience in GCP or Azure * Deep knowledge in network design, cloud based static and dynamic routing including BGP, packet capture analysis, anycast/unicast, load balancers and session management * Have strong Linux fundamentals * Exposure to FedRAMP, SOC2 or other compliance programs * Prefer scripting for operational tooling in Bash, Ruby, Python, Go or similar Education and Training: * BS. Computer Science (plus) or relevant experience Okta is an equal opportunity employer #LI-SM1 Okta Bellevue WA

Senior Site Reliability Engineer