Site Reliability Engineer, Govcloud (Senior/Staff/Principal)

Okta Bellevue , WA 98009

Posted 5 months ago

About Okta

At Okta our motto is "Always On", and nowhere do we embrace that more than in Technical Operations. We strive to build the most reliable and performant systems on the planet through the skillful use of automation. We've created an integrated system that securely connects any person via any device to the technologies they need to do their most significant work.

Job Overview

If you like to be challenged and have a passion for solving problems at scale with automation, testing and tuning, then we would love to hear from you. The ideal candidate is someone who exemplifies the ethics of, "If you have to do something more than once, automate it," and who can rapidly self-educate on new concepts and tools.


  • Design, build, and monitor Okta's production infrastructure

  • Respond to production incidents and determine a preventive solution

  • Troubleshoot complex reliability and performance issues

  • Automate manual processes, evolve our monitoring tools, and develop technical documentation

  • Support a highly available online environment as part of an on-call rotation once per quarter

Qualifications & Requirements

  • Due to federal data handling requirements, candidate must be a US Citizen

  • Computer Science (plus) or relevant experience

  • Background with Linux systems administration and strong scripting skills in Bash, Ruby, Python, Go, etc.

  • Experience supporting Docker containers and web applications running on Java / Apache / Tomcat in a live production environment

  • Strong expertise with production services in AWS such as EC2, ECS, KMS, Kinesis, CloudWatch

  • Previous experience with automating systems and infrastructure via Ansible, Chef or Terraform

  • Solid understanding of networking concepts and IP protocols

  • Background using and supporting Splunk, Zabbix, or related tools

  • Experience working in a source controlled environment with Relational Databases, such as MySQL

  • Knowledge of NoSQL systems such as Redis, Cassandra is desired

Our Culture

Okta is an active, vibrant place that rewards creativity and unconventional thinking. We know that forging new connections between people and technology is no small feat, so we stick together. We work hard and challenge each other. We offer excellent benefits, competitive compensation, career growth opportunities, flexible time-off, catered lunches / free snacks, and much more!

  • We believe that work is a never-ending process of learning and iteration.

  • We work on extremely complex problems.

  • Your colleagues will be really smart (and cool to hang out with).

  • We work on products that make millions of people's work lives better.

  • We're funded by the industry's most respected investors.

  • You'll have the opportunity to change technology forever.

Okta is an Equal Opportunity Employer.


icon no score

See how you match
to the job

Find your dream job anywhere
with the LiveCareer app.
Mobile App Icon
Download the
LiveCareer app and find
your dream job anywhere
App Store Icon Google Play Icon

Boost your job search productivity with our
free Chrome Extension!

lc_apply_tool GET EXTENSION

Similar Jobs

Want to see jobs matched to your resume? Upload One Now! Remove
Senior Principal Engineer Network Architecture

Amazon.Com, Inc.

Posted 2 days ago

VIEW JOBS 12/10/2019 12:00:00 AM 2020-03-09T00:00 Project Kuiper is an initiative to launch a constellation of Low Earth Orbit satellites that will provide low-latency, high-speed broadband connectivity to unserved and underserved communities around the world. Come work at Amazon! The Role: Help architect and design network solutions from concepts to working solutions for customers around the world. You will work with HW/SW/FW teams to implement a new advanced broadband wireless networking solution for Low Earth Orbit satellites. Our engineers anticipate high-stake challenges, as well as design and architect networks that are not allowed to fail and must be able to scale. To continue successful growth, we need principal architects who can push systems to their limits: * Design network topologies, architectures, and services that solve for new requirements of constantly changing network routing, SDN networks, and challenging requirements of space-based solutions * Design ahead of current customers' or technology needs, predict and solve for future problems * Architect and design High Level Design (HLD) * Influence technology roadmaps, construct and test his/her solutions If you want to become a member of a team that makes strategic decisions daily with a great customer-centric impact, this is your opportunity. Export Control Requirement: Due to applicable export control laws and regulations, candidates must be a U.S. citizen or national, U.S. permanent resident (i.e., current Green Card holder), or lawfully admitted into the U.S. as a refugee or granted asylum. Amazon.Com, Inc. Bellevue WA

Site Reliability Engineer, Govcloud (Senior/Staff/Principal)