Senior Site Reliability Engineer

Karros Technologies LLC, Austin, TX 78704

Posted 2 weeks ago

Karros Technologies gets students to school safely and on time.

Karros Technologies tackles three important problems to help public and private schools transport their students to and from class. Route optimization allows transportation departments to meet continually growing transportation needs with diminishing resources. GPS tracking and predictive machine learning reconcile planned routes with data pulled live from each district's fleet of buses. Scalable distributed software engineered in the cloud allows Karros to provide vital services to public school districts and their transportation departments at reasonable cost.

Karros Technologies builds on four decades of transportation experience by applying state-of-the-art technology and modern software development processes to the design and development of new and existing products. The result is reliable, real-time delivery of transportation information directly into the hands of students and parents.

Your day-to-day:
  • Implement automation and industry best practices to run our large-scale, rapidly growing infrastructure with minimum human intervention.
  • Design and implement monitoring and recovery tools that help us meet performance and availability SLAs.
  • Research, design, and implement solutions for fault tolerance, monitoring, performance enhancements, capacity optimization, disaster recovery, and configuration management of systems and applications.
  • Evaluate and implement third-party platforms as core elements of our own solution, e.g., streaming platforms and ETL platforms.
  • Configure and build tools for Continuous Integration and Continuous Deployment for our microservices.
  • Set standards with the development team for how code should be optimally structured for easy deployment.
  • Recommend new technologies to ensure quality and productivity.

Technical stack & patterns:
  • Kafka & Kafka Streams for high-performance and real-time processing;
  • NiFi for data pipelining, tooling, and ETL;
  • Java Spring Boot for distributed microservices;
  • ElasticSearch for persistence;
  • Event sourcing & command sourcing;
  • and the following: Linux, Terraform, AWS, Kubernetes, Jenkins.

Requirements

What you bring to the team:

  • Experience architecting container-based microservice platforms using Java;
  • Experience with ElasticSearch or similar DB;
  • Experience writing scalable, high-performance, instrumented, and clean code;
  • Good understanding of Amazon Web Services including ECS, CloudFormation, IAM, RDS, etc.;
  • Experience working on teams with heavy emphasis on DevOps, Automation, CI/CD, and Quality;
  • Excellent written and verbal communication skills;
  • An ability to work with a minimum of supervision while collaborating with colleagues in multiple departments and time zones;
  • Experience working in an Agile development environment;
  • Bachelor's degree or equivalent industry experience, and 6+ years of professional experience as a software test engineer, system programmer, or software developer.

Please include a LinkedIn or GitHub link.

This is not considered a remote position. Attending meetings at the office a minimum of 2-3 days every two weeks is required.

Benefits

  • Competitive health care plan (medical, dental, and vision);
  • Matching 401(k) contributions;
  • Flexible work-from-home policy -- however, you are required to meet in person for some Scrum rituals;
  • Flexible work environment that encourages personal and career growth;
  • Training and convention opportunities to help expand your skill set.

Karros Technologies LLC is an equal opportunity employer.

