Systems Engineer, Site Reliability Engineering

Expired Job

BHO Tech, Palo Alto, CA 94301

Posted 2 months ago

Occupations: IT-Software Development:
Computer-Network Security, Database Development-Administration, Desktop Service and Support, Enterprise Software Implementation & Consulting, General-Other: IT-Software Development, IT Project Management, Network and Server Administration, Software-System Architecture

Job Description:
Site Reliability Engineering (SRE) is what you get when you treat operations as if it's a software problem.
Our mission is to progress, protect, and provide for the software and systems behind all of Google's public services - Search, Ads, Gmail, Android, YouTube, and AppEngine, to name just a few - with an ever-watchful eye on their availability, latency, performance, and capacity.
This is an unusual job, unlike others in the industry.
Like traditional operations groups, we keep important, revenue-critical systems up and running despite hurricanes, bandwidth outages, and configuration problems. Unlike traditional operations groups, we also have full access to the code and the authority to fix, extend, and scale it to keep it working and harden it against all the vagaries of the Internet.
We hire people from both systems and software backgrounds.
Strong candidates will have experience with both.

Job Requirements:
Responsibilities:
Design, write, and deliver software to improve the availability, scalability, latency, and efficiency of Google's services.
Solve problems relating to mission-critical services and build automation to prevent problem recurrence, with the goal of automating the response to all non-exceptional service conditions.
Influence and create new designs, architectures, standards, and methods for large-scale distributed systems.
Engage in service capacity planning and demand forecasting, software performance analysis, and system tuning.
Conduct periodic on-call duties using a follow-the-sun model.

Minimum qualifications:
BS degree in Computer Science or a related technical field, or equivalent practical experience.
Experience in one or more of: C, C++, Java, Perl, Python, Go, or scripting experience in Shell and Perl.
Experience working with Unix/Linux systems from kernel to shell and beyond, with experience working with system libraries, file systems, and client-server protocols.
Networking: experience with network theory, e.g. TCP/IP, UDP, ICMP, MAC addresses, IP packets, DNS, OSI layers, and load balancing.
Kris Young, Account Director, BHO Tech, San Jose, San Francisco CA. Phone: x 823

Similar Jobs

Site Reliability Engineer (SRE)


Posted 4 days ago

Job Description:
Cloudera is looking for an accomplished Site Reliability Engineer (SRE) to play a key role in advancing Cloudera's product offerings in the cloud. In this role, you will be at the intersection of two white-hot areas in today's technical landscape: the Cloud and Big Data. Over the past few years, Cloudera has experienced tremendous growth, making us the leading contributor to the Hadoop ecosystem and a leading provider of enterprise solutions for Big Data. The purpose of this team is to accelerate Cloudera's next stage of growth by enabling our customers to unlock the full potential of the cloud and Hadoop. On this team, you will be immersed in many exciting, innovative technologies and projects that will be critical to our customers' data management needs in the cloud.

Responsibilities
* Track our cloud customer SLAs and be on-call to ensure total conformity to these customer commitments.
* Create and maintain complete and accurate documentation for the purpose of operational audits including security and compliance.
* Continuously review and enhance processes and operating procedures needed to maintain the most cost effective enterprise-grade cloud infrastructure.
* Innovate and automate improvements to our Cloud Operations.
* Identify and promote best practices and patterns for the setup, configuration and management including databases, servers, networking and storage systems.

Minimum Requirements
* Experience building a successful SaaS offering from scratch - influencing technology decisions, building processes and best practices.
* 5+ years industry experience in a DevOps, Site Reliability Engineering or Software Engineering role.
* Experience programming with Java, C++ or Python
* Experience supporting production SaaS and adhering to other key metrics such as reliability and high availability.
* Experience with performance analysis, troubleshooting, tuning, and capacity planning.
* Experience with automating deployment of software to production instances and owning software releases.
* Participated in an on-call rotation to help ensure services stay up and running.
* Strong Linux and systems experience.
* Experience with cloud technologies such as Amazon Web Services (AWS), Microsoft Azure, or Google Cloud Platform.
* Experience with Jenkins and continuous integration.
* Experience with modern configuration management frameworks like Puppet, Chef, Ansible, or Saltstack.
* B.S. or M.S. Computer Science or equivalent experience.

Pluses
* Experience with Spinnaker, Rundeck, or other orchestration tools.

Why Cloudera?
* Joining Cloudera is a fantastic opportunity to work with some of the best engineers in the industry who are tackling challenges that will continue to shape the Big Data revolution. We foster an engaging, supportive and productive work environment where you can do your best work. The team culture values engineering excellence, technical depth, grassroots innovation, teamwork and collaboration. We welcome "The Startup Spark", a desire to create new things, dive in wherever there's a need, and learn new things.
* Amazing people - We are a fun and smart team, including many of the top luminaries in Hadoop and related open source communities. We frequently interact with the research community, collaborate with engineers at other top companies and host cutting edge researchers for tech talks.
* Innovative work - Cloudera pushes the frontier of big data and distributed computing, as our track record shows. We test and deploy our code on clusters with hundreds of nodes, terabytes of RAM, and petabytes of storage. We work on high-profile open source projects, interacting daily with engineers at other exciting companies, speaking at meet-ups, etc.
* Great culture - Transparent and open meritocracy. Everybody is always thinking of better ways to do things, and coming up with ideas that make a difference. We build our culture to be the best workplace in our careers.

Cloudera, Palo Alto, CA
