Site Reliability Engineer- Compute Platform

Bloomberg New York , NY 10007

Posted 3 months ago

A Service Reliability Engineer (SRE) at Bloomberg is a hybrid of systems and software engineering who is trusted to improve the stability and availability of the production environment through automation. They are responsible for monitoring, provisioning / configuration / orchestration, capacity management, deployment and rollback, incident management, and SDLC practices.

The Compute Platform team is responsible for providing the bare metal infrastructure on which all of Bloomberg's applications and services reside. Our team is trusted to engineer a hardware platform which maximizes server performance on a standardized hardware configuration. We are also entrusted to architect the platform for tomorrow by partnering with industry leading vendors and thoroughly evaluating leading hardware for inclusion in Bloomberg's compute infrastructure. As a Compute Platform SRE you will solve challenging technology problems by building architecturally sound, high-quality platforms that enable Bloomberg to exceed critical business objectives.

What's in it for you?

You'll work with modern open-source tooling while maintaining mission-critical systems hosting a wide array of applications. We'll depend on you to advise on design, architecture, and scaling of Compute Platform Specifications for a wide array of internal customers and infrastructure platforms. In addition, you'll play a critical role in improving the stability of existing hardware platforms to ensure quality, stability, and scalability of Bloomberg's applications and services.

You'll Need to Have

  • Demonstrated experience programming and testing Python, Ruby, Go, or C/C++

  • Experience working in a 24/7 production engineering organization

  • Ability to listen, communicate, evaluate, problem solve, multi-task, and prioritize in a high-pressure, mission-critical, and rewarding team environment.

We'd Love to see

  • Deep expertise troubleshooting complex distributed systems

  • Experience with creating and improving documented procedures and/or playbooks

  • Working knowledge of Chef, Puppet, Ansible, or Salt

  • Familiarity with open source configuration, orchestration, and CI/CD tools

  • Deep understanding of TCP/IP and Unix networking

  • Knowledge of Linux or Windows internals

If this sounds like something you would be passionate about apply! We'll get in touch with you to let you know what the next steps are.

Bloomberg is an equal opportunities employer, and we value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.


icon no score

See how you match
to the job

Find your dream job anywhere
with the LiveCareer app.
Mobile App Icon
Download the
LiveCareer app and find
your dream job anywhere
App Store Icon Google Play Icon
lc_ad

Boost your job search productivity with our
free Chrome Extension!

lc_apply_tool GET EXTENSION

Similar Jobs

Want to see jobs matched to your resume? Upload One Now! Remove
Senior Site Reliability Engineer

Peloton Cycle

Posted Yesterday

VIEW JOBS 1/28/2020 12:00:00 AM 2020-04-27T00:00 Peloton is looking for a Senior Site Reliability Engineer to work with teams across the organization to build and maintain monitorable, performant, reliable and highly-scalable software systems. We are a small, fast-paced, growing team of engineers tackling challenging problems at scale and headquartered in a brand new headquarters in the heart of Manhattan. Software and systems engineers with interest and/or experience in system automation are encouraged to apply for this position. THE ROLE: * Evangelize best practices for building and operating highly reliable systems * Serve as subject matter expert in observability and monitoring * Consult in system design to meet reliability and capacity requirements * Automate infrastructure and configuration management * Conduct timely post-mortems of production infrastructure incidents * Assist with all aspects of operational security and compliance * Seek out potential threats to security and reliability and advocate solutions * Participate in an on-call rotation to receive escalations * We work with Amazon Web Services, Chef, Python, Ubuntu, Nginx, Jenkins, Terraform, Akamai, Elemental CANDIDATE REQUIREMENTS: * Know when to triage and when to dive down into a root-cause analysis * Passion for reliable, scalable, observable software with strong sense of ownership * Deep experience with Linux system administration * Experience developing and monitoring mission-critical systems * Substantial experience with a programming language like Python, Golang, Java, C * Working knowledge of a centralized configuration tool like chef, puppet, or ansible * Experience with or interest in learning about streaming applications and media servers * Bonus: experience configuring and monitoring CDNs. We use Akamai, Cloudfront, Cloudflare ABOUT PELOTON: Founded in 2012, Peloton is a global interactive fitness platform that brings the energy and benefits of studio-style workouts to the convenience and comfort of home. We use technology and design to bring our Members immersive content through the Peloton Bike, the Peloton Tread, and Peloton Digital, which provide comprehensive, socially-connected fitness offerings anytime, anywhere. We believe in taking risks and challenging the status quo by continuously innovating and improving. Our team is made up of passionate brand ambassadors, and we know that together, we go far. Headquartered in New York City, with offices, warehouses and retail showrooms in the US, UK and Canada, Peloton is changing the way people get fit. Peloton has been named to many prestigious industry lists, including Fast Company's Most Innovative Companies, CNBC's Disruptor 50, Crain's New York Business' Tech25 and Fast50, as well as TIME's Genius Companies. Visit www.onepeloton.com/careers to learn more about joining our team. Peloton Cycle New York NY

Site Reliability Engineer- Compute Platform

Bloomberg