Senior Cloud Reliability Engineer

Wayfair LLC Boston , MA 02298

Posted 2 weeks ago

Senior Cloud Reliability Engineer

Wayfair is a leader in the e-commerce space for all things home. We live and breathe modern technologies. We are a "move fast break things, re-think old standards" team with a startup feel and continuous deployment.

We're looking for smart, logical thinkers who produce and advocate for performant, scalable designs. We are as much concerned about thought leadership, community involvement, and the ever-changing SRE (Site Reliability Engineering) landscape as we are with senior technical skills!

As a SRE Cloud Engineer you'll have a multitude of opportunities to flex your strengths as well as learn new things while directly assisting our internal customers. We contribute to (and create) bleeding edge open source projects and continuously push the envelope to explore the future of eCommerce and distributed infrastructure systems. The Cloud team focuses on moving workloads into the cloud, and designing and deploying the framework that our public cloud platforms are built upon.

What you'll do

  • Write clean, high-performance, and well tested, infrastructure code with a focus on reusability. (Puppet / Python / Terraform / Packer)

  • Create and maintain detailed documentation

  • Establish, maintain, and adhere to Wayfair technical standards, policies, and procedures

  • Leverage software development skills to enable self-service deployment of distributed systems

  • Recommend and implement infrastructure best practices in alignment with standard SRE principles and provide guidance on system performance and throughput expectations.

  • Path finding missions; taking existing platforms at Wayfair and helping move them to the cloud

  • Designing, implementing, and maintaining the Wayfair public cloud PaaS on Google Cloud

  • Researching and finding the right tools for the job, and if they don't exist creating them

  • Supporting teams who are transitioning to a multi-cloud world - Wayfair Operates on a true hybrid cloud model which includes, on prem (VMWare & K8s), GKE, and GCE.

What You Have

  • Experience with developing, building, securing and operating sophisticated and highly automated Cloud infrastructure (GCP or AWS) a must.

  • Prior success in automating and maintaining an efficient large scale real-world production environment.

  • Familiar with deployment patterns/strategy (blue/green, canary, rolling, draining)

  • Development experience with continuous integration (CI/CD) and automation tools such as GIT, Jenkins, Packer, Terraform, Puppet, Etc

  • The ability to design, author, and release code in languages like Go, Python, Ruby or Java

  • Experience managing full application stack with high availability requirements

  • Experience working with web technologies such as Nginx/PHP/etc at large scales

  • Able to engage in high-level client-side architecture discussions

  • Experience with performance tuning

  • Advanced Operating System Skills with knowledge of Linux internals

  • Ability to communicate effectively both verbally and in writing

  • Proven ability to collaborate and work well within a team

About Wayfair Inc.

Wayfair is one of the world's largest online destinations for the home. Whether you work in our global headquarters in Boston or Berlin, or in our warehouses or offices throughout the world, we're reinventing the way people shop for their homes. Through our commitment to industry-leading technology and creative problem-solving, we are confident that Wayfair will be home to the most rewarding work of your career. If you're looking for rapid growth, constant learning, and dynamic challenges, then you'll find that amazing career opportunities are knocking.

No matter who you are, Wayfair is a place you can call home. We're a community of innovators, risk-takers, and trailblazers who celebrate our differences, and know that our unique perspectives make us stronger, smarter, and well-positioned for success. We value and rely on the collective voices of our employees, customers, community, and suppliers to help guide us as we build a better Wayfair - and world - for all. Every voice, every perspective matters. That's why we're proud to be an equal opportunity employer. We do not discriminate on the basis of race, color, ethnicity, ancestry, religion, sex, national origin, sexual orientation, age, citizenship status, marital status, disability, gender identity, gender expression, veteran status, or genetic information.

icon no score

See how you match
to the job

Find your dream job anywhere
with the LiveCareer app.
Mobile App Icon
Download the
LiveCareer app and find
your dream job anywhere
App Store Icon Google Play Icon

Boost your job search productivity with our
free Chrome Extension!

lc_apply_tool GET EXTENSION

Similar Jobs

Want to see jobs matched to your resume? Upload One Now! Remove
Remote Senior Cloud Network Engineer


Posted Today

VIEW JOBS 5/13/2021 12:00:00 AM 2021-08-11T00:00 <div> <p><strong>REMOTE - Senior Cloud Network Engineer</strong></p> <p><strong>This role is with a Syrinx Retail e-Commerce Partner based in Boston, MA</strong></p> <p><strong>Fully Remote for now - Local candidates preferred (Prefer someone to be on-site post Covid)</strong></p> <p><strong>This role can be contract, contract to hire, or full-time</strong></p> <p> </p> <p>We are seeking an experienced <strong>Senior Cloud Network Engineer</strong> for development of networking software used in Commercial, Install and Portable professional audio markets. In this position you will lead and drive comprehensive system solutions and API designs. These systems solutions also may require integration of 3rd party hardware and software. Must possess strong organizational skills, development skills, and relevant educational background in software application integration and development on a variety of platforms including mobile, cloud, and desktop. This person will be responsible for development, integration, and testing of networked applications.</p> <p> </p> <p><strong>Key Responsibilities</strong></p> <p> </p> <ul> <li style="padding: 0; margin: 0;">Develop and implement AWS Transit Gateway in Cloud strategy in multiple AWS regions</li> <li style="padding: 0; margin: 0;">Collaborate with DevOps groups to deploy AWS Transit Gateway using code</li> <li style="padding: 0; margin: 0;">Migrate connectivity to 50+ VPCs in AWS from AWS Direct Connect to Transit Gateway</li> <li style="padding: 0; margin: 0;">Design and implement network segmentation and edge security in cloud to fulfill compliance requirements.</li> <li style="padding: 0; margin: 0;">Leverage and enhance Cisco SD-WAN solution to implement redundant, robust, and reliable connectivity to AWS Cloud applications from all offices across the globe</li> <li style="padding: 0; margin: 0;">Architect a strategy to improve performance for partners in APAC and EU regions accessing multiple applications in Azure via secure IPSec VPN connections</li> <li style="padding: 0; margin: 0;">Design and Implement AWS Storage Gateway at multiple remote locations to enable faster and dependable data transfer and replication to and from Cloud</li> <li style="padding: 0; margin: 0;">Perform cost analysis, identify over-provisioned/under-utilized resources, and develop a detailed response plan</li> <li style="padding: 0; margin: 0;">Design and deploy a service to automate monitoring and updating the active AWS Direct Connect connections using AWS Serverless framework leveraging services such as Lambda (in Python), DynamoDB, CloudWatch, and Step Functions</li> <li style="padding: 0; margin: 0;">Leverage AWS Control Tower for provisioning new accounts and enforce Service Control Policies via AWS Organizations </li> </ul> </div> Syrinx Boston / Remote MA

Senior Cloud Reliability Engineer

Wayfair LLC