Senior Data Platform Engineer

Stitch Fix San Francisco , CA 94118

Posted 4 months ago

About the Team

The Workflow, Environment, and Execution team is responsible for the data processing and ETL platform that data scientists use to build analysis pipeline and develop ML and statistical models used across every aspect of our business.

The Workflow, Environment, and Execution team is responsible for:

Workflow: Designing complex pipelines, scheduling and pipeline status tracking.

Environment: Building images (docker, AMIs, and RPMs), robust infrastructure for interactive Jupyter notebooks and RStudio sessions.

Execution: Extend and maintain Flotilla, our self-service framework that dramatically simplifies the process of defining and executing containerized job as well as build interfaces into our infrastructure and data warehouse.

About the RoleYou're excited about this opportunity because

  • you will identify areas of our model building pipeline infrastructure that need improvement and then plan and execute those improvements.
  • you will develop evolvable and robust systems for data scientists to easily define, deploy, and debug their ETL.
  • you will improve our hosted interactive computing environments -- long running Docker containers with Jupyter notebooks and RStudio installed and configured.
  • you will define and provide the next version of client interfaces into our data warehouse where simplicity and evolvability is key.
  • you will maintain reliable infrastructure for data scientists. Remote execution, ECS cluster management, Docker image building, Artifactory reliability, etc.

We're excited about you because

  • you have strong skills in at least one of the languages the team uses -- Python, Go, and Java.
  • you have at least 4 years of industry experience doing software development with exposure to data warehouses, ETL, and/or working with data scientists managing model building pipelines.
  • you are self motivated, disciplined and goal oriented.
  • you enjoy partnering with data scientists & have experience with data science workflows and/or ETL in traditional data warehouses.
  • you are empathetic to users and are motivated by building useful tools for our data scientists.
  • you are a seen as a role model for high quality engineering. This includes best practices in testing, communication, writing well structured code and setting an example the rest of the team can learn from.

Why you'll love working at Stitch Fix...

  • We are a group of bright, kind and goal oriented people. You can be your authentic self here, and are empowered to encourage others to do the same!

  • We are a successful, fast-growing company at the forefront of tech and fashion, redefining retail for the next generation

  • We are a technologically and data-driven business

  • We are committed to our clients and connected through our vision of "Transforming the way people find what they love"

  • We love solving problems, thinking creatively and trying new things

  • We believe in autonomy & taking initiative

  • We are challenged, developed and have meaningful impact

  • We take what we do seriously. We don't take ourselves seriously

  • We have a smart, experienced leadership team that wants to do it right & is open to new ideas

  • We offer competitive compensation packages and comprehensive health benefits

  • You will be proud to say that you work for Stitch Fix and will know that the work you do brings joy to our clients every day

About Stitch Fix

Stitch Fix is an online personal style service for men and women combining art and science to disrupt and redefine the retail industry. We're the first fashion retailer to blend expert styling, proprietary technology and unique product to deliver a refined and deeply personalized shopping experience. We leverage vast amounts of client data to make decisions throughout the company. All of this results in a simple, powerful offering to our customers and a successful, growing business. We believe we are only scratching the surface of our opportunity; and we're looking for incredible people to contribute! We'd love for you to help us carry on the trend.

upload resume icon
See if you are a match!

See how well your resume matches up to this job - upload your resume now.

Find your dream job anywhere
with the LiveCareer app.
Mobile App Icon
Download the
LiveCareer app and find
your dream job anywhere
App Store Icon Google Play Icon

Boost your job search productivity with our
free Chrome Extension!

lc_apply_tool GET EXTENSION

Similar Jobs

Want to see jobs matched to your resume? Upload One Now! Remove
Sr Data Scientist Platform Manipulation


Posted Yesterday

VIEW JOBS 1/15/2019 12:00:00 AM 2019-04-15T00:00 Sr. Data Scientist - Platform Manipulation San Francisco, CA Who We Are: Twitter users generate many terabytes of data every single day; Twitter engineers run hundreds of experiments; Twitter data scientists craft increasingly sophisticated models of users and content. The Analytics team's mission is to empower product and business through impactful and creative applications of experimentation, machine learning, and data analysis. What You'll Do: You will be a key member of the Investigations Data Science group within the Analytics team working closely with our partners in Trust & Safety, Public Policy, Legal, and Product to detect and mitigate platform manipulation and other malicious activity.. Your work will directly influence the exciting new product areas that Twitter is building. As such, you will: * Conduct analyses to learn from our vast amount of data * Apply statistical techniques to model suspicious user behavior, identify mechanisms for on-platform manipulation, and size mitigation opportunities. * Write complex data flows using SQL, Spark, Scalding, R and Python scripts. * Communicate findings to executives and cross-functional product teams. Who You Are: * Capable of operation at senior level or above as a Data Scientist or ML Engineer. * You have 3 plus years of industry or graduate level research experience working on political motivated manipulation, or relevant security issues. * You are a self-starter who is capable of learning on the job, takes initiative, and can thrive within a large team. You can pivot from blockers and develop a new approach when there are no precedents. * You form sound hypotheses for largely unknown problems (through a combination of product domain knowledge and logical reasoning) and quickly iterate on data exploration. * You are passionate about protecting open conversations on Twitter. You are a strategic thinker and are able to synthesize methodology and data into actionable product and/or public policy strategy from your analyses. We are committed to an inclusive and diverse Twitter. Twitter is an equal opportunity employer. We do not discriminate based on race, ethnicity, color, ancestry, national origin, religion, sex, sexual orientation, gender identity, age, disability, veteran status, genetic information, marital status or any other legally protected status. San Francisco applicants: Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records. Twitter San Francisco CA

Senior Data Platform Engineer

Stitch Fix