Data Engineer

Brandless, San Francisco, CA 94118

Posted 2 months ago

Brandless Overview:

Brandless is reinventing modern consumption by making better stuff at fairer prices available to everyone. We're starting with our everyday essentials collection: better-for-you versions of the things you use all the time, from snacks to soap to spoons. All for just $3. We created and curated products that match your values and requirements, from organic to gluten-free to paraben-free. Plus, with every order placed on Brandless.com, a meal is donated to people facing hunger through Feeding America.

Team Overview:

The data team plays a crucial role in the growth of Brandless as we partner with engineering, product, marketing, operations and other teams to lead data-driven decisions and product design. We are excited to reinvent modern consumption and we are the data experts behind our mission.

As the first Data Engineer at Brandless, you will support the analytics and algorithms teams to scale and improve our data infrastructure. We maintain complex analytics data models from a variety of sources and production machine learning models, but we have only just begun building our infrastructure. You will take these tools to the next level and build the systems we need next.

The team is growing quickly and each member will greatly shape the future of our impact on Brandless. You will have the opportunity to both learn from existing members and mentor your future colleagues.

This position is located in our HQ in San Francisco.

Essential Responsibilities:

  • Own and maintain our core data warehouse in Redshift. This will include building out monitoring systems and determining when other datastores might be needed as we grow to a massive scale.

  • Onboard both internal and external data sources to empower our analytics and algorithm teams.

  • Optimize, refactor and grow our existing data model ETL infrastructure (Airflow).

  • Partner with the engineering team to help design data structures for new features in production and ensure the data team isn't impacted by these changes.

  • Work closely with the algorithms team to maintain and improve our production machine learning model serving infrastructure.

  • Work closely with the analytics team to support our business intelligence tool (Looker).

  • Monitor new systems, tools and technologies in the data engineering world and bring these tools to Brandless.

Minimum Qualifications:

  • BS in Computer Science, Mathematics, Statistics or similar technical field of study, or equivalent practical experience

  • At least 2 years of data engineering experience at a fast-growing company

  • Expertise in SQL and data model design

  • Experience maintaining large-scale data stores (Redshift, BigQuery, etc.)

  • Proficiency in production-level Python programming

  • Experience implementing and optimizing a workflow scheduler (Airflow, Luigi, etc.)

  • Working knowledge of Spark and distributed systems

  • Working proficiency in verbal and written English, with strong communication skills

Preferred Qualifications:

  • MS/PhD in a technical field of study

  • Experience with R and other statistical tools

  • Experience developing large-scale machine learning serving systems running in production

  • Working knowledge of the LookML language

Brandless provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, sex, national origin, age, disability or genetics. In addition to federal law requirements, Brandless follows applicable state and local laws governing non-discrimination in employment in every location in which the company has facilities. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and training.

