Senior Data Scientist

Green Dot, Pasadena, CA 91101

Posted 1 week ago

Job Summary

As a Senior Data Scientist at Green Dot, you will help drive revenue and reduce costs across the organization by mining, testing, and analyzing data and turning it into actionable insights. You will own aspects of data creation and management, data pre-processing, modeling, and inference. You will also be involved in our efforts to develop and mature a scalable program to test and learn more about our customer base, as part of a team that is leading the next wave of data analytics at a whole new scale. Your primary responsibility will be to use statistical analysis to design and integrate models that support or automate decision making throughout the business, including but not limited to acquisition, customer care, customer experience, fraud detection, revenue forecasting, and marketing. You are also expected to conduct deep-dive analyses of historical data and provide forward-looking insights and recommendations grounded in an evidence-based understanding of customer behavior and the business. You must be comfortable with ambiguity while working in a fast-paced, dynamic environment.

Job Responsibilities

  • Use statistical and programming software combined with analytical skills to mine massive sets of data to formulate new models or algorithms for predicting future events.

  • Perform exploratory data analysis to deeply understand the customer and the business, and to surface insights that summary-level descriptive analysis would miss.

  • Conduct deep-dive, non-modeling analyses of urgent business problems on request, and respond with timely insights and recommendations for the business.

  • Implement models with technology and data engineers on various platforms.

  • Understand the potential impact on operations and business in terms of benefits as well as risks, and be able to assess the robustness and reliability of the approach.

  • Discuss your analysis at any level of detail with your peers or executives, tailoring your presentations to the knowledge level of your audience in both small and large group settings.

Job Requirements

  • MS or PhD degree in a quantitative discipline (e.g., statistics, operations research, bioinformatics, economics, computational biology, computer science, mathematics, physics, electrical engineering, industrial engineering)

  • 4 to 6 years of relevant work experience in data analysis or related field (e.g., as a statistician / data scientist / economist)

  • Experience using statistical software or languages such as Python, R (RStudio), or SAS

  • Ability to write complex SQL code

  • Strong analytical competencies and excellent creative problem-solving skills

  • Advanced understanding of statistical and predictive modeling techniques such as machine learning, decision trees, probability networks, clustering, regression, and neural networks, and their application to business decisions

  • Applied experience in natural language processing (NLP) a plus

  • Experience with data visualization and knowledge of business intelligence reporting tools

The Senior Data Scientist is an upper-level position in 605's data science group, focused on the statistics-heavy, technical/backend aspects of data analytics. At 605, the Data Science and Client Analytics teams share a baseline knowledge of statistics and machine learning, R and Python, and familiarity with relational databases and the scale/variety of 605's data assets. In contrast to the Client Analytics team, Data Scientists bring to the table some deeper technical skills, including:

Hands-on experience with industry-standard predictive modeling solutions such as scikit-learn, xgboost, and Spark ML, including in a production setting
Familiarity with diverse methods for supervised learning and unsupervised learning, and with ETL pipelines in general
A more nuanced understanding of cloud-based (Amazon Web Services) computing resources/infrastructure, especially the need for and methods of parallelization for analytics tasks
Experience in designing and implementing wrapper/tool/utility functions for automated tasks, to be used by less-technical Analytics team members, and organizing them into distributable, regularly maintained and tested R/Python packages
Proficiency with source code management (git) and related code development/review workflows, as well as continuous integration tools like Travis and/or Jenkins

Data Scientists at 605 are generally involved in at least two different projects at any given time and work alongside other data scientists or analysts. Projects produce both client-facing deliverables and internal tools/datasets consumed by various other teams at 605. This role could potentially be based out of one of 605's offices (in New York City or Pasadena, CA), or it could be full-time remote, depending on the circumstances.

Requirements

Master's degree in a quantitative, scientific, or engineering field and at least 2 years of experience in a data science industry position
Advanced-level proficiency in either R or Python (baseline proficiency in both)
Intermediate-level proficiency with Apache Spark (via Python, R, Scala, and/or Java), with application to machine learning and/or ETL pipelines
Knowledge of diverse modeling algorithms for supervised learning, including most of the following: scikit-learn, xgboost, Spark ML,
Experience with most of the following AWS services: Redshift, S3, EC2, EMR, and Glue
Advanced-level proficiency with Linux/Unix operating system, command-line/shell environments, accessing remote machines and using Docker containers
Experience with git and

Preferred Skills

Doctoral degree in a quantitative, scientific, or engineering field and/or at least 4 years of experience in a data science industry position
Past work with household-level or person-level data sets including demographic, CRM, and/or self-reported (survey) data
Past work with time-stamped video consumption/viewing data or device usage data
Advanced-level proficiency in both R and Python (including class/function structure, package design and management)
Advanced-level proficiency with Apache Spark (via Python, R, Scala, and/or Java), including optimization for local and cluster scale applications
Experience with all of the following AWS services: Redshift, S3, EC2, EMR, and Glue
Knowledge of information security best practices

Benefits

Comprehensive health and dental insurance for employees and their families
Life insurance
401(k) with company match (eligible for the match after one year)
Pre-tax flexible compensation plan for medical, transit, parking or dependent care expenses
PTO & Sick days—if you're sick, you stay home
Work-from-home Fridays
A kitchen stocked with sodas, snacks, yogurt and other goodies
A tight-knit startup community that likes to eat! We celebrate everyone's birthdays, have frequent team lunches, and hold events in and out of the office
605 is an active participant in conferences

