Dataiku's mission is big: to enable all people throughout companies around the world to use data by removing friction surrounding data access, cleaning, modeling, deployment, and more. But it's not just about technology and processes; at Dataiku, we also believe that people (including our people!) are a critical piece of the equation.
We are looking for a Data Science Intern to join Dataiku in our New York office for a summer internship.
As part of the Data Science team, you will work on improving the 1) controls over and 2) explainability of machine learning algorithms available on Dataiku's Data Science Studio (DSS). This includes contributing to open source software commonly used in DSS (e.g. scikit-learn), and creating plugins in DSS. Time-permitting, an intern may have the opportunity to work with a Dataiku client to study the effectiveness of these new controls and explainability tools.
Researching potential ways to add logical constraints over different commonly-used machine learning algorithms
Submitting pull requests to open source software libraries to implement these constraints
Improve existing model-explainability plugins for DSS built by the Data Science and R&D teams (e.g. LIME)
Create interesting visualizations to help DSS users understand the impacts of their models at the model-level and individual row-level
Strong python skills
Strong understanding of data science techniques and machine learning algorithms
Prior experience using git and GitHub
Nice to Have:
Software development experience
To fulfill its mission, Dataiku is growing fast (having just closed a $101 Series C round in December 2018 and looking to double in 2019), but still maintains a startup spirit. Dataiku serves its global customer base from its headquarters in New York City as well as offices in Paris, London, Munich, Singapore, and Sydney. Each of our offices has a unique culture, but underpinning local nuances, we always value curiosity, collaboration, and can-do attitudes.