At eHealth, we are passionate about solving our nation's toughest problems to bring more suitable, accessible, and affordable health insurance to Americans. We are seeking a talented data engineer to join our new and growing data team, which is already making a valuable impact on the entire company. This person will help us develop cutting-edge data tools and pipelines to drive better and faster decision making within our company and to better serve our customers. This is a fast-paced, collaborative, and iterative environment requiring quick learning, agility, and flexibility.
Develop and improve the current data architecture, with an emphasis on data quality, better monitoring, and stronger data availability.
Collaborate with Data Scientists to implement advanced analytics algorithms that exploit our rich data sets for statistical analysis, prediction, clustering and machine learning
Design and build highly scalable data integration / ETL pipelines to improve data accessibility and consumption.
Employ big data technologies and run pilots to design a data architecture that scales with the growing data sets in the domain.
Bachelor's or Master's degree in Computer Science, Engineering, or a related quantitative field.
3+ years of overall work experience including Data Engineering, Database Engineering, Business Intelligence
2+ years of experience building scalable and reliable data pipelines using technologies such as Spark, AWS EMR, and Kafka.
Experience with designing digital data platforms leveraging clickstream data from Adobe Analytics
Demonstrable skills and experience using SQL with large data sets
Proficient in at least one programming language (e.g., Python, Ruby, shell scripting, Scala).
Proven experience in data modeling, ETL development, and data warehousing, or similar skills.
Proven track record of clearly communicating data infrastructure, data models, and data engineering solutions in writing, including the ability to communicate effectively with both business and technical teams.
Master's degree in Computer Science, Math, Physics, Engineering, or a related technical field, or equivalent work experience.
Working experience with implementation of scalable models using various statistics and machine learning toolkits (Pandas, SciPy, Scikit-learn, MLlib, Spark ML, etc.).
Strong data modeling skills and knowledge of statistical models and data mining algorithms.
Strong experience in designing and implementing data APIs.
Knowledge of the health insurance industry, including its products, systems, and business strategies.
Working experience with large healthcare-related datasets, including EHRs, medical claims data, and health population surveys. Experience building healthcare data pipelines would be a big plus.
Experience working with call center operations.
Experience designing and operating very large Data Warehouses.
Familiarity with Linux.
Familiarity with DevOps concepts.
eHealth is an Equal Employment Opportunity employer. It is our policy to provide equal opportunity to all employees and applicants and to prohibit any discrimination because of race, color, religion, sex, national origin, age, marital status, sexual orientation, genetic information, disability, protected veteran status, or any other consideration made unlawful by applicable federal, state or local laws. The foundation of these policies is our commitment to treat everyone fairly and equally and to have a bias-free work environment.