Data Scientist

Yale University New Haven , CT 06501

Posted 5 days ago

Essential Duties

The Data Scientist will function in a multidisciplinary team environment, providing comprehensive machine learning data analysis with a focus on deep learning methods for multiple clinical and epidemiological studies of cardiovascular outcomes research conducted by investigators at the Center for Outcomes Research and Evaluation (CORE). This role involves planning, development, and implementation of methodology for specific research projects.

Under the direction of the Principal Investigator and Program Co-Director, the ideal candidate will perform a variety of duties involving the application of machine learning skills to the analysis of research studies, and will work as a member of a research team to provide input in the design of study, perform data analysis, and lead or assist drafting analytical sections for peer-review publication for various projects. The candidate is expected to lead several efforts for which recent deep learning methods are appropriate, and for which large amounts of complex data are available.

All research and programming code must be regularly documented and archived per best practices. Responsibilities will also include participating in the design, implementation, and maintenance of high performance computing (HPC) environments with graphical processing units (GPU) for accelerated deep learning computation. General responsibilities of the Data Scientist will be to coordinate and lead regular meetings that include agenda and materials preparation as well as meeting minute documentation, and to develop and manage timelines and research activities for research projects to ensure project goals and deliverables are met.

The Data Scientist will communicate information and data in written and verbal form to colleagues as well as project and senior management teams. The individual will also be responsible for contributing to the development of training and presentation materials and assisting with the planning and coordination of training and seminars.Develop and execute new and/or highly complex algorithms and statistical predictive models and determine analytical approaches and modeling techniques to evaluate potential future outcomes. Establish analytical rigor and statistical methods to analyze large amount of data, using advanced statistical techniques and mathematical analyses.

Manage analytical projects from data exploration, model building, performance evaluation, through implementation. Develop work plans and monitor progress and project timelines. Document coding and changes to work plans using established work group methods in GitHub.

Interact with a multidisciplinary team of internal and external peers to regularly, effectively, and openly communicate progress and outcomes of planned work. Attend weekly team meetings to discuss team and project-related activities, issues, change, communications, and updates.

Required Education and Experience

Master's Degree in computer science, applied/computational mathematics, engineering, biostatistics, statistics, or a quantitative field such as astronomy or geology, and 2 years of hands-on experience in deep learning or an equivalent combination of education and experience.

Background Check Requirements

All candidates for employment will be subject to pre-employment background screening for this position, which may include motor vehicle, DOT certification, drug testing and credit checks based on the position description and job requirements. All offers are contingent upon the successful completion of the background check. Please visit www.yale.edu/hronline/careers/screening/faqs.html for additional information on the background check requirements and process.

Position Focus:

The Data Scientist will function in a multidisciplinary team environment, providing comprehensive machine learning data analysis with a focus on deep learning methods for multiple clinical and epidemiological studies of cardiovascular outcomes research conducted by investigators at the Center for Outcomes Research and Evaluation (CORE). This role involves planning, development, and implementation of methodology for specific research projects.

Under the direction of the Principal Investigator and Program Co-Director, the ideal candidate will perform a variety of duties involving the application of machine learning skills to the analysis of research studies, and will work as a member of a research team to provide input in the design of study, perform data analysis, and lead or assist drafting analytical sections for peer-review publication for various projects. The candidate is expected to lead several efforts for which recent deep learning methods are appropriate, and for which large amounts of complex data are available.

All research and programming code must be regularly documented and archived per best practices. Responsibilities will also include participating in the design, implementation, and maintenance of high performance computing (HPC) environments with graphical processing units (GPU) for accelerated deep learning computation. General responsibilities of the Data Scientist will be to coordinate and lead regular meetings that include agenda and materials preparation as well as meeting minute documentation, and to develop and manage timelines and research activities for research projects to ensure project goals and deliverables are met.

The Data Scientist will communicate information and data in written and verbal form to colleagues as well as project and senior management teams. The individual will also be responsible for contributing to the development of training and presentation materials and assisting with the planning and coordination of training and seminars.


Preferred Education, Experience and

Skills:

Experience in leading conference publications, particularly in the deep learning field. Strong background in data analysis with a diverse set of platforms (e.g.

R, Python). Knowledge of advanced analytic approaches, including signal processing, image analysis, and supervised and unsupervised machine learning. Experience with version control, unit testing, and continuous integration environments.

Posting Disclaimer

The intent of this job description is to provide a representative summary of the essential functions that will be required of the position and should not be construed as a declaration of specific duties and responsibilities of the particular position. Employees will be assigned specific job-related duties through their hiring departments.

Affirmative Action Statement:

Yale University considers applicants for employment without regard to, and does not discriminate on the basis of, an individual's sex, race, color, religion, age, disability, status as a veteran, or national or ethnic origin; nor does Yale discriminate on the basis of sexual orientation or gender identity or expression. Title IX of the Education Amendments of 1972 protects people from sex discrimination in educational programs and activities at institutions that receive federal financial assistance.

Questions regarding Title IX may be referred to the University's Title IX Coordinator, at TitleIX@yale.edu, or to the U.S. Department of Education, Office for Civil Rights, 8th Floor, Five Post Office Square, Boston MA 02109-3921. Telephone: 617.289.0111, Fax: 617.289.0150, TDD: 800.877.8339, or Email: ocr.boston@ed.gov.



See if you are a match!

See how well your resume matches up to this job - upload your resume now.

Find your dream job anywhere
with the LiveCareer app.
Download the
LiveCareer app and find
your dream job anywhere
lc_ad

Boost your job search productivity with our
free Chrome Extension!

lc_apply_tool GET EXTENSION

Similar Jobs

Want to see jobs matched to your resume? Upload One Now! Remove
Data Engineer (Infrastructure & Architecture)

Patientwisdom

Posted 1 week ago

VIEW JOBS 11/6/2018 12:00:00 AM 2019-02-04T00:00 <p>PatientWisdom is a digital platform that creates, delivers, and captures value by turning patient perspectives into actionable information. Our goal is to improve health and the delivery of healthcare by listening to patients. Our tools help patients, families, and caregivers feel better; help clinical encounters work better; and help healthcare providers and organizations do better. We love what we do. We’re looking for hard-working, fun, creative people who share our passion, want to be part of a culture that integrates professional and personal growth with helping others succeed, and are excited about creating a better healthcare future.</p><p>The Data Engineer works with the CTO and CEO, Software Engineers, and other team members to implement PatientWisdom’s data vision utilizing extensive knowledge of databases and data engineering practices.</p> <p>Specifically, the Data Engineer will:</p> <ul> <li>Build human-fault-tolerant data pipelines; design, construct, install, test and maintain highly scalable, secure, and reliable data management systems; research opportunities for acquiring new data and new uses for existing data</li> <li>Create data storage solutions with the ability to scale, ensure systems meet business requirements and industry practices</li> <li>Ensure that data can be curated, processed, reported, and delivered to provide actionable information that meets current and emerging business needs</li> <li>Design, develop, and deploy high-quality code for a modern web app, applying Agile development and coding standards</li> <li>Drive the selection and administration of appropriate data storage solutions; maintain clean and usable data and ensure a deterministic pipeline</li> </ul> <ul> <li>Coordinate with the PatientWisdom team (i.e., Product Design, Engineering, and others as needed) during the requirements-definition process</li> <li>Plan and write supporting technical requirements and design documentation according to company standards.</li> <li>Follow in-house review, deployment and support processes using test-driven development concepts, peer review, unit and integration testing to ensure quality code is delivered</li> <li>Design data processing systems, develop test code, scripts, test tools, and test cases to validate software releases; develop data set processes for data modeling, mining and production; build and maintain data structures and databases; analyze data and enable machine learning</li> <li>Ensure that code fosters PatientWisdom privacy and security standards; build high-performance algorithms, prototypes, predictive models and proof of concepts</li> <li>Integrate new data management technologies and software engineering tools into existing structures</li> <li>Employ a variety of languages and tools (e.g. scripting languages) to marry systems together</li> <li>Recommend ways to improve data reliability, efficiency and quality</li> <li>Proudly and professionally represent PatientWisdom in sales, implementation, and/or funding efforts, as directed by the CEO and/or CTO</li> <li>Contribute to the overall success of PatientWisdom, as directed by the CEO and/or CTO</li> </ul> <p>Additional responsibilities:<br></p> <ul> <li>Work in a fast-paced environment</li> <li>Maintain Lean thinking and management, including reliable implementation of Agile development</li> <li>Share ideas for improvement in a positive, constructive manner</li> <li>Apply problem solving skills, knowledge of best practices and Agile methodologies to gather requirements, solution design, development and testing</li> <li>Complete tasks on time, on budget, and according to expectations</li> <li>Study relevant languages and tools to increase their knowledge and capabilities</li> <li>Contribute positively to the success of the company and teammates</li> </ul><p><strong>Requirements</strong></p><p><strong>Required Skills &amp; Qualification</strong><br></p><ul> <li>Fluency in SQL and NoSQL technologies, data modeling tools</li> <li>Proven coding ability more than one of Python, C/C++, Java, Perl, Ruby</li> <li>At least four years of data engineering experience with the ability to work independently</li> <li>Experience on Agile software development teams</li> <li>Proven ability to communicate with team members</li> <li>Proven ability to manage and organize work</li> <li>Passion for software development</li> </ul><p><strong>Desired Skills</strong></p><ul> <li>Experience with machine learning</li> <li>Statistical analysis and modeling</li> <li>Predictive modeling, NLP and text analysis</li> <li>Data mining</li> <li>Formal Agile and/or Lean training</li> <li>Experience with: <ul> <li>Searching and analyzing large datasets</li> <li>Analytics/Reporting/Visualization</li> <li>Docker</li> <li>AWS</li> </ul> </li> <li>Unique skill sets (e.g., MatLab, SAS, R)</li> <li>Master’s degree, preferably in a related field (e.g., Engineering, Computer Science, Statistics, Applied Math)</li> </ul><p><strong>Experience / Licenses / Training</strong></p><ul><li>Industry certifications (e.g., Google Certified Professional - Data Engineer)</li></ul><p><strong>Benefits</strong></p><ul> <li>Salary DOE</li> <li>Awesome healthcare benefits for you and your family</li> <li>Work with cutting edge technology</li> <li>Excellent equipment</li> </ul><p><br></p><p>Local candidates only, please. No recruiters.</p> Patientwisdom New Haven CT

Data Scientist

Yale University