Data Engineer

Amazon.Com, Inc. Boston, MA 02298

Posted 2 months ago

Amazon Elastic File System (EFS) is built in Boston, MA. It's a fully managed service that makes it easy to set up and scale shared file storage in the AWS Cloud. Amazon EFS is designed for a wide variety of use cases: data analytics, video rendering, genomics analysis, web serving, content management, and home directories, to name a few.

As a Data Engineer in AWS EFS you will work on the data pipeline and analytics to provide business and engineering stakeholders key insights into our customers' file system performance. You will get the exciting opportunity to interact with very large data sets in one of the most complex data warehouse environments. You will have the opportunity to help business and engineering stakeholders determine what performance metrics they should be tracking, and to establish new and expand existing automated data collection to feed into the data pipeline. You will regularly apply your analytical and problem-solving skills and perform analysis with tools like Jupyter, SageMaker, and Pandas so we better understand our customers' file system performance.

Day-to-day you will:

  • Work closely with product management, sales, and business stakeholders to analyze data from a multitude of sources.

  • Design, implement, and maintain a data pipeline and analytical environment using third-party and in-house reporting tools, modeling metadata, and building reports and dashboards.

  • Use creative problem-solving to automate the collection and analysis of data from available sources in order to deliver actionable output.

  • Iteratively improve analysis and identify new metrics to improve analytics.

Qualifications:

  • 1+ years of experience as a Data Engineer or in a similar role

  • Experience with data modeling, data warehousing, and building ETL pipelines

  • Experience in SQL

  • 1+ years of industry experience in software development, data engineering, business intelligence, data science, or related field with a track record of manipulating, processing, and extracting value from large datasets

  • Demonstrated strength in data modeling, ETL development, and data warehousing

  • Experience using big data technologies (Hadoop, Hive, HBase, Spark, etc.)

  • Knowledge of data management fundamentals and data storage principles

  • B.S. degree in mathematics, statistics, computer science or a similar quantitative field.

  • Experience in writing complex, optimized SQL queries across large datasets.

  • Experience with data analysis tools like Jupyter and Pandas.

  • Experience working with a diverse set of business and engineering stakeholders at all levels

  • Experience with AWS technologies including Redshift, SageMaker, EMR, RDS, S3, and Kinesis

  • Demonstrated ability to coordinate projects across functional teams, including engineering, sales, product management, finance, and operations

  • Proven track record of successful communication of analytical outcomes through written communication, including an ability to effectively communicate with both business and technical teams

Amazon is committed to a diverse and inclusive workforce. Amazon is an equal opportunity employer and does not discriminate on the basis of race, ethnicity, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit


Similar Jobs

Data Engineer


Posted 1 week ago

10/20/2020 12:00:00 AM 2021-01-18T00:00

We're currently seeking a Data Engineer for the Data Integration Platform. In this role, you will leverage your biology, chemistry, and/or pharmaceutical experience to architect highly impactful solutions and deliver these solutions to our customers in the enterprise pharmaceutical industry and biotech companies.

You will start as a data engineer in our delivery team, owning the prototype and implementation of a customer solution. Your responsibilities include:

  • Research and prototype data acquisition strategy for scientific instruments used in the lab

  • Research and prototype file parsers for instrument output files (Excel, PDF, sometimes vendor-specific binary formats)

  • Design and build data models for scientific instruments and CRO/CDMO reports

  • Design and build data pipelines, unit tests, integration tests, and utility functions using Python

  • Build visualizations, reports, and dashboards using Spotfire, Tableau, Jupyter notebook, etc.

  • Work with the customer to test and make sure the solution fulfills their requirements and solves their need

  • Coordinate project kickoff meetings; manage the customer relationship throughout the project; and conduct formal project closeout meetings

  • Facilitate internal project post-mortems to identify areas of improvement on the next implementation

During this process, you will work with the delivery team lead to provide detailed estimates for the number of billable hours per implementation; manage implementation scope and transform the technical spec into agile user stories and technical tickets; and develop a sprint cadence plan for completing the project. You will maintain a project budget to drive decisions and ensure on-time, on-budget, and on-scope delivery.

You will communicate very closely with the rest of the delivery team, product management and engineering team to identify potential improvements to the Data Integration Platform.

As you gain experience as a data engineer, you can:

  1. Take on larger projects that have a more complicated scope and continue to grow as a data engineer.

  2. Take on more platform and product features and continue to grow into a software engineer.

  3. Take on more responsibility in pre-sales as Solution Architect. In this role, you will:

      • Work closely with sales to understand Life Sciences companies' need for data collection, data integration, data management, and data science.

      • Design and architect the solution using our Data Integration Platform.

      • Create and negotiate Statements of Work

      • Perform pre-sales demos for Life Science companies

      • Perform pre-sales investigations and prototypes of data sources and data models

Requirements

Basic requirements:

  • Proficient with Python

  • Proficient with SQL

  • Passionate about science and building solutions to make the data more accessible to the end users

Nice-to-have:

  • Elasticsearch, science background or experience with scientific instruments

  • Experience with tools like Spotfire, Tableau, Jupyter notebook (any of them)

  • Excellent communication skills, attention to detail, and the confidence to take control of project delivery

  • Ability to understand a highly technical product and communicate with the product management and engineering team effectively

  • Strong project and account management skills

  • Strong interpersonal and proactive problem-solving skills

  • Ability to think creatively on how to solve project risks without reducing quality

  • Team player and ability to roll up your sleeves and do what it takes to make the team successful

Benefits

  • Ability to choose and craft the next step in your career path as a Data Engineer, Software Engineer or Solution Architect. Room for you to explore your interest, establish your skillset and tailor your direction based on your passion and interest.

  • Professional development fund for you to gain relevant certificates, such as the AWS Solution Architect certificate

  • Competitive salary and equity in a fast-growing company

  • Comprehensive Medical, Dental and Vision coverage

  • Generous paid time off (PTO)

  • Catered team lunch every Wednesday and a variety of free snacks

  • Convenient location in Boston Downtown Crossing area; close to the T (red/orange/green/blue line) and South Station

  • Open office environment for maximum collaboration

Tetrascience Boston MA

Data Engineer

Amazon.Com, Inc.