We are now looking for a Senior Data Engineer - AI Data Platform
NVIDIA is hiring senior distributed systems and data engineers to scale up its AI infrastructure and deep learning platforms! You will need to have strong programming skills and a deep understanding of data science technologies. You should have production grade experience working with heterogeneous data types at scale, cloud technologies, distributed storage & compute systems, and distributed services architecture. You will require excellent communication and planning skills. Together, we will help advance NVIDIA's capacity to build and deploy leading solutions for a broad range of AI based applications such as autonomous vehicles, healthcare, virtual reality, graphics engines and visual computing.
What you'll be doing:
Orchestrate large PB sized data storage and compute clusters across bare metal and cloud that support distributed AI infrastructure.
Design and program scalable data lake interfaces, microservices, and web technologies that support ingesting and querying structured data.
Architect and program high performance compute and data pipelines that support efficient queries. Enabling efficient data selection is a key ingredient to successful machine learning!
Build and implement support for versioned, traceable, and immutable datasets in a data lake in a distributed and scalable manner.
Stand up distributed systems for mining and analyzing data to help AI leaders and researchers make data driven decisions for data collection, diversity, training, and evaluation.
We will spend a majority of the time writing and peer reviewing high performance, high quality, and well tested and well architected code.
What we need to see:
You have a BS or MS in Computer Architecture, Computer Science, Electrical Engineering or a related data intensive Engineering Degree with 7+ years of relevant experience in a programming intensive role.
Strong programming background that incorporates methodologies like data structures, design patterns, OOP, and test driven development.
A technical authority with strong experience in traditional big data technologies, databases, analytics, and common architectures.
Built and orchestrated massive business critical data clusters, infrastructure, services, and ETL pipelines in a cloud environment.
Proven experience in collaborating with multiple teams to collect and process large amounts of data, building microservices, and RESTful APIs.
An expert programmer in Go, C/C++, Scala, and SQL.
Advanced expertise in MapReduce, Hadoop, Hive, Presto, Spark.
Highly motivated with strong interpersonal skills, you have the ability to work successfully with multi-functional teams, principles and architects and coordinate effectively across organizational boundaries and geographies.
Ways to stand out from the crowd:
Experience with structured data such as Avro, Parquet, Protobuf, Thrift, and concepts like schema evolution.
Experience with full-stack web based visualization technologies to help provide data insights.
Strong understanding of Docker and orchestration systems such as Kubernetes.
Do you have a go getter attitude to dive deeper and understand technical requirements?
NVIDIAis widely considered to be one of the technology industry's most desirable employers. We have some of the most brilliant and talented people in the world working with us and our engineering teams are growing fast in some of the hottest state of the art fields: Deep Learning, Artificial Intelligence, and Autonomous Vehicles. If you're a creative and autonomous computer scientist with a real passion for distributed systems and parallel computing, we want to hear from you.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression , sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.