Lead ML Engineer - Recommender Systems (P4190)

8451 Chicago , IL 60602

Posted 4 weeks ago

G3 - Lead ML Engineer, Relevancy Team - Personalization & Loyalty Strategy (P4190)

Relevancy Team is responsible for making relevant and personalized customer experiences for Kroger's E-commerce site, which ranks among the top 10 ecommerce companies in the US. We deliver trillions of recommendations to the Kroger website at scale and make them available to millions of Kroger customers.

The team has a rich portfolio of sciences which include product and coupon recommender systems, substitute recommendations, and shoppable recipes. We are seeking a talented and experienced lead ML engineer to join our data science team, specialized in building search and recommender systems.

What does the role entail? (Responsibilities)

ML System Architecture. Design the architecture of a ML system, including all parts of the ML life cycle, model serving infrastructure and integration with other software components.

Use advanced knowledge of different model architectures to implement ML algorithms and ensure scalability. Strong skills to choose appropriate data representation methods, utilizing tools and tech to integrate sciences into ML pipelines.

Training and Inference pipelines. Implement efficient ML training pipeline to automate the process of loading data, preprocessing, training models and saving model artifacts.

Design the inference pipeline to handle real-time or batch inference requests in production. Implement the inference pipeline to load the trained model artifacts, preprocess input data and generate predictions. Modularize the pipeline components to facilitate code reuse and maintainability.

Performance Optimization. Proactively work towards improving the performance of the ML models for speed, efficiency, and resource utilization.

Research and implement different optimization techniques as needed for different recommender systems. Optimize inference pipeline for low-latency and high throughput performance.

API Development/Integration. Setting up APIs for model inference, monitoring model performance, and managing model versions. Develop, define, and integrate the different API endpoints that accepts user input and returns recommendations.

Model Deployment. Responsible for model deployment to cloud infrastructure like GCP, using containerization technologies like Docker and orchestration frameworks like Kubernetes in non-prod environments.

Collaborate with other engineers to deploy ML models into production environments. Ensure model is load-tested to handle concurrent requests and that it performs well under different traffic conditions. Greater focus to ensure the ML systems are reliable, scalable, and efficient.

Monitoring and Logging. Responsible for setting up tools to track the model's performance, response times, and error rates.

Configure logging to capture relevant information for debugging and auditing. Actively developing processes and standards for efficient model versioning, rollback processes, quality controls and health checks.

Tech leadership. Research state of the art machine learning algorithms, processes, patterns, and tooling to identify new opportunities for implementation across the enterprise. Proactively use their understanding of different algorithms and frameworks for ML and distributed systems and computing to build scalable and reliable ML systems.

Collaborate with cross-functional teams. Collaborate closely with other data scientists, data engineers, and full stack engineers to implement data science solutions.

Actively work with platform engineers and full-stack software engineers in managing ML applications, recommending changes to writing infrastructure code and other IAC tools to setup a scalable infrastructure. Drive innovations in utilizing ML pipelines such as Vertex AI Pipelines for streamlined operations, metadata tracking, and continuous modeling experiences. Proactively work with data engineers to efficiently manage, share, and reuse machine learning features for both offline and online settings.

Mentoring. Provide guidance and mentorship to junior data scientists, especially on code and methodologies, and developing and sustaining technical pipelines for all parts of data science life cycle. Contribute to talent acquisition efforts, and work towards building a high-performing data science team

What skills and experience do you need? (Requirements)

  • Bachelor's/Master's degree or equivalent in computer science, data science, statistics, mathematics, analytics, or related discipline.

  • 4+ years of proven experience building deep learning models for large-scale recommender systems.

  • 3+ years of experience using Deep Learning frameworks such as Tensorflow, Pytorch, Fast.ai, Mxnet or HuggingFace.

  • 3+ years working with NLP/ML libraries (spacy, scikit-learn, SparkNLP).

  • 3+ years of experience working with cloud platforms like Azure or GCP.

  • 3+ years pf experience working with Python and Spark with track record of optimizing pipelines for efficiency and scalability to handle increasing data volumes. Experience working with Databricks is a plus.

  • Strong understanding of the internal workings of different ML frameworks, and practical experience in customizing architectures and loss function for specific use cases.

  • Hands-on experience developing software tools that scale (i.e. Python packages) and using end-to-end tooling to develop, test, and deploy these tools (i.e. CI/CD)

  • Other data science-adjacent technology experience would be beneficial but is not required, including Docker, Rest APIs, Fast API, Linux, and basic shell scripting.

  • High level of independence, able to make time-sensitive decisions rapidly and solve urgent problems without escalation.

  • Natural curiosity, welcomes and embraces change.

  • Excellent problem-solving skills and a proactive approach to addressing challenges.

  • Prior experience in the retail or e-commerce industry is a plus.

  • Must be able to learn from others and teach others and work collaboratively as part of a highly interdependent team.

  • Ability to communicate complex ideas effectively to both technical and non-technical stakeholders.

  • Experience with training, developing, recruiting, coaching, and/or inspiring highly technical associates.

Why join our team? (Rewards)

Impact millions of people. As a member of our data science team, you will have the opportunity to make a tangible difference in the lives of millions of customers by delivering relevant and personalized recommendations that enhance their grocery shopping experience. Your work will directly contribute to increasing customer satisfaction and loyalty, driving business outcomes for our company.

Continuous learning and development. Challenge yourself.

We are committed to fostering a culture of continuous learning and development. You will have access to resources and support for expanding your knowledge and skills in cutting-edge technologies, including recommender systems, machine learning and artificial intelligence. Our team encourages exploration and experimentation, providing opportunities to stay at the forefront of industry advancements.

Work on new developments in recommender systems. Join a team at the forefront of innovation in recommender systems and AI.

You will have the chance to contribute to pushing the boundaries of what's possible in personalized recommendation technology. You will have the chance to work on exciting projects that leverage the latest developments in deep learning architectures and data science methodologies.

#LI-SSS


icon no score

See how you match
to the job

Find your dream job anywhere
with the LiveCareer app.
Mobile App Icon
Download the
LiveCareer app and find
your dream job anywhere
App Store Icon Google Play Icon
lc_ad

Boost your job search productivity with our
free Chrome Extension!

lc_apply_tool GET EXTENSION

Similar Jobs

Want to see jobs matched to your resume? Upload One Now! Remove

Lead ML Engineer - Recommender Systems (P4190)

8451