Landing AI San Diego , CA 92140
Posted 4 weeks ago
About the company:
RapidFire AI is a cutting-edge deep tech startup specializing in scaling Machine Learning solutions. We are dedicated to empowering customers to effortlessly scale their AI workloads, ensuring they stay at the forefront of innovation in their industries.
About the Role:
We are seeking a highly motivated and skilled Cloud DevOps Engineer to join our growing team. In this role, you will be responsible for deploying and managing customer applications on Amazon Web Services (AWS), as well as integrating a custom application that directly manages operations on AWS instances. You will collaborate closely with developers and customer-facing personnel to ensure seamless execution and high availability.
Responsibilities:
Design, develop, and deploy applications on AWS infrastructure, following best practices and security guidelines. Potentially extend these services to other public cloud service providers
Use containerization and orchestration tools like Docker and Kubernetes to quickly deploy and maintain applications on clusters
Use Infrastructure-as-Code (IaC) tools like Terraform or Cloud
Configure and manage AWS infrastructure services, including EC2, S3, EBS, EFS, EKS, and Lambda
Monitor and troubleshoot AWS resources to ensure optimal performance and availability
Collaborate with developers to define and implement cloud infrastructure strategies for applications
Implement continuous integration and continuous delivery (CI/CD) pipelines for efficient application deployments
Deliver and maintain applications for customer accounts via a virtual private cloud (VPC) on AWSDesign, develop, deploy, and maintain spend aggregation and billing strategies for different applications
Integrate custom applications that manage and configure customer AWS instances based on specific needs
Automate routine tasks and implement monitoring solutions for proactive issue identification and resolution
Communicate effectively with customers to understand their requirements and ensure successful application deployments
Stay up-to-date with the latest advancements in cloud technologies and best practices
Qualifications:
4+ years of experience as a Cloud Engineer, Cloud DevOps, or similar role using AWS infrastructure services
Proven experience with AWS deployments and architectures, including VPC, security, authentication, and billing practices
4+ programming experience with Python and shell scripting like Bash4+ years deploying production applications with Docker and Kubernetes
Familiarity with IaC tools like Terraform or CloudFormationUnderstanding of CI/CD methodologies and tools
Basic experience with version control systems (e.g., Git) and collaborative development workflows
Excellent communication and collaboration skills
Ability to work independently and as part of a team
Strong problem-solving and analytical skills
A passion for learning and staying updated with the latest cloud technologies
Nice to have:
AWS certifications (e.g. AWS Certified Solutions Architect - Associate)
Kubernetes certifications (eg. CKA - Certified Kubernetes Administrator)
Demonstrated experience deploying Deep Learning tools like PyTorch or TensorFlow on GPU clusters in production
Knowledge of tools like MLFlow or similar
Knowledge of equivalent infrastructure services on other public cloud providers
Landing AI