The EC2 Server Provisioning and Management team is seeking a talented Systems Development Engineer to own the automation of operational and business practices. You should be a self-starter with a bias towards independent problem solving. The role includes identifying and resolving issues, as well as automating and improving repetitive processes. You will become an expert in the process by which EC2 transforms servers into host machines capable of accepting customer compute instance launches. You'll drive prioritization of operational issues, and drive automation design and scaling requirements in support of the wider organization.
The ideal candidate must be detail-oriented, have superior verbal and written communication skills and strong organizational skills, and be able to juggle multiple tasks at once, work independently, and maintain professionalism under pressure. You must be able to dive deep into a problem space to identify solutions that automate server provisioning processes. You must be able to accurately prioritize projects, make sound judgments, work to improve the customer experience, and get the right things done.
The EC2 Server Provisioning and Management team is developing next-generation services to support the development of new EC2 instance types and maintain the elasticity facade that our customers depend on every day. We are looking for engineers with a passion for building at the interface between software and hardware to deliver rock-solid services and platforms at EC2 scale. Come be part of a dynamic, fast-moving team that works every day at the forefront of innovation!
Bachelor's degree in Computer Science or related field or 4 years of equivalent experience
2+ years of experience in Systems and Network Administration, DevOps or Site Reliability Engineering
2+ years of experience with a focus on areas such as systems, network, and/or application security
2+ years of experience with web protocols and Linux/Unix tools and architecture
2+ years of experience building automated tools in Python, Java and/or Ruby
2+ years of experience with agile software development practices
5+ years of experience in Systems and Network Administration, DevOps or Site Reliability Engineering
Experience with open-source technologies
Excellence in technical communications with both technical and non-technical peers
Experience with maintaining distributed systems and web services
Understanding of design for scalability, performance and reliability
Automation, testing or monitoring framework development
System engineering experience in the cloud
Self-drive to keep moving things forward even in the face of ambiguity and imperfect knowledge (resilient to hazards of "analysis paralysis")
Strong sense of ownership and drive