Amadeus Bogota , NJ 07603
Posted 7 days ago
Job Title
Senior Service Reliability Engineer
Front Office group oversees the front office applications used by Travel Agents and Travel Management Companies across the world (flagship application being Selling Platform Connect). The products are enterprise class products, hosted in private / public clouds and built primarily using standard open software on a Java platform.
Within the Front Office group, you will be part of the FOROPS A team in charge of security, imple-mentations, deployments, and support of our applications.
In this role you'll:
Maintain Amadeus distribution services on the cloud by measuring and monitoring sys-tem health, availability, latency and all operability criteria.
Coordinate incident response, post-incident investigation and practice follow-up/post-mortems reports
Engage continuous improvement of the whole product and platform lifecycle through development design, deployment, operations and refinement.
Adapt and improve the monitoring, alerting, log management, auto-recovery (Prome-theus, Splunk, Grafana etc.) to meet our SLAs/OLAs
Coordinate, review and support major releases / technical changes with application owners and middleware organizations.
Optimizing on-call rotations and processes for all supported applications
Support of the production and customer facing test environments for Amadeus distribu-tion portfolio
Coordinate incident recoveries and follow-up actions acting a liaison between Opera-tions and Development teams.
Support the migration to the Amadeus Cloud Services (ACS), powered by OpenShift, Az-ure, Kubernetes, Docker, RedHat JBoss EAP 7 & Java 8
Strong and deep troubleshooting on a cloud (OpenShift / Kubernetes) environment
Setup and implement the Continuous Integration/Continuous Deployment (Software Workbench, Jenkins-based delivery pipeline)
Adapt and improve the monitoring, alerting, log management (Prometheus, Splunk, Grafana, ...)
DevOps expert & architecture role within Agile framework
Review and support server-side infrastructure changes
Provide and update documentation of production architecture and operational guide-lines.
Review application functional change requests and environment configuration manage-ment
Coordinate and validate major releases and service packs for Amadeus distribution prod-ucts.
Monitor and improve the availability and reliability of our products.
Availability and stability of production environments, respect of SLAs
Support and follow-up of all problems related to production environment.
Support the migration to the Amadeus Cloud Services and the DevOps mindset.
Proactive monitoring, problem investigation and technical troubleshooting
Automate manual tasks and improve tooling, monitoring, alerting.
Participation in Agile scrum teams
Provide expertise on production architecture solutions and guidelines for operability and stability.
Provide and update documentation.
About the ideal candidate:
Experience with Unix/Linux and cloud systems
Experience in programming in Java, Groovy, Python and building tools.
Experience in designing and troubleshooting large-scale distributed systems.
Ability to debug, optimize and automate routine tasks.
Systematic problem-solving approach
Working at Amadeus, you will find
A critical mission and purpose
A truly global DNA - Everything at Amadeus is global, from our people to our business, which translates into our footprint, processes, and culture.
Great opportunities to learn
A caring environment
A complete rewards offer
A flexible working model
A diverse and inclusive community
A Reliable Company
Diversity & Inclusion
We are an Equal Opportunity Employer and seek to hire the best candidate regardless of age, beliefs, disability, ethnicity, gender or sexual orientation.
Amadeus