The Infrastructure Senior Cloud Support Administrator should have broad knowledge and skills across a wide variety of technical topics. This is an operational administrative/engineering role supporting OS and infrastructure issues. The ideal candidate will have Linux system administration experience with a focus on diagnosing and troubleshooting OS-related issues and providing root cause analysis and preventative measures to clients. This is an L3 role, so the candidate should be able to act as a subject matter expert (SME) on RHEL-based systems.
Support cloud infrastructure by handling escalations, providing root cause analysis, and proposing process improvements that prevent future issues that could result in downtime for the client
Implement and maintain cloud infrastructure and services within a large managed service enterprise environment
Create and maintain automation scripts to support provisioning and deployment of multiple software stacks for both Linux and Windows, including database, web, and messaging applications
Develop and administer technologies for continuous build, integration, and deployment of software and agents needed to support multiple software stacks
Coordinate scheduling and deployment of patches and compliance updates for a large number of servers using supported systems (ServiceNow, CIAP, BigFix)
Collaborate with technical support professionals from numerous teams in a fast-paced, results-driven environment
Ability to work independently on multiple projects/technologies/security and compliance efforts to drive them to completion
Identify and communicate preventative measures to reduce future incidents
Create and maintain written technical documentation covering architecture changes and troubleshooting of both OS (Linux and/or Windows) and infrastructure (vCenter)
Use Ansible, Chef, and other tooling to build out and automate infrastructure
Pioneer an agile organization that adapts quickly to change and meets the demands of the businesses we support
Design testing approaches, complex processes, and reporting streams, and automate repetitive tasks
Review requirements documents, define hardware requirements, and examine and update processes and procedures as necessary
Develop projects required for design of metrics, analytical tools, benchmarking activities and best practices
5+ years of experience with RHEL configuration/administration in an environment of 1000+ servers. RHCE-level knowledge preferred
2+ years of experience working with one or more scripting languages (e.g., Python, Bash, Ruby, Perl, PowerShell)
2+ years of experience working with Chef and its integration into automation in a virtualized Linux environment
5+ years of experience working in a large high-availability clustered environment (VMware preferred)
3+ years of experience with one or more DevOps or orchestration/configuration management tools and their integration with multiple back-end systems (Ansible, Puppet, Docker, etc.)
5-8 years of experience in Infrastructure Technologies delivery with a proven track record of operational process change and improvement
Ability to work with virtual and in-person teams, and work under pressure or to a deadline
Experience in a Financial Services or large complex and/or global environment preferred
Effective written and verbal communication skills
Deep understanding of the Linux OS and ability to troubleshoot, diagnose, and fix issues at the OS level
Understanding of how CPU, memory, and swap are used within the OS and at the cluster level, and how to use tools to measure and interpret their usage within the OS
Proficiency in TCP/IP and a good understanding of networking and how network issues affect server performance
Experience with VMware, using vCenter and vROps to analyze infrastructure issues at the cluster, host, and individual VM layers, including generating meaningful reports to identify contention issues that could negatively affect VM performance
Understanding of Linux- and Windows-based systems administration in a cloud or virtualized environment
Experience with and knowledge of VMware clustered technologies (high availability, DRS, VMFS, vMotion, V2V migrations, snapshots, templates)
Familiarity with storage and backup technologies and their integration with virtual machines in a shared clustered environment (SAN, NAS, NetApp, VADP, NFS, SMB)
Understanding of capacity management in a virtual environment and experience using tools to track it (thin provisioning, usage vs. allocation, vROps)
Working knowledge of IaaS, PaaS, and SaaS cloud architectures and the capabilities and challenges involved with their implementation in a cloud environment
Understanding of web and application server technologies and how they work together to create a functioning stack (Apache HTTPD, Tomcat, Nginx, IIS)
Advanced Excel data analysis and manipulation skills (VLOOKUP, INDEX/MATCH, array formulas, pivot tables)
Knowledge of web services, API gateways and application integration development and design
Knowledge of and experience with Chef cookbooks, recipes, nodes, run lists, and attributes, and how they integrate into cloud provisioning and configuration tasks
Understanding of Ansible and its use as a DevOps tool to support automation of manual tasks
Experience with agile scrum practices
Understanding of Active Directory and LDAP and how they fit into the security model for managing access to servers
Grade: All Job Level, All Job Functions
Time Type: Full time
Citi is an equal opportunity and affirmative action employer.
Minority/Female/Veteran/Individuals with Disabilities/Sexual Orientation/Gender Identity.
Citigroup Inc. and its subsidiaries ("Citi") invite all qualified interested applicants to apply for career opportunities.