Digital S/W Engineer Lead Analyst Site Reliability Engineering

Citigroup Inc. Irving , TX 75061

Posted 3 months ago

The Digital S/W Eng Lead Analyst is a strategic professional who stays abreast of developments within own field and contributes to directional strategy by considering their application in own job and the business. Recognized technical authority for an area within the business. Requires basic commercial awareness. There are typically multiple people within the business that provide the same level of subject matter expertise. Developed communication and diplomacy skills are required in order to guide, influence and convince others, in particular colleagues in other areas and occasional external customers. Significant impact on the area through complex deliverables. Provides advice and counsel related to the technology or operations of the business. Work impacts an entire area, which eventually affects the overall performance and effectiveness of the sub-function/job family.


  • Accountable for executing and driving results on large-scale efforts or multiple smaller efforts and serving as a development lead for most medium and large projects. This includes expertise with application development methodologies and standards for program analysis, design, coding, testing, debugging and implementation.

  • Accountable for exhibiting a strong understanding of client core business functions.

  • Required to support situations in which end user consultation is required to identify system function specifications and incorporate them into overall system design and delivery. Additionally, utilize comprehensive knowledge of multiple areas within technology to achieve technological objectives.

  • Independent work style, requiring little or no guidance by more senior developers. Decisions will make a significant, measurable impact on the business goals for the client organization. During team discussions you will play a significant role with TPMs and engineering managers to determine potential risks to a schedule.

  • Assist in the planning and managing of application development assignments generally involving large budgets, cross functional projects or multiple projects. This includes effectively understanding and analyzing both technical and business risks and impact.

  • Expected to effectively communicate those risks to the business owners, so that they can make informed decisions.

  • Accountable for providing guidance on architecturally significant efforts during the preplanning phase, and ensuring principles and best practices are followed prior to initiation of work. In doing so, closely watch and evaluate Digital roadmaps, including impacts to support upcoming journeys.

  • Publish design review extensions, and provide documented guidance aligned to sprint plans and timelines.

  • Be part of the design review board that will focus on the design process, search for generic patterns, and, at the same time, share best practices across the organization.

  • Publish design patterns across lines of business and domain commonalities. Drive design reviews for Next Gen Architecture (NGA) and Plan of Record (POR) projects, supporting design principles and best practices.

  • Participate in micro services and NGA code reviews.

  • Empower SDEs and their teams by mentoring and coaching.

  • Have a comprehensive understanding of the business domain, the systems, and the products in your space. Understand their accountabilities, boundaries, limitations, scale factors and the reasons behind architectural decisions.

  • Provide a long-term perspective for business and technology choices; using technical judgment to vet architecture as required.

  • Able to direct teams on how to develop and deliver systems that are efficient with resource usage such as hardware, runtime, performance, load, and memory requirements.

  • Responsible for broader design decisions and development of long-term strategies that significantly influence the development process and standards.

  • Accountable for Design Reviews of Agile and Plan of Record (POR) projects as well as accountable for Code Reviews of Next Gen Architecture (NGA) projects and are expected to elaborate, promote and communicate Design Patterns applicable to NGA architectures and solutions.

  • Accountable for providing architectural guidance to the SDE's based on best practices and in alignment with CTO guidelines and platform.

  • Drive clarity and work with complete independence as business and or technical strategy is not defined.

  • Provide the corresponding architectural guidance, and conduct design reviews and code reviews based on the projects assigned to your LOB. The product definition and technical planning is out of scope.

  • Accountable for the overall strategy and for driving the teams inside and outside of your organization to deliver expected results. Drive mindful discussion with business and technical stakeholders that lead to timely decisions. Participate in discussions to drive smart trade-off decisions that balance efforts, delivery timelines, features, and technical constraints. Identify and remove blockers and always find the path forward in challenging situations.

  • Create plans that have a clear path to delivery. Solve for dependencies between agile and waterfall delivery efforts. Help your teams organize for delivery while maximizing resources for the greater good of the Digital organization. Understand engineering best practices and apply best practices to the software development lifecycle (SDLC)

  • Accountable for Design Reviews for Citi Agile and POR projects as well as Code Reviews for the NGA projects. Accountable for providing architectural guidance to the SDE's based on best practices and in alignment with CTO guidelines and platform. Elaborate, promote and communicate Design Patterns applicable to NGA architectures and solutions.

  • Elaborate, bring and communicate clear metrics on Design Reviews, Code Reviews, CI/CD and Design Patterns adoption.

  • Communicate progress, anticipate bottlenecks, provide escalation management, identify, assess, track and mitigate issues/risks at multiple levels. Recognize discordant views and take part in constructive dialog to resolve them.

  • Demonstrate the ability to implement continuous improvement and the induction of new technology. Demonstrate examples of influence in scrum teams beyond your own area of focus.

  • Appropriately assess risk when business decisions are made, demonstrating particular consideration for the firm's reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behavior, conduct and business practices, and escalating, managing and reporting control issues with transparency.


  • 6-10 years of relevant experience in an Apps Development role or senior level experience in an Enterprise Architecture role with subject matter expert in one or more areas.

  • Exhibit expertise in all aspects of technology by understanding broader patterns and techniques as they apply to Citi's internal and external cloud platforms (AWS, PCF, Akamai)

  • Lead resources and serve as a functional SME across the company through advanced knowledge of algorithms, data structures, distributed systems, networking, use of knowledge and experience to lead, architect, and drive broader adoption forward.

  • Acquire relevant technology and financial industry skills (AWS PWS) and understand all aspects of NGA technology including innovative approaches and new opportunities.

  • Demonstrate knowledge on automating code quality, code performance, unit testing, and build processing in the CI/CD.


  • Bachelor's/University degree, Master's degree preferred

Grade :All Job Level

  • All Job FunctionsAll Job Level

  • All Job Functions

  • US

Time Type :Full time

Citi is an equal opportunity and affirmative action employer.

Minority/Female/Veteran/Individuals with Disabilities/Sexual Orientation/Gender Identity.

Citigroup Inc. and its subsidiaries ("Citi") invite all qualified interested applicants to apply for career opportunities. If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity CLICK HERE.

To view the "EEO is the Law" poster CLICK HERE. To view the EEO is the Law Supplement CLICK HERE.

To view the EEO Policy Statement CLICK HERE.

To view the Pay Transparency Posting CLICK HERE.

Apply Now

icon no score

See how you match
to the job

Find your dream job anywhere
with the LiveCareer app.
Mobile App Icon
Download the
LiveCareer app and find
your dream job anywhere
App Store Icon Google Play Icon

Boost your job search productivity with our
free Chrome Extension!

lc_apply_tool GET EXTENSION

Similar Jobs

Want to see jobs matched to your resume? Upload One Now! Remove
Senior Site Reliability Engineer

Mastech Digital, Inc.

Posted 1 week ago

VIEW JOBS 1/11/2020 12:00:00 AM 2020-04-10T00:00 Location: Irving, TX Job Code: 207231 Posted: Jan 15, 2020 Description: Mastech Digital provides digital and mainstream technology staff as well as Digital Transformation Services for leading American Corporations. We are currently seeking a Senior Site Reliability Engineer for our client in the Federal Services domain. We value our professionals, providing comprehensive benefits, exciting challenges, and the opportunity for growth. This is a Contract position and the client is looking for someone to start immediately. Duration: 12+ Months Contract (Possible Contract to hire) Location: Irving TX Role: Senior Site Reliability Engineer Primary Skills: Software Enineering Role Description: The Senior Site Reliability Engineer would need to have at least 8+ years of experience. This role requires a full stack developer with experience in Java, Node.JS, Angular, JavaScript and Python. Looking for someone with extremely strong UI/UX, front-end development and troubleshooting skills. Any AI/ML experience with TensorFlow, PyTorch or AWS SageMaker would be a plus. Responsibilities: * Identify and fix issues with scalability, latency, throughput and bottlenecks * Utilize your knowledge of distributed systems to identify and fix network, system and service level issues * Automate to reduce toil * Logging and monitoring setup, be able to analyze logs, identify trends and analytics and use that to drive root cause analysis * Identify parts of the system that do not scale, provide immediate palliative measures and drives long term resolution of these incidents * Identify and implement changes for the product architecture from reliability, performance and availability perspective with a data driven approach * Identify Service Level Indicators (SLIs) that will align the team to meet the availability and latency objectives (SLOs) * Make monitoring alert on symptoms and not on outages * Design, build and maintain core infrastructure pieces that allow applications scaling to support thousands of concurrent users * Design and development of a high traffic, distributed, resilient, modern web application framework * Act as a subject matter expert for SRE when interfacing with external teams and business partners * Guide, mentor and lead the team in ensuring multiple conflicting priorities are resolved Must Have Skills: * Knowledge of development and troubleshooting in multiple application environments. This includes but is not limited to Linux/Unix, AWS, Kubernetes, and web servers such as Apache/NGINX. * Monitoring tools such as New Relic and synthetic monitoring tools such as Catchpoint * Experience with scripting tools such as Python, Bash, and Groovy. * Knowledge of programming and developing in web/server design such as JavaScript and Java * CICD and pipeline automation * Load balancing the application including proxies and CDN * Implement "Infrastructure as Code" * Knowledge of logging infrastructure, strategies and implementation using tools such as Splunk, ELK, etc. * Working with Databases such as Oracle and Postgres * Strong experience in Linux, networking, TCP/IP troubleshooting * Ability to work across diverse audiences and demonstrate leadership and independent thinking * Strong analytical and problem-solving skills, including the ability to work with large, disparate data sets, performing data mining, validation and analysis Education/Certifications: * B.S. in Computer Science or equivalent, Masters preferred, with 8+ years' work experience * Experience to include development, deployment and support on Linux platforms in large scale user environments Desired Skills: * AIOPs design, strategy and implementation * Have experience with Nginx, HAProxy, Docker, Kubernetes, Ansible, or similar technologies * Containerization platforms and tools such as Docker and Kubernetes with Helm * Experience as a full stack developer with emphasis on system engineering * Experience in SAFe and Agile methodologies * AWS certification would be a plus * New Relic certification would be a plus Education: Bachelor's degree in Computer Science, Electrical/Electronic Engineering, Information Technology or another related field or Equivalent. Experience: Minimum 8+ years Relocation: This position will not cover relocation expenses Travel: No Local Preferred: Yes Recruiter Name: Tarun Garg Recruiter Phone: 877.884.8834 (Ext: 2188)/412 200 1197 (Ext: 2188) Equal Employment Opportunity Mastech Digital, Inc. Irving TX

Digital S/W Engineer Lead Analyst Site Reliability Engineering

Citigroup Inc.