HPC Systems Engineer

Augusta University Augusta , GA 30903

Posted 2 months ago


About Us

Augusta University is Georgia's innovation center for education and health care, training the next generation of innovators, leaders, and healthcare providers in classrooms and clinics on four campuses in Augusta and locations across the state. More than 10,500 students choose Augusta for educational opportunities at the center of Georgia's cybersecurity hub and experiential learning that blends arts and application, humanities, and the health sciences. Augusta is home to Georgia's only public academic health center, where groundbreaking research is creating a healthier, more prosperous Georgia, and world-class clinicians are bringing the medicine of tomorrow to patient care today.

Our mission and values make Augusta University an institution like no other. We are part of the University System of Georgia.

Location

Augusta University-

Our Health Sciences Campus: 1120 15th Street, Augusta, GA 30912

College/Department Information

The mission of the Division of Information Technology at Augusta University is to anticipate and respond effectively to a changing world with agile, innovative, robust and secure services that educate and inspire students, empower clinicians, educators, researchers and administrators, advance learning, discovery and care.

Job Summary

Provide the wide range of technical experience required for all aspects of operational support of AU HPC services. Typical day-to-day tasks include monitoring, management, configuration and issue resolution for Linux HPC clusters, storage, and ancillary research technology systems.

On-demand tasks would include, but are not limited to, user management and support, application deployment, documentation, and consultation as a HPC SME. This is a customer facing position that will work directly with cluster users facilitating productive use of computational resources. This will include support of labs, training facilities and general IT office software and hardware. Cross-functional support with internal and external resources is key to success as well.

Responsibilities

Responsibilities include but are not limited to:

Support all aspects of AUHPCS operations, upgrades, patching, and expansion. Consult with users to adopt HPC environment best practices and facilitate productive use of HPC resources.

Assist users in job submission and results analysis where applicable during and after HPC jobs are completed. Attend meetings as a representative of the Technology Services division, gathering requirements, understanding objectives, and providing analysis and feedback.

  • Ensure availability of existing HPC resources, monitoring all resources using relevant metrics and make recommendations for required and/or desired expansion efforts.

  • Monitor HPC and ancillary systems reporting and mitigating issues utilizing external support resources when necessary.

  • Deploy scientific applications in clusters and assist in the deployment of user managed applications.

  • Provide technical support and documentation for HPC service security, recover-ability and access including deploying security, operating system, and cluster software patches.

  • Update step-by-step checklists for routine HPC functions.

  • Assist in documentation in support of maintenance, configuration, monitoring, and management of enterprise HPC components, devices, applications, and systems.

Establish and maintain key relationships with customers, vendors and business owners.

Work with vendors, contractors, an third parties to assess partnership opportunities and establish relationships.

Perform all other related duties/tasks as assigned.

Required Qualifications

Bachelor's degree from an accredited college or university with 3 years' experience administering complex LAN/WAN environments OR Associate's degree from an accredited college or university with 5 years' experience administering complex LAN/WAN environments OR High School diploma, GED or equivalent from a recognized State or Federal accrediting organization with 9 years' experience in administering complex LAN/WAN environments.

Preferred Qualifications

Strong experience administering Red Hat Enterprise Linux systems

Experience administering clusters utilizing Bright Computer Cluster Manager software

Experience with configuration management tools such as Ansible, Chef, Puppet, and Salt

Experience with HPC application deployment tools (e.g. Easybuild)

Experience with HPC based genomics/bioinformatics -applications

Experience participating in a formal project management framework

Knowledge, Skills, & Abilities

KNOWLEDGE

Experience in HPC orchestration stack installation, administration, and patching

Experience with Linux HPC clusters and workload managers, preferably SLURM

Experience with high performance storage and parallel file systems (e.g. GPFS, Lustre)

Experience in cloud based HPC implementations (e.g. Azure, VMWare)

Experience with MPI software such as Intel, openMPI, and MPICH/MVAPICH2

Experience with compiler software like Fortran, GCC, Intel, and NVIDIA

Experience with languages such as Perl, Python, R, C, Fortran, and CUDA

Experience with batch programming and Linux shell scripting

Experience with HPC fast interconnects (e.g. 10/40 Gigabit Ethernet, infiniband)

Experience with Intel, AMD, and NVIDIA hardware platforms used in HPC environments

Experience with CPU/GPU application environments and parallel invocation

Experience with scientific software workflows and pipelines

Experience with deploying and managing Linux applications in an LMOD framework

Experience building and customizing software from source

Experience with Linux container development, preferably Singularity

Experience with multi-tier HPC application deployment

Experience utilizing debugging applications, utilities, and logs for problem isolation

SKILLS

Efficient planning and execution skills

Strong interpersonal skills

Good presentation skills

Good knowledge of current HPC technologies

ABILITIES

Ability to work effectively to investigate and resolve complex hardware and software issues

Ability to help users recognize and mitigate job performance problems

Ability to successfully contribute to large complex projects with strict time constraints and budgets

Shift/Salary/Benefits

Shift: Days/M-F *Work outside of the normal business hours may be required.

Pay Grade: 20

Salary: Minimum $55,869/annually Midpoint $75,423/annually

Salary to be commensurate with qualifications of the selected candidate within the established range (generally minimum midpoint) of the position.

Recruitment Period: Until filled

Augusta University offers a variety of benefits to full-time benefits-eligible employees and some of our half-time (or more) employees.

Benefits that may be elected could include health insurance, dental insurance, life insurance, Teachers Retirement System (or Optional Retirement Plan), as well as earned vacation time, sick leave, and 13 paid holidays.

Also, our full-time employees who have been employed with us successfully for more than 6 months can be considered for the Tuition Assistance Program.

Consider applying with us today!

Conditions of Employment

All selected candidates are required to successfully pass a Background Check review prior to starting with Augusta University.

If applicable for the specific position based on the duties: the candidate will also need to have a credit check completed for Positions of Trust and or approved departmental Purchase Card usage. Motor vehicle reports are required for positions that are required to drive an Augusta University vehicle.

For Faculty Hires: Final candidates will be required to provide proof of completed academic degree(s) as well as post-secondary coursework in the form of original transcript(s). Those candidates trained by a foreign institution will also be required to provide an educational/credential evaluation.

All employees are responsible for ensuring the confidentiality, availability, and integrity of sensitive [patient, student, employee, financial, business, etc.] information by exercising sound judgment and adhering to cybersecurity and privacy policies during their employment and beyond.

Other Information

This position is also responsible for promoting a customer-friendly environment and providing superior service to our patients, students, faculty, and employees. "Augusta University is a patient-and family-centered care institution, where employees partner every day with patients and families for success."

Augusta University is a tobacco-free environment, and the use of any tobacco products on any part of the campus, both inside and outside, is strictly prohibited.

Equal Employment Opportunity

Augusta University is proud to be an equal opportunity employer welcoming applicants from underrepresented groups, including individuals with disabilities and veterans.

How To Apply

Consider applying with us today!

https://www.augusta.edu/hr/jobs/

Select University Faculty & Staff > External Applicants if you are a candidate from outside the university.

Select University Faculty & Staff > Internal Applicants if you are a current university employee.

Search for job ID 265514

If you need further assistance, please contact us at 706-721-9365


icon no score

See how you match
to the job

Find your dream job anywhere
with the LiveCareer app.
Mobile App Icon
Download the
LiveCareer app and find
your dream job anywhere
App Store Icon Google Play Icon
lc_ad

Boost your job search productivity with our
free Chrome Extension!

lc_apply_tool GET EXTENSION

Similar Jobs

Want to see jobs matched to your resume? Upload One Now! Remove
Senior Linux HPC Systems Engineer

Oak Ridge National Laboratory

Posted 2 weeks ago

VIEW JOBS 4/12/2024 12:00:00 AM 2024-07-11T00:00 Requisition Id 12859 Due to the security clearance requirements of this position, we are unable to consider EAD and Green Card holders. Additionally, Visa spo Oak Ridge National Laboratory Oak Ridge TN

HPC Systems Engineer

Augusta University