HPC Administrator

Apex Systems, Inc Rockville , MD 20850

Posted 2 months ago

Apex Systems Inc. is immediately seeking a Linux Engineer

  • HPC (High Performance Computing) who is a self-starter, highly organized and has a strong drive for quality. This position will support one of our highly regarded clients in the government arena, and offer a strong upside for growth.

    If you want to work with a dynamic group of dedicated, technical professionals on a collaborative team that supports a life-saving mission of global importance, we encourage you to apply. Position: Linux Engineer

  • HPC Location: Greater Washington DC Area Clearance:

    Public Trust Timeline: Immediately Schedule: Monday-Friday Linux Engineer

  • HPC Job Description:
    You will be hands on with the design, build, and configuration of services necessary for research computing, and underlying scripting capabilities.

    This role requires regular, close collaboration with Data Center, VmWare, Oracle, Storage, Backup, SQL Administrators, as well scientific investigators. This is a customer service focused environment with the associated business needs of an advanced research environment, requiring high reliability and 24 x 7 up time. Role & Responsibilities for Linux Engineer

  • HPC:
    Provide hands on design, build, and configuration of services necessary for research computing, and underlying scripting capabilities
    Diagnose hardware and software problems, and replace defective components
    Perform data backups and disaster recovery operations.
    Maintain and administer computer networks and related computing environments, including computer hardware, systems
    Collaborate with other Systems Administrators who oversee the Data Center, VmWare, Oracle, Storage, Backup, SQL Administration, as well scientific investigators.
    Use strong analysis and decision-making skills to conduct briefings and participate in technical cross-functional meetings
    Focus on process and documentation with a demonstrated understanding of change and configuration management principles in support of a validated environment.
    You'll be utilizing an exceptional Linux/UNIX scope and associated project management skillset as well.
    Requirements for Linux Engineer

  • HPC:
    In depth knowledge of UNIX/Linux HPC systems, system administration tools, methodologies, and security practices. CentOS and Scientific Linux Knowledge.
    Domain knowledge in HPC and system software such as cluster management/provisioning tools, job schedulers, MPI, etc
    In-depth understanding of one or more of the technology fields in HPC such as storage (FC, SAS, iSCSI, FCoE), high speed interconnects (InfiniBand, 10GigE, etc), cluster file systems (pNFS, Lustre, PVFS), provisioning tools, MPI.
    Proactive communication to manage scope and schedule for multiple priorities
    Participate and adhere to change management processes
    Customer Service Focus Business Needs (Support Science), Reliability, Up Time (24 x7) etc
    Resolve escalated issues from Tier 2 and Tier 3 to assist desktop support team
    Design and implement disaster recovery plans, implement backups with tools like Networker.

Actively monitor IT maintained systems with monitoring tools such as xymon Be available on page for monitors and alerts during after hours Rack and stack servers and network equipment
RPM / Package building
Versioning experience in code and packages, including use of "diff" files
Exeperience writing shell scripts in a production *nix environment
Programming languages (Python, C, Ruby, Perl, R) CFEngine or Puppet

Bachelors degree in Computer Science, Information Systems, Engineering; equivalent professional experience may be considered in lieu of formal education
RHCE (RHEL 7); CCNA & MCP desired
Ability to obtain Public Trust Clearance
Please note that as a member of the Apex Systems team, youd be eligible for Health, Dental, Vision and Life Insurance; Short Term Disability; Hospitalization Coverage; Direct Deposit; Weekly Pay Periods; Training and Development Programs; and our Referral Program.

Skills:
HPC, Red Hat
Permanent



See if you are a match!

See how well your resume matches up to this job - upload your resume now.

Find your dream job anywhere
with the LiveCareer app.
Download the
LiveCareer app and find
your dream job anywhere
lc_ad

Boost your job search productivity with our
free Chrome Extension!

lc_apply_tool GET EXTENSION

Similar Jobs

Want to see jobs matched to your resume? Upload One Now! Remove
HPC Engineering Lead

Saic

Posted 4 weeks ago

VIEW JOBS 10/18/2018 12:00:00 AM 2019-01-16T00:00 HPC Engineering Lead (Job Number:441002) Description: JOB DESCRIPTION: This position will provide technical leadership and systems engineering management for High Performance Computing. This role will be responsible for systems planning, system and lower level requirements development, system design, analyses and trade studies as it relates to High Performance Computing (HPC) solutions that provide a robust, reliable, cost-effective, and scalable infrastructure for Next Generation Gene Sequencing and other high throughput data analysis requirements. Support shall include providing planning, analysis, design, development, testing, configuration, installation, implementation, integration, maintenance and management of all HPC related hardware, software and infrastructure. Job Specific Responsibilities: * Provide efficient and effective HPC Support of HPC-related hardware, software, and infrastructure components to maximize the performance and availability of HPC solutions. * Provide timely and effective maintenance and repair support on HPC-related hardware, software, and infrastructure components to maximize the performance and availability of HPC solutions. * Provide efficient performance monitoring of all HPC- related hardware, software, and infrastructure components, including the issuance of timely and accurate notification of HPC-related issues. * Provide after-hours monitoring and timely resolution commensurate with the mission criticality of the affected system(s). * Support, maintain, and enhance as required, established strategies for applications hosting to ensure continuity of business operations and timely recovery in the event of disaster. * Collect, store, and analyze data relevant to HPC solutions to perform and report accurate root causal analysis of all related issues and support trend analysis and forecasting. * Effectively manage the procurement and maintenance of all supplies, materials, and supporting software licenses and service agreements required to ensure supported HPC-related hardware, software, and infrastructure components to maximize the performance and availability of HPC solutions. * Ensure effective change and configuration management of all supported HPC solutions to establish and maintain consistency of their performance, security, and functional and physical attributes with approved requirements, design, and operational information throughout its life. * Assist in the development and maintenance of standard operating procedures for operation, maintenance, and repair of HPC hardware, software, and infrastructure components. * Ensure all HPC-related data and documentation is added to and maintained current within the Knowledge Database and Document Library to provide efficient access to a complete and current source of operationally relevant structured and unstructured data. VENDORS/TOOLS/MANAGEMENT SYTEMS * Cisco Systems, Nexus * Brocade * Microsoft Windows * Linux * VMware * EMC * DDN * Dell * IBM TECHNICAL SPECIALTIES * Performance Management * Requirements Development * High Performance Computing Cluster * HPC cluster and management tool, job schedulers * SAN-Storage * Security * Systems Architecture * Network Systems Architecture * Communication Systems T ECHNOLOGIES * Cisco Systems * Routing/Switching * Networking * Unified Communications * VMware * Unified Communications (including Skype for Business and Polycom VT) Qualifications: REQUIRED QUALIFICATIONS & EXPERIENCE * Bachelor Degree in Computer Science, Computer Engineering or related field * Minimum of 10 years of experience as Information Systems Engineer * Minimum seven years HPC experience * In depth knowledge of HPC cluster and software such as cluster management/provision tools, job schedulers(SGE, PBS, etc), parallel file system, MPI, MPICH etc * Deep understanding and knowledge of HPC technology such as storage, high speed interconnects, infiniBand, 10GigE etc. cluster file systems (GPFS, Lustre, etc) * Experience with scientific computing support include scientific computing software and application support. Experience with bioinformatics, biomechanics software and application support is a plus. * Eight to Ten Years of Experience in Team Leadership * Five Years of Experience providing formal documentation and Briefings * Public Trust L5 Security Clearable DESIRED QUALIFICATIONS & EXPERIENCE * Master of Science in Computer Science, Computer Engineering * Graduate Degree in Computer Science, Computer Systems Engineering * Experience with or knowledge of HHS EPLC SAIC Overview:SAIC is a premier technology integrator providing full life cycle services and solutions in the technical, engineering, intelligence, and enterprise information technology markets. SAIC is Redefining Ingenuity through its deep customer and domain knowledge to enable the delivery of systems engineering and integration offerings for large, complex projects. SAIC's approximately 15,000 employees are driven by integrity and mission focus to serve customers in the U.S. federal government. Headquartered in Reston, Virginia, SAIC has annual revenues of approximately $4.5 billion. For more information, visit saic.com. For information on the benefits SAIC offers, see My SAIC Benefits. EOE AA M/F/Vet/Disability Job Posting: Oct 16, 2018, 11:30:55 AM Primary Location: United States-MD-ROCKVILLE Clearance Level Must Currently Possess: None Clearance Level Must Be Able to Obtain: Other Clearance Potential for Teleworking: No Travel: None Shift: Day Job Schedule: Full-time Saic Rockville MD

HPC Administrator

Apex Systems, Inc