Job details


Apply Now


Job TitleHigh-Performance Computing (HPC) Systems Engineer
Companyaxle
Job LocationRockville, MD
Workplace Type
Job Type
Job Category
Min Pay0
Max Pay0
Pay Currency
Pay Cycle
Last Seen 21 hour(s) ago
Description (ID: 2025-0683) Axle is a bioscience and information technology company that offers advancements in translational research biomedical informatics and data science applications to research centers and healthcare organizations nationally and abroad. With experts in biomedical science software engineering and program management we focus on developing and applying research tools and techniques to empower decision-making and accelerate research discoveries. We work with some of the top research organizations and facilities in the country including multiple institutes at the National Institutes of Health (NIH).   Axle is seeking a High-Performance Computing (HPC) Systems Engineer to join our vibrant team at the National Institutes of Health (NIH) supporting the The National Center for Advancing Translational Sciences (NCATS) located in Rockville MD.   Benefits We Offer: 100% Medical Dental & Vision Coverage for Employees Paid Time Off and Paid Holidays 401K match up to 5% Educational Benefits for Career Growth Employee Referral Bonus Flexible Spending Accounts: Healthcare (FSA) Parking Reimbursement Account (PRK) Dependent Care Assistant Program (DCAP) Transportation Reimbursement Account (TRN)   Overview:  The High-Performance Computing (HPC) Systems Engineer will support the Scientific Computing and Informatics (SCI) team at The National Center for Advancing Translational Sciences (NCATS). This role will focus on the design optimization security and maintenance of HPC and cloud-based infrastructures that enable cutting-edge biomedical research through scalable secure and high-performing computing environments.      Responsibilities:  Design configure and maintain scalable HPC clusters for optimal performance.  Support documentation and ATO (Authority to Operate) processes.  Ensure infrastructure design compliance with federal security standards and best practices.  Implement monitoring tools such as XDMoD for transparency and user reporting.  Integrate platforms such as JupyterHub and job schedulers (e.g. Slurm) for improved interactivity.  Develop and manage AWS-based infrastructure using Terraform Packer and Ansible.  Automate deployment workflows to streamline provisioning updates and scaling.  Manage systems involved in AWS Secure Cloud Bridging (SCB) and STRIDES initiatives.  Implement CIS benchmark-aligned system hardening using OpenSCAP.  Administer optimized compute images (CPU/GPU) for scientific workflows.  Leverage tools such as OpenHPC Warewulf and Ansible for environment management.  Lead and coordinate quarterly patch cycles.  Partner with researchers and external stakeholders on critical projects.  Facilitate solution transitions to other NIH centers and collaborators.  Contribute to publications and team objectives through deep technical engagement.      Qualifications:  Federal ATO processes experience required  HPC architecture and performance optimization is required  Scientific software development and deployment  High-speed network and parallel file system architecture  Troubleshooting diagnostics and technical support  Strong communication and multitasking skills  Programming & Scripting:   Languages - Pascal BASIC Delphi Visual Basic C C++  Scripting - Bash Perl Python Ruby PEAR Tcl  Systems & Network Administration:   Linux – RHEL/CentOS SUSE Debian Ubuntu  Windows – 95–10 NT–Server 2016  Networking – Active Directory TCP/IP v4/v6 DHCP DNS WINS  Legacy – NOVELL 3.1–5 VPN Citrix Terminal Services  Monitoring & Management Tools:   Nagios Ganglia HP BAC Precise i3  SGI SMC HP PCM Bright Cluster Manager (incl. Data Analytics)  Infrastructure & Automation:   Puppet Cobbler Ansible Chef  Red Hat Satellite Kickstart RPM optimization  File Systems & Archiving:   Panasas (DirectFlow/panfs) DDN (GPFS) SGI DMF StorHouse/RFS (Filetek)  HPC Tools & Job Scheduling:   MOAB/MAUI Torque PBS Pro Windows HPC Scheduler  Visualization & Remote Access:   Nice DCV EnginFrame VNC OpenText Exceed OnDemand Web Remote Desktop  Containerization & GPU:   Docker Kubernetes Kubeflow NVIDIA DGX-1 GPU systems  Databases:   SQL Server (2000–2008) MySQL Zope  High-Speed Networking:   Infiniband Mellanox OFED Voltaire Force10  Proven experience in:  HPC architecture and performance tuning  Cybersecurity in HPC/cloud environments  Infrastructure as Code (AWS Terraform Ansible Packer)  Supporting scientific workflows in research environments        Disclaimer: The above description is meant to illustrate the general nature of work and level of effort being performed by individuals assigned to this position or job description. This is not restricted as a complete list of all skills responsibilities duties and/or assignments required. Individuals may be required to perform duties outside of their position job description or responsibilities as needed. The diversity of Axle’s employees is a tremendous asset. We are firmly committed to providing equal opportunity in all aspects of employment and will not tolerate any illegal discrimination or harassment based on age race gender religion national origin disability marital status covered veteran status sexual orientation status with respect to public assistance and other characteristics protected under state federal or local law and to deter those who aid abet or induce discrimination or coerce others to discriminate. Accessibility: If you need an accommodation as part of the employment process please contact: careers@axleinfo.com This role has a market-competitive salary with an anticipated base compensation range listed below. Actual salaries will vary depending on a candidate’s experience qualifications skills and location. #INDPSD Salary Range$150000 - $160000 USD
Apply Now