Senior HPC Engineer

New York City
Posted 1 month ago

Job Title: Senior HPC Engineer
Location: New York, NY
Comp: Up to 400k

Responsibilities

The Senior HPC Engineer will be responsible for:

  • Researching, testing, recommending, implementing, and maintaining large-scale, resilient, distributed systems
  • Designing and maintaining a multi-petabyte distributed storage system
  • Optimizing resource utilization and job scheduling
  • Analyzing performance issues at scale
  • Troubleshooting node-level issues, such as kernel panics and system hangs
  • Documenting architecture and procedures for users and other members of the Systems team

Qualifications

If you have not used the following commands, please do not apply: vmstat, top, uname, ps, git, make, rpm, ping, tcpdump/wireshark.

The ideal candidate will have:

  • At least 5 years of experience in Linux administration in a financial services or research background
  • Hands-on knowledge of distributed filesystems, such as, GPFS, Lustre and object storage
  • Extensive experience with HPC or cloud scheduling, such as, GridEngine, HTCondor, SLURM, Mesos and Nomad
  • Experience with configuration management
  • Strong knowledge of local and distributed I/O performance tuning
  • Experience with open source applications to build enterprise-level systems
  • Previous working experience with x86 hardware testing and integration
  • Fluency in at least one scripting language and bash
  • Expert knowledge of SSH, iptables, NFS, DNS

Candidates should also be:

  • Organized, responsible, and meticulous
  • A strong communicator
  • Proactive and willing to take initiative
  • Able to manage and prioritize multiple tasks in a fast-paced environment
  • Able to work both with a team and independently

Job Features

Job CategoryFull Time

Apply Online

A valid phone number is required.
A valid email address is required.