Enterprise HPC Systems Admin.

Full Time
El Paso, TX 79968
$75,500 a year
Posted
Job description

Responsible for designing, implementing, and administering High Performance Computing (HPC) clusters. Performs system administration duties on several high performance multi-platform clusters, cluster management, virtualization, cluster visualizations, and job scheduling

Essential Functions


Designs, implements and administers High Performance Computing (HPC) clusters. Performs system administration duties on several high performance multi-platform clusters, cluster management, virtualization, visualization clusters and job scheduling.

Facilitates the acquisition of hardware and software products and services for the Research and Academic Data Center (RADC) and Administrative Data Center (ADC). Monitors the availability of patches and updates and evaluates the importance to the environment and schedules installations accordingly.

Demonstrated ability to provision the various components of the system appropriately, batch, I/O and manage HPC nodes. Ability to use revision control systems and a configuration management system for an HPC environment.

Interacts effectively with a broad range of colleagues such as researchers, professors, research assistants, colleges and departments throughout campus. Supports a diverse user population from researchers, professors, research assistants, colleges and departments throughout campus with the administration and installation of HPC operating systems.

Provides reliable and efficient backups/restores for all managed systems in the RADC. Sets up and maintains host and network based security of the RADC and ADC resources. Coordinates with vendors to resolve hardware and software problems to systems in the RADC and ADC.

Participates in a 24-hour, 7-day on-call support rotation and off-hours maintenance windows. Manages software applications in the production environment provided to HPC users. Complies with all State and University policies.

Required Qualifications


Bachelor's degree; and three years related experience and training; or equivalent combination of education and experience.

Preferred Qualifications


In depth knowledge of common server hardware architecture including servers (CPU, bus, memory), SANS, disk arrays, network hardware. In depth understanding of Operating Systems (e.g., Windows, UNIX, Solaris, VAX/VMS), including processes, files, memory management and I/O systems; distributed information systems including 2 and 3 tier designs, and web based systems; networking services and protocols (e.g., TCP/IP, SSL, FTP, Telnet, LDAP). In depth understanding of IP networking, basic routing, TCP ports and network services, including SSH, LDAP, SFTP and HTTP(S). Basic understanding of change control and configuration management, patch management, high availability systems, structured design and support methodologies. Program system support tasks in C, Java, Perl, batch/shell, or other general purpose programming language; perform complex performance analysis including system processes, I/O subsystems, networks and other related components. Occasional travel may be required.

Five years related experience and have expert Enterprise Architecture engineering knowledge of clustered HPC environments; or equivalent experience. Must have experience as a systems administrator. Knowledge of Linux and UNIX operating systems, including scripting and programming proficiencies. Must have advanced ability to analyze, design and architect complex IT systems. Must have experience with multi-threading and parallel processing tools and environments. Demonstrate abilities in sustaining high-performance servers, associated high-performance networks/high speed interconnects and parallel file systems.

Experience installing and maintaining clustered environments, including automated installation methods. Demonstrated ability to proactively learn, adapt to and use new hardware/software technologies. Demonstrate experience in programming system maintain tasks in C, Java, Perl, batch/shell, or other general purpose programming language; perform complex performance analysis including system processes, I/O subsystems, networks and other related components. Additional experience with Fortran, C, C++, MPI, OpenMP, math libraries, log configuration and security plans in a Linux environment. Demonstrate the ability to configure and maintain the overall security of a HPC systems. Hardware maintenance and repair experience including isolation of failing components, removal and replacement of node level equipment, and component replacement such as disks, memory, or motherboards.

Must be organized with a strong ability to deliver tasks on time, manage multiple efforts and be able to work with minimal supervision.

Working Conditions


The physical demands described here are representative of those that must be met by an employee to successfully perform the essential functions of this job. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.

While performing the duties of this job, the employee is regularly required to sit and talk or hear. The employee is occasionally required to stand or walk. The employee must occasionally lift and move up to 50 pounds.

The work environment characteristics described here are representative of those an employee encounters while performing the essential functions of this job. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.

While performing the duties of this job, the employee is occasionally exposed to extreme cold and risk of electrical shock. The noise level in the work environment is usually moderate.

EO/AA Statement


In keeping with its Access and Excellence mission, The University of Texas at El Paso is committed to an open, diverse, and inclusive learning and working environment that honors the talents, respects the differences, and nurtures the growth and development of all. We seek to attract faculty and staff who share our commitment.

The University of Texas at El Paso is an Equal Opportunity / Affirmative Action Employer. The University does not discriminate on the basis of race, color, national origin, sex, religion, age, disability, genetic information, veteran status, or sexual orientation and gender in employment or the provision of services in accordance with state and federal law. Discrimination on the basis of sex includes an employee’s or prospective employee’s right to be free from sexual harassment under Title IX of the Higher Education Amendments of 1972.

For accommodation information for employees and applicants with disabilities, please contact UTEP's Equal Opportunity Office at eoaa@utep.edu.

gatheringourvoice.org is the go-to platform for job seekers looking for the best job postings from around the web. With a focus on quality, the platform guarantees that all job postings are from reliable sources and are up-to-date. It also offers a variety of tools to help users find the perfect job for them, such as searching by location and filtering by industry. Furthermore, gatheringourvoice.org provides helpful resources like resume tips and career advice to give job seekers an edge in their search. With its commitment to quality and user-friendliness, gatheringourvoice.org is the ideal place to find your next job.

Intrested in this job?

Related Jobs

All Related Listed jobs