Position Details




View All Details Back To Current Job Opportunities Position Closed

Senior HPC Systems Administrator

Job Number Full/Part Time Schedule Salary
26796931 Full Time 8AM - 5PM $96,700 - $184,100

Position Information

The High-Performance Computing Center (HPCC) at the University of California, Riverside (UCR) has an opening for a Senior HPC Systems Administrator. In this exciting leadership position, you will manage state-of-the-art research computing infrastructure in support of the science conducted by researchers at UCR. The Senior HPC Administrator provides technical leadership for UCR's largest high-performance computing (HPC) infrastructure, manages a complex portfolio of responsibilities at a campus-wide level, and advises upper administration on strategic decisions in research computing. The HPCC enables cutting-edge research in a wide range of science, engineering and biomedical disciplines by providing the computing hardware, software and expertise to enable pioneering discoveries. UCR is a vibrant research and teaching university with a diverse student, staff and faculty body located in beautiful Southern California. UCR is an equal opportunity employer that values and respects the importance of a diverse and inclusive workforce. In this position you won’t work alone, instead you will be part of a creative, dynamic work environment where you will collaborate with supportive colleagues.

RESPONSIBILITIES
○ Support, maintain, enhance, and expand the Linux-based HPC cluster consisting of hundreds of physical CPU/GPU nodes with thousands of cores, a multi-petabyte parallel big data storage system with backup and a high-speed internal network.

○ Supervise HPC facility staff.

○ Monitor, optimize and troubleshoot performance and functionality of the infrastructure.

○ Manage security of all HPC, networking and storage components in accordance with university policy and best practices.

○ Install, maintain, and troubleshoot research and general HPC environment software.

○ Automate and document processes throughout the HPC infrastructure including upgrades, software installs, and deployments of new hardware and services.

○ Develop and publish user and technical documentation on the use of systems. Directly support researchers, course instructors, and students to enhance success within the HPC environment. Participate in training sessions instructing users best practices for running research applications on HPC systems and managing big data storage.

MINIMUM QUALIFICATIONS
○ Bachelor’s degree in a computational field, followed by 6 years of post-baccalaureate work experience, which includes at least 3 years of Linux and/or HPC administration in a professional environment, or an equivalent combination of education and experience.
○ Excellent team and outreach abilities to network and collaborate with key contacts outside their own area of expertise.
○ Fluency in two or more programming languages and environments used in research computing such as Bash, Python, C/C++, R, Java, Tensorflow, PyTorch, Jupyter Notebooks, Rstudio Server, and Matlab.
○ Commitment to lifelong learning.

ADDITIONAL DESIRED QUALIFICATIONS
○ Experience supervising a team of computational experts.
○ Experience configuring and fine-tuning job schedulers and resource managers (Slurm, PBS, etc.).
○ Experience with parallel programming and computing on Linux clusters using C/C++, Fortran, Python, MPI, OpenMP, multithreading and multicore technologies on CPU and GPU architectures.

OTHER POSITION DETAILS
Offer will be based on the successful candidate's education and related experience.

Some of the job duties can be performed remotely with some in-person requirements.


**

Education

Education Requirements

Degree Requirement
Bachelor's degree in related area and/or equivalent experience/training. Required

Experience

Experience Requirement
6 - 10 years of related experience. Required
Experience supervising a team of computational experts. Preferred
Experience configuring and fine-tuning job schedulers and resource managers (Slurm, PBS, etc.). Preferred
Minimum of 3 years of Linux and/or HPC administration in a professional environment. Required
Experience with parallel programming and computing on Linux clusters using C/C++, Fortran, Python, MPI, OpenMP, multithreading and multicore technologies on CPU and GPU architectures. Preferred

Special Conditions

Special Condition Requirement
Must pass a background check. Required

Minimum Requirements

General knowledge of other areas of IT. Thorough understanding of and experience with systems-related issues and actions that can be taken to improve or correct performance.
Demonstrated skills associated with adapting equipment and technology to serve user needs. Demonstrated comprehensive understanding of how system management actions affect other systems, system users and dependent/related functions.
Advanced experience writing and editing the most complex scripts used to perform system maintenance and administration.
Basic knowledge of how to apply technologies and systems to meet business needs.
Ability to write technical documentation in a clear and concise manner.
Understanding of system performance monitoring and actions that can be taken to improve or correct performance.
Demonstrated advanced knowledge, skills and abilities associated with system problem identification and resolution. Experience with design, configuration, operation, repair, and tuning of technology systems.
Knowledge of the design, development and application of technology and systems to meet business needs.
Self-motivated and works independently and as part of a team. Demonstrates problem-solving skills. Able to learn effectively and meet deadlines.
Ability to elicit and communicate technical and non-technical information in a clear and concise manner.
Experience leading a team of IT professionals.
Advanced knowledge of computer security best practices and policies including demonstrated experience securing most complex server-based software.
Excellent team and outreach abilities to network and collaborate with key contacts outside their own area of expertise.
Fluency in two or more programming languages and environments used in research computing such as Bash, Python, C/C++, R, Java, Tensorflow, PyTorch, Jupyter Notebooks, Rstudio Server, and Matlab.

Preferred Qualifications

Extensive experience in instructing user workshops for HPC systems, usage of Linux environments, programming languages, and big data management using local and/or cloud computing solutions.

Ability and hands-on experience in developing and maintaining web-based user documentation for HPC systems.

Advanced understanding and hand-on experience in administering and optimizing HPC networks and switches, such as Infiniband networks.

Advanced understanding and hand-on experience in administering and optimizing parallel storage systems with many PBs (1PB = 1000TB) of storage space.

Advanced understanding and hand-on experience in administering and optimizing complex HPC systems in large user environments with hundreds of users.

Additional Information

In the Heart of Inland Southern California, UC Riverside is located on nearly 1,200 acres near Box Springs Mountain in Southern California; the park-like campus provides convenient access to the vibrant and growing Inland region. The campus is a living laboratory for the exploration of issues critical to growing communities' air, water, energy, transportation, politics, the arts, history, and culture. UCR gives every student, faculty and staff member the resources to explore, engage, imagine and excel.

UC Riverside is recognized as one of the most ethnically diverse research universities in the country boasting several key rankings of which we are extremely proud.

  • UC Riverside is proud to be ranked No. 12 among all U.S. universities, according to Money Magazine's 2020 rankings, and among the top 1 percent of universities worldwide, according to the 2019-20 Center for World University rankings.

  • UC Riverside is the top university in the United States for social mobility. - U.S. News 2020

  • UCR is a member of the University Innovation Alliance, the leading national coalition of public research universities committed to improving student success for low-income, first-generation, and students of color.

  • Among top-tier universities, UC Riverside ranks No. 2 in financial aid. - Business Insider 2019

  • Ranked No. 2 in the world for research, UCR's Department of Entomology maintains one of the largest collections of insect specimens the nation. - Center for World University Rankings

  • UCR's distinguished faculty boasts 2 Nobel Laureates, and 13 members of the National Academies of Science and Medicine.


The University of California is an Equal Opportunity/Affirmative Action Employer with a strong institutional commitment to the achievement of excellence and diversity among its faculty and staff. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, protected veteran status, or any other characteristic protected by law.

For information about our generous employee benefits package, visit: Employee Benefits Overview

Job Description Details

More Information

General Campus Information

University of California, Riverside
900 University Ave.
Riverside, CA 92521
Tel: (951) 827-1012

Career OpportunitiesUCR Libraries
Campus StatusMaps and Directions

Department Information

Human Resources
1160 University Ave.
Riverside, CA 92521

Fax: (951) 827-6493
E-mail: jobshelp@ucr.edu

Footer