General Information
Job Description | SYS ADM 3 | Working Title | Linux System Administrator |
---|---|---|---|
Job Code | 007304 | Grade | 23 |
Department Name | RED Research Centers - D01238 | Department Head | |
Supervisor | | Effective Date | 03/01/2021 |
Position(s) Directly Supervised
Job Code | Title | FTE |
---|---|---|
Generic Scope
Experienced professional who knows how to apply theory and put it into practice with in-depth understanding of the professional field; independently performs the full range of responsibilities within the function; possesses broad job knowledge; analyzes problems/issues of diverse scope and determines solutions. |
Custom Scope
Applies skills as a seasoned, experienced systems infrastructure professional with a full understanding of industry best practices and campus, medical center or Office of the President policies and procedures to resolve a wide range of issues that are moderately complex in scope. Selects methods and techniques to obtain solutions. Evaluates new technologies including performing simple to moderate cost/benefit analyses. |
Department Custom Scope
UC Riverside's (UCR) research computing infrastructure is provided by a central high-performance computing (HPC) facility. This facility operates Linux clusters with over 6,000 CPU cores, 75 TB of total system RAM and several GPU nodes. Big Data storage is handled by a centralized GPFS-based storage cluster with over 4 PB of disk space for production and backup storage. The incumbent will be part of a team responsible for the systems administration of this HPC infrastructure, including development of software for parallel computing, network management, data security and user training. |
Education & Experience Requirements
Education Requirements
Degree | Requirement |
---|---|
Bachelor's degree in related area and/or equivalent experience/training. | Required |
Advanced degree. | Preferred |
Experience Requirements
Experience | Requirement |
---|---|
License Requirements
Certification Requirements
Certification | Requirement |
---|---|
Educational Condition Requirements
Condition | Requirement |
---|---|
Key Responsibilities
Description | % Time |
---|---|
Defines, designs and implements systems, services and technology solutions. Proposes and implements system or device enhancements such as software, hardware and network configuration, updates and installations for projects or services of moderately complex scope. | 15 |
Manages systems and services for a facility of moderate size and makes recommendations for purchase or upgrade of new computer hardware, software and services. Performs moderately complex analysis to acquire, install, modify and support operating systems, databases, utilities and Internet / intranet-related tools. Plans, designs and implements moderately complex system updates and rollouts. May perform moderately complex networking tasks and interoperability assessments for interconnected servers or components of clusters for communication. | 15 |
Writes and executes complex scripts and may write software in support of systems management, log analysis and other system administration duties for multiple integrated systems. | 20 |
Performs complex security control activities to prevent unauthorized access to networked resources. May assist with maintenance of security systems for network equipment and provide recommendations on network access controls. | 20 |
Provides training for users of UCR's HPC infrastructure in the form of in-person training, instruction of user workshops, and development of online user manuals. | 10 |
Compiles software using multiple compilers, including support for MPI, MKL, LAPACK, BLAS, and Boost. Addresses kernel-level issues, user account management and driver compatibility. | 20 |
Knowledge, Skills & Abilities
Knowledge/Skill/Ability | Requirement |
---|---|
Ability to elicit and communicate technical and non-technical information in a clear and concise manner. | Required |
Self-motivated and works independently and as part of a team. Demonstrates problem-solving skills. Able to learn effectively and meet deadlines. | Required |
Basic knowledge of how to apply technologies and systems to meet business needs. | Required |
Ability to write technical documentation in a clear and concise manner. | Required |
Understanding of system performance monitoring and actions that can be taken to improve or correct performance. | Required |
Knowledge of the design, development and application of technology and systems to meet business needs. | Required |
General knowledge of other areas of IT. Thorough understanding of and experience with systems-related issues and actions that can be taken to improve or correct performance. | Required |
Demonstrated skills associated with adapting equipment and technology to serve user needs. Demonstrated comprehensive understanding of how system management actions affect other systems, system users and dependent/related functions. | Required |
Demonstrated experience writing and editing complex scripts used to perform system maintenance and administration. | Required |
Advanced knowledge of computer security best practices and policies including demonstrated experience securing server-based software. | Required |
Knowledge of Linux systems, kernels, and architectures of HPC systems, networks and large-scale storage systems. | Preferred |
Knowledge of software management and compilation under Linux OSs and their optimization for HPC environments. | Preferred |
Knowledge of queuing systems and workload managing software, such as Slurm, Torque, SGE or similar. | Preferred |
Special Requirements & Conditions
Special Condition | Requirement |
---|---|
Must pass a background check. | Required |
Other Special Requirements & Conditions
|
Level of Supervision Received
General Supervision |
Environment
Working Environment
Campus |
Other Requirements
Items Used
|
Physical Requirements
|
Mental Requirements
|
Environmental Requirements
|
Critical Position
Is Critical Position: No |