Failure Analysis Engineer

 

Description:

Minimum Qualification:

 

8+ years of experience in High Performance Computing Design, implementation and Support

Strong flavor agnostic Linux System Administration Experience

Strong Scripting experience for automation using Python, Bash, Perl

Good experience with xCat and BCM (Bright Cluster Manager)

Good working experience in Parallel file system setup and high speed interconnects

Strong knowledge in HPC cluster benchmarking like HPL, IOR, OSU

Experience in installation of HPC job schedulers, setting up various scheduling policies and application integration with scheduler

Experience with Configuration management tools like Chef, Ansible, Puppet

Knowledge in setting up HPC workloads in Cloud – AWS/AZURE

Experience in license management like flexlm, RLM

Experience in installing, profiling and running opensource applications

Good communication and project management skills

Knowledge in Tensor-Flow, CUDA, R-Studio, R and Docker

Organization Ubique Systems
Industry Other Jobs
Occupational Category HPC Admin
Job Location Dublin,Ireland
Shift Type Morning
Job Type Full Time
Gender No Preference
Career Level Experienced Professional
Experience 8 Years
Posted at 2024-11-21 4:15 pm
Expires on 2025-01-05