DevOps Engineer (GPU Bare Metal) Job at Cloud Analytics Technologies LLC, Houston, TX

  • Cloud Analytics Technologies LLC
  • Houston, TX

Job Description

Required Key Skills: 
Bare Metal, Open-Source Virtualization (QEMU/KVM etc.), Linux/Unix, Automation/Orchestration (Puppet/Chef/Ansible etc.), DevOps, AI/ML

We need engineers skilled across multiple areas to support both the GPU Bare Metal and GPU VM products. We also need engineers skilled in AI/ML DevOps, as well as Linux sysadmins.

GPU Bare Metal

Required Skills

  • Proven ability to orchestrate bare metal Linux systems at scale, including building automation for firmware updates, managing BIOS configuration, and configuring PXE environments.
  • Deep Linux systems experience, including troubleshooting network interfaces, developing and applying configuration management, following security best practices, and setting up monitoring and alerting.
  • Strong automation mindset. Expert knowledge of one or more orchestration tools such as MAAS, Salt, Chef, Ansible or Puppet.
  • Strong communication skills. Your job will involve writing detailed documentation for others to pick up and leading knowledge-sharing sessions with operations teams.
  • Bonus skills include:
    • Hands-on experience with High Performance Computing (HPC) clustered environments from NVIDIA or AMD. Experience performing automated, wide-scale testing with NCCL or other frameworks.
    • Network engineering experience with VyOS platforms. 

What You'll Be Working On:

  • Provisioning and automating GPU Bare Metal deployments
  • DevOps: assisting customer support and CloudOps teams with GPU-specific knowledge and debugging during customer escalations
  • Performance testing, analysis and monitoring
  • Firmware, BIOS, and kernel upgrades and testing

GPU Virtual Machines

Required Skills

  • Strong understanding of Linux-based operating systems
  • Deep experience with the internals of QEMU, KVM, the Linux kernel, and libvirt. Strong proficiency in C.
  • Strong knowledge of DO’s proprietary services and how they intersect with our virtualization stack.
