Server Operations Engineer

Centific
$90,000 - $95,000

About The Position

About Centific Centific is a frontier AI data foundry that curates diverse, high-quality data, using our purpose-built technology platforms to empower the Magnificent Seven and our enterprise clients with safe, scalable AI deployment. Our team includes more than 150 PhDs and data scientists, along with more than 4,000 AI practitioners and engineers. We harness the power of an integrated solution ecosystem—comprising industry-leading partnerships and 1.8 million vertical domain experts in more than 230 markets—to create contextual, multilingual, pre-trained datasets; fine-tuned, industry-specific LLMs; and RAG pipelines supported by vector databases. Our zero-distance innovation™ solutions for GenAI can reduce GenAI costs by up to 80% and bring solutions to market 50% faster. Our mission is to bridge the gap between AI creators and industry leaders by bringing best practices in GenAI to unicorn innovators and enterprise customers. We aim to help these organizations unlock significant business value by deploying GenAI at scale, helping to ensure they stay at the forefront of technological advancement and maintain a competitive edge in their respective markets. About Job Responsibilities: Responsible for planning the server or rack, arranging the server mounting or rack installation, and following up the installation of the server operating system Responsible for the daily maintenance, troubleshooting, repair and follow-up break-fix of the server and other hardware Maintain data on internal systems including Asset Management, Ticketing, & rack elevations Work with remote vendors or junior SOE to solve hardware batch failures and problems Oncall on duty, responsible for dealing with the problems raised by the business owner side Collect and check online asset data Submit and track the part RMA or media destruction process Server network troubleshooting Server lifecycle management Other server operation related work

Requirements

  • Bachelor’s Degree in Computer science, Electrical engineering or any other relevant fields
  • Strong ability to work under pressure
  • Strong learning ability, broad technical interest
  • Strong sense of responsibility, full of enthusiasm for work
  • Good communication skills in English, Mandarin is preferred, good team work spirit
  • Ability to work independently
  • Knowledge of the interdependencies of data center functions and technologies
  • Experience with massive remote OS installation such as PXE boot
  • Can understand and run Shell or Python scripts
  • Familiar with simple automation tools, such as Ansible
  • Familiar with Linux system, able to locate hardware faults
  • Strong analytical and problem solving skills
  • Basic TCP/IP knowledge concepts – Subnetting, VLANs, DNS, Ipv6
  • Ability to perform general troubleshooting
  • Knowledge of out-of-band/lights-out server communication methods, such as IPMI and NCSI
  • Strong documentation skills and habits

Responsibilities

  • Responsible for planning the server or rack, arranging the server mounting or rack installation, and following up the installation of the server operating system
  • Responsible for the daily maintenance, troubleshooting, repair and follow-up break-fix of the server and other hardware
  • Maintain data on internal systems including Asset Management, Ticketing, & rack elevations
  • Work with remote vendors or junior SOE to solve hardware batch failures and problems
  • Oncall on duty, responsible for dealing with the problems raised by the business owner side
  • Collect and check online asset data
  • Submit and track the part RMA or media destruction process
  • Server network troubleshooting
  • Server lifecycle management
  • Other server operation related work
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service