Hadoop Administrator

Capgemini | Columbia, SC
6d | $56,186 - $106,050 | Hybrid

About The Position

We are looking for an experienced Hadoop Administrator (MapR) to manage and support our production-grade MapR Hadoop clusters. The ideal candidate will have hands-on experience running mission-critical, multi-tenant Hadoop environments, ensuring high availability, performance, and security across diverse workloads.

Requirements

  • 4+ years of hands-on experience as a Hadoop/MapR Administrator.
  • Strong knowledge of MapR-FS, MapR-DB, MapR Streams, YARN, Hive, Spark, and Kafka.
  • Experience managing production clusters with high availability requirements.
  • Proficiency in Linux administration (RHEL/CentOS/Ubuntu).
  • Expertise with multi-tenant Hadoop environments, resource governance, and user management.
  • Strong skills in shell scripting, automation, and performance tuning.
  • Experience with monitoring tools (Grafana, Nagios, Prometheus, MapR MCS).
  • Understanding of Hadoop security (Kerberos, SSL, ACLs, AD integration).

Nice To Haves

  • Experience migrating from MapR to other Hadoop distributions (Cloudera, Hortonworks).
  • Knowledge of cloud big data services (AWS/Azure/GCP).
  • Experience with CI/CD pipelines, DevOps tools, or containerized big data workloads.

Responsibilities

  • Manage, monitor, and optimize MapR-based Hadoop clusters in a production environment.
  • Oversee daily operations, including cluster health checks, performance tuning, capacity planning, and system upgrades.
  • Perform installation, configuration, patching, and version upgrades for MapR distributions and ecosystem tools.
  • Configure, support, and optimize multi-tenant setups, ensuring isolation, resource fairness, and performance SLAs.
  • Manage YARN queues, ACLs, quotas, and security policies per tenant requirements.
  • Collaborate with data engineering and analytics teams to onboard new tenants and align resource needs.
  • Diagnose and resolve cluster performance issues, bottlenecks, and failures across MapR-FS, YARN, Spark, Hive, Kafka, and related services.
  • Manage node failures, job failures, and cluster rebalancing while maintaining uptime.
  • Automate alerting, monitoring, and reporting using tools like MapR Control System (MCS), Grafana, Nagios, Prometheus, or similar.
  • Implement and manage security features such as Kerberos, Ranger/Sentry policies, SSL encryption, and user/role management.
  • Ensure compliance with internal and external standards for data governance and security.
  • Maintain reliable backup and DR strategies using MapR snapshots, mirrors, and failover configurations.
  • Ensure 24/7 availability of production environments and participate in the on-call rotation.
  • Develop automation for operational tasks using Python, Shell scripting, or Ansible.
  • Streamline cluster provisioning, monitoring, and deployment pipelines.

Benefits

  • Flexible work
  • Healthcare including dental, vision, mental health, and well-being programs
  • Financial well-being programs such as 401(k) and Employee Share Ownership Plan
  • Paid time off and paid holidays
  • Paid parental leave
  • Family building benefits like adoption assistance, surrogacy, and cryopreservation
  • Social well-being benefits like subsidized back-up child/elder care and tutoring
  • Mentoring, coaching and learning programs
  • Employee Resource Groups
  • Disaster Relief

What This Job Offers

  • Job Type: Full-time
  • Career Level: Mid Level
  • Education Level: No Education Listed
  • Number of Employees: 5,001-10,000 employees
