About The Position

Citi, a leading global bank with approximately 200 million customer accounts and operations in over 160 countries, is seeking a Big Data Engineer (Assistant Vice President) to join its Enterprise Operations & Technology teams. These teams are responsible for developing and maintaining the technology solutions that underpin all of Citi's operations, from ensuring security and managing global resources to providing employees with the necessary tools and designing digital architecture for a first-class customer experience. The role involves reimagining client and partner experiences to deliver excellence through secure, reliable, and efficient services. Citi is committed to diversity and fosters an environment where the best people want to work, valuing respect, merit-based promotion, and opportunities for personal development. Ideal candidates are innovators with well-rounded backgrounds who bring their authentic selves to work and complement the company's results-driven culture.

Requirements

  • 5-8 years of relevant experience
  • In-depth understanding of HDFS architecture, data storage, and fault tolerance mechanisms.
  • Experience with HDFS commands and administration.
  • Solid understanding of YARN resource management and job scheduling.
  • Fundamental understanding of MapReduce programming paradigm, even if primary development is in Spark/Flink.
  • Knowledge of Zookeeper for distributed coordination services.
  • Strong proficiency in Spark Core, Spark SQL, Spark Streaming, and Spark GraphX (beneficial).
  • Expert-level programming skills in Scala, specifically for developing Spark applications.
  • Experience with Spark performance optimization techniques (e.g., caching, partitioning, shuffle optimizations, memory management).
  • Familiarity with deploying Spark applications on YARN, Mesos, or Kubernetes.
  • Advanced proficiency in writing complex HiveQL queries for data analysis and ETL processes.
  • Understanding of Hive metastore, execution engines (MapReduce, Tez, Spark), and storage formats (ORC, Parquet, Avro).
  • Experience in optimizing Hive queries and table designs for performance.
  • Strong object-oriented and functional programming skills.
  • Experience with Scala build tools (SBT, Maven).
  • Knowledge of common Scala libraries and frameworks.
  • Experience with PySpark for data processing.
  • Familiarity with data manipulation libraries (Pandas, NumPy).
  • Scripting for automation and data orchestration.
  • Complex query writing, subqueries, window functions, and performance tuning.
  • HBase (for real-time access to large datasets within Hadoop).
  • Cassandra, MongoDB, or similar.
  • Familiarity with RDBMS concepts and SQL for data integration.
  • Understanding of dimensional modeling, fact and dimension tables, star/snowflake schemas.
  • Data Ingestion Tools: Apache Sqoop, Apache Flume, Kafka
  • Workflow Orchestration: Apache Oozie, Apache Airflow
  • Experience with AWS (EMR, S3, Glue, Lambda), Azure (HDInsight, Data Lake, Databricks), or Google Cloud Platform (Dataproc, BigQuery).
  • Version Control: Git (GitHub, GitLab, Bitbucket).
  • CI/CD: Experience with Jenkins, GitLab CI, Azure DevOps, or similar tools.
  • Monitoring and Logging: ELK Stack (Elasticsearch, Logstash, Kibana), Grafana, Prometheus.
  • Agile Development: Familiarity with Agile/Scrum methodologies.
  • Shell Scripting: For automation and system administration tasks.
  • Bachelor’s degree/University degree or equivalent experience

Responsibilities

  • Conduct tasks related to feasibility studies, time and cost estimates, IT planning, risk technology, applications development, model development, and establish and implement new or revised applications systems and programs to meet specific business needs or user areas.
  • Monitor and control all phases of the development process and analysis, design, construction, testing, and implementation as well as provide user and operational support on applications to business users.
  • Utilize in-depth specialty knowledge of applications development to analyze complex problems/issues, provide evaluation of business process, system process, and industry standards, and make evaluative judgment.
  • Recommend and develop security measures in post-implementation analysis of business usage to ensure successful system design and functionality.
  • Consult with users/clients and other technology groups on issues, recommend advanced programming solutions, and install and assist customer exposure systems.
  • Ensure essential procedures are followed and help define operating standards and processes.
  • Serve as advisor or coach to new or lower-level analysts.
  • Operate with a limited level of direct supervision, exercising independence of judgment and autonomy.
  • Act as SME to senior stakeholders and/or other team members.
  • Appropriately assess risk when business decisions are made, demonstrating particular consideration for the firm's reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behavior, conduct and business practices, and escalating, managing and reporting control issues with transparency.

Benefits

  • medical, dental & vision coverage
  • 401(k)
  • life, accident, and disability insurance
  • wellness programs
  • paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays
© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service