Senior Big Data Engineer

CitiTampa, FL
10d

About The Position

Citi, the leading global bank, has approximately 200 million customer accounts and does business in more than 160 countries and jurisdictions. Our core activities are safeguarding assets, lending money, making payments and accessing the capital markets on behalf of our clients. Citi’s Mission and Value Proposition explain what we do and Strategy explain how we do it. Our mission is to serve as a trusted partner to our clients by responsibly providing financial services that enable growth and economic progress. We strive to earn and maintain our clients’ and the public’s trust by constantly adhering to the highest ethical standards and making a positive impact on the communities we serve. We're seeking a talented Senior Big Data Engineer to join our data engineering team. You'll play a critical role in optimizing our existing data pipelines and building new high-performance solutions that power analytics and business intelligence across the department. This is an opportunity to work with large-scale distributed systems and make a measurable impact on data processing efficiency.

Requirements

  • 6+ years of hands-on experience with Hadoop ecosystem technologies
  • Experience in managing and implementing successful projects
  • Strong proficiency in Apache Spark with Scala development
  • Solid experience with Hive and Impala for large-scale data querying
  • Understanding of distributed computing principles and data partitioning strategies
  • Experience optimizing Spark jobs and SQL queries for performance
  • Proficiency with version control (Git) and CI/CD practices
  • Proficiency with streaming frameworks (Kafka)

Nice To Haves

  • Exposure to cloud platforms and technologies is preferred
  • Exposure to Databricks is preferred

Responsibilities

  • Performance Optimization Analyze and optimize existing Hadoop/Spark pipelines to improve processing speed, resource utilization, and reliability
  • Identify bottlenecks in data workflows and implement solutions that reduce processing time and costs
  • Tune Spark jobs, Hive queries, and Impala performance through partitioning strategies, caching, and execution plan optimization
  • Pipeline Development Design and build scalable data pipelines using Spark (Scala) to process terabytes of data efficiently
  • Develop robust ETL/ELT workflows that integrate data from multiple sources into Hadoop environment and Oracle data warehouses
  • Implement data quality checks and monitoring to ensure pipeline reliability
  • Technical Collaboration Work closely with product teams to understand requirements and deliver data solutions
  • Participate in code reviews and contribute to engineering best practices
  • Document pipeline architecture, data flows, and operational procedures

Benefits

  • In addition to salary, Citi’s offerings may also include, for eligible employees, discretionary and formulaic incentive and retention awards.
  • Citi offers competitive employee benefits, including: medical, dental & vision coverage; 401(k); life, accident, and disability insurance; and wellness programs.
  • Citi also offers paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays.
  • For additional information regarding Citi employee benefits, please visit citibenefits.com.
  • Available offerings may vary by jurisdiction, job level, and date of hire.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service