Data Solutions Engineer

CitiIrving, TX
1d

About The Position

This Data Solutions Engineer (Applications Development Senior Programmer Analyst - C12) is responsible for building next-generation Data Engineering solutions. This intermediate-level position involves active participation in the establishment and implementation of new or revised application systems and programs in coordination with the Technology team. A key aspect of this role is liaising between business users and technologists to facilitate the exchange of information regarding solutions, including requirements and usage. Responsibilities: Serve as an integral team member of our Data Engineering team, responsible for the design and development of Big Data solutions. Partner with domain experts, product managers, analysts, and data scientists to develop robust Big Data pipelines in Hadoop or Snowflake environments. Responsible for delivering a data-as-a-service framework. Responsible for moving all legacy workloads to cloud platform. Lead the migration of all legacy workloads to cloud platforms. Engage with key stakeholders to elicit and document requirements, including detailed data flow specifications. Assess appropriate solutions and collaborate with relevant teams to drive optimal implementations. Work with data scientists to build client pipelines using heterogeneous sources and provide essential engineering services for data science applications. Research and evaluate open-source technologies and components, recommending and integrating them into design and implementation efforts. Act as a technical expert, mentoring other team members on Big Data and Cloud technology stacks. Define comprehensive requirements for maintainability, testability, performance, security, quality, and usability across the data platform. Drive the implementation of consistent patterns, reusable components, and coding standards for all data engineering processes. Convert SAS-based pipelines into modern languages like PySpark and Scala for execution on Hadoop and non-Hadoop ecosystems. Optimize Big Data applications on both Hadoop and non-Hadoop platforms for peak performance. Evaluate new IT developments and evolving business requirements, recommending appropriate system alternatives and/or enhancements to current systems through analysis of business processes, systems, and industry standards. Appropriately assess risk when making business decisions, demonstrating consideration for the firm's reputation and safeguarding Citigroup, its clients, and assets. This includes driving compliance with applicable laws, rules, and regulations, adhering to Policy, applying sound ethical judgment regarding personal behavior, conduct, and business practices, and escalating, managing, and reporting control issues with transparency.

Requirements

  • 5+ years of experience with Hadoop and Big Data technologies
  • Demonstrated proficiency in Python, PySpark, and Scala, including practical experience with fundamental machine learning libraries
  • Experience in developing robust data solutions leveraging Google Cloud or AWS platforms; relevant certifications are preferred
  • Experience with SAS
  • Experience with containerization and related technologies (e.g., Docker, Kubernetes)
  • Comprehensive understanding of software engineering and data analytics
  • In-depth knowledge and hands-on experience with the Hadoop ecosystem and Big Data technologies (e.g., HDFS, MapReduce, Hive, Pig, Impala, Kafka, Kudu, Solr)
  • Knowledge of Agile (Scrum) development methodologies.
  • Strong development and automation skills.
  • System-level understanding of data structures, algorithms, distributed storage, and compute.
  • A proactive approach to solving complex business problems, complemented by strong interpersonal and teamwork skills.

Nice To Haves

  • Familiarity with Hadoop administration and Snowflake.
  • Proficiency in Java or additional experience with Apache Beam.

Responsibilities

  • Serve as an integral team member of our Data Engineering team, responsible for the design and development of Big Data solutions.
  • Partner with domain experts, product managers, analysts, and data scientists to develop robust Big Data pipelines in Hadoop or Snowflake environments.
  • Responsible for delivering a data-as-a-service framework.
  • Responsible for moving all legacy workloads to cloud platform.
  • Lead the migration of all legacy workloads to cloud platforms.
  • Engage with key stakeholders to elicit and document requirements, including detailed data flow specifications.
  • Assess appropriate solutions and collaborate with relevant teams to drive optimal implementations.
  • Work with data scientists to build client pipelines using heterogeneous sources and provide essential engineering services for data science applications.
  • Research and evaluate open-source technologies and components, recommending and integrating them into design and implementation efforts.
  • Act as a technical expert, mentoring other team members on Big Data and Cloud technology stacks.
  • Define comprehensive requirements for maintainability, testability, performance, security, quality, and usability across the data platform.
  • Drive the implementation of consistent patterns, reusable components, and coding standards for all data engineering processes.
  • Convert SAS-based pipelines into modern languages like PySpark and Scala for execution on Hadoop and non-Hadoop ecosystems.
  • Optimize Big Data applications on both Hadoop and non-Hadoop platforms for peak performance.
  • Evaluate new IT developments and evolving business requirements, recommending appropriate system alternatives and/or enhancements to current systems through analysis of business processes, systems, and industry standards.
  • Appropriately assess risk when making business decisions, demonstrating consideration for the firm's reputation and safeguarding Citigroup, its clients, and assets. This includes driving compliance with applicable laws, rules, and regulations, adhering to Policy, applying sound ethical judgment regarding personal behavior, conduct, and business practices, and escalating, managing, and reporting control issues with transparency.

Benefits

  • In addition to salary, Citi’s offerings may also include, for eligible employees, discretionary and formulaic incentive and retention awards.
  • Citi offers competitive employee benefits, including: medical, dental & vision coverage; 401(k); life, accident, and disability insurance; and wellness programs.
  • Citi also offers paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays.
  • For additional information regarding Citi employee benefits, please visit citibenefits.com.
  • Available offerings may vary by jurisdiction, job level, and date of hire.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service