About The Position

Design and develop complex DataStage jobs (Parallel and Sequence) to extract data from various sources (Flat Files, DB2, SQL Server, APIs) and load them into Netezza. Optimize DataStage jobs using partitioning, sorting, and buffer tuning. Perform SQL tuning specifically for Netezza (e.g., managing Distribution Keys, Clustered Base Tables, and minimizing data skew). Utilize the Netezza Connector and Netezza Performance Server features within DataStage to implement high-speed data loading (nzload) and external table frameworks. Write advanced Netezza SQL, including stored procedures, complex joins, and analytical functions to handle data transformation directly within the warehouse (ELT approach). Develop Unix/Linux shell scripts to automate job scheduling, file management, and error handling. Implement data validation and cleansing rules to ensure the integrity and accuracy of the data warehouse. Provide production support, troubleshoot job failures, and perform root cause analysis for performance bottlenecks.

Requirements

  • Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • 3–5 years of experience in ETL Development, with at least 2 years specifically focused on the DataStage-Netezza integration.
  • Deep understanding of Netezza’s MPP architecture, S-Blades, and FPGA processing.
  • Proven ability to debug complex data issues and optimize long-running ETL workflows.
  • IBM InfoSphere DataStage (v11.x preferred), Designer, Director, Administrator.
  • Advanced SQL (Netezza-specific SQL), Unix/Linux Shell Scripting.
  • Knowledge of Star/Snowflake schemas, Dim & Fact tables, and SCD (Type 1/2).
  • Control-M or AutoSys (Scheduling), Git/SVN (Version Control), Aginity (Netezza Workbench).

Responsibilities

  • Design and develop complex DataStage jobs to extract data from various sources and load them into Netezza.
  • Optimize DataStage jobs using partitioning, sorting, and buffer tuning.
  • Perform SQL tuning specifically for Netezza.
  • Utilize the Netezza Connector and Netezza Performance Server features within DataStage to implement high-speed data loading and external table frameworks.
  • Write advanced Netezza SQL to handle data transformation directly within the warehouse.
  • Develop Unix/Linux shell scripts to automate job scheduling, file management, and error handling.
  • Implement data validation and cleansing rules to ensure the integrity and accuracy of the data warehouse.
  • Provide production support, troubleshoot job failures, and perform root cause analysis for performance bottlenecks.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service