The Data Engineer III is responsible for designing, building, and optimizing scalable big data pipelines, architectures, and datasets that enable advanced analytics and data-driven decision-making. This role involves developing efficient data transformation and processing frameworks; managing data structures, metadata, dependencies, and workloads; and ensuring the reliability and performance of the data ecosystem. The engineer will also work extensively with unstructured datasets, applying analytical techniques to extract insights and improve data accessibility across the organization.

What you'll do...

Data Modeling: Designing and implementing data models to support structured and unstructured datasets, ensuring data integrity and efficiency.
Data Extraction: Developing and optimizing data extraction processes from various sources, including databases, APIs, and logs.
Data Cleaning: Preprocessing and cleaning data to remove inconsistencies and improve data quality.
Data Screening: Implementing data validation and quality checks to ensure the accuracy and completeness of data.
Data Exploration: Conducting exploratory data analysis to understand patterns, trends, and correlations in the data.
Data Visualization: Creating visualizations with tools such as Tableau, Power BI, or Looker to communicate insights and findings effectively.
Big Data Technologies: Utilizing tools and frameworks such as Spark, Spark SQL, PySpark, HDFS, and MapReduce to process large datasets efficiently.
Cloud Services: Leveraging cloud platforms and services such as GCP, Azure, AWS, Databricks, Azure HDInsight, and Azure Data Factory (ADF) for data storage, processing, and analytics.
Data Querying: Writing advanced SQL queries to extract and manipulate data from relational databases and other data stores.
Data Pipeline Development: Building and optimizing scalable data pipelines and architectures to move and transform data across systems.
Data Transformation: Developing processes for data transformation, structure, metadata, dependency, and workload management.
Enterprise Software Development: Contributing to the development of enterprise-level software products related to data engineering and analytics.

What you'll bring:

Cross-functional Collaboration: Working closely with cross-functional teams, including data scientists, analysts, and software engineers, to achieve common goals.
Programming Languages: Proficiency in at least one scripting language, such as Python or Scala, for automation, data manipulation, and tool development.
Agile Environment: Collaborating effectively in an Agile environment, participating in sprints, and adapting to changing project requirements.
Analytical Skills: Applying strong analytical skills to work with complex and unstructured datasets, extracting valuable insights and actionable information.
Big Data Stores: Implementing and managing highly scalable big data stores to store and access large volumes of data efficiently.
Data Value Extraction: Manipulating, processing, and extracting value from large, diverse datasets to drive business decisions and innovation.
Big Data Technologies: Experience using tools and frameworks such as Spark, Spark SQL, PySpark, HDFS, and MapReduce to process large datasets efficiently.
Cloud Services: Experience with cloud platforms and services such as GCP, Azure, AWS, Databricks, Azure HDInsight, and Azure Data Factory (ADF) for data storage, processing, and analytics.
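To make the "Data Cleaning" and "Data Screening" duties above concrete, here is a minimal sketch of a record-screening step in plain Python. It is illustrative only, not the employer's actual stack or pipeline: the record fields (user_id, event) and the screen_records helper are hypothetical, and a production version would typically run inside a framework such as PySpark.

```python
from dataclasses import dataclass, field


@dataclass
class ValidationResult:
    """Outcome of a screening pass: rows kept vs. rows rejected."""
    clean: list = field(default_factory=list)
    rejected: list = field(default_factory=list)


def screen_records(records, required_fields):
    """Split records into clean and rejected rows.

    A row is rejected if any required field is missing, None, or an
    empty string -- a simple completeness check of the kind a data
    validation step enforces before loading data downstream.
    """
    result = ValidationResult()
    for row in records:
        if all(row.get(f) not in (None, "") for f in required_fields):
            result.clean.append(row)
        else:
            result.rejected.append(row)
    return result


# Hypothetical input: the second row is missing its user_id.
rows = [
    {"user_id": 1, "event": "click"},
    {"user_id": None, "event": "view"},
    {"user_id": 2, "event": "purchase"},
]
result = screen_records(rows, required_fields=["user_id", "event"])
# result.clean holds two rows; result.rejected holds the incomplete one.
```

Keeping rejected rows rather than silently dropping them is a common design choice: they can be logged or routed to a quarantine table so data-quality issues stay visible.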
Job Type: Full-time
Career Level: Mid Level