About The Position

PDF Solutions is redefining the way the semiconductor industry approaches data, analytics, and experience design, building a modern, human-centered analytics platform. As a Data Engineer at PDF, you will be part of a global team dedicated to leveraging innovative approaches and public cloud infrastructure to refine and enhance the design and architecture of Big Data Analytics software. This role involves architecting robust NoSQL Apache Cassandra databases, maintaining their high availability, performance, and scalability, and supporting development teams with troubleshooting. You will also contribute to designing, implementing, and maintaining scalable data processing components and automating infrastructure using modern DevOps tools like Terraform and Ansible. This position is ideal for engineers passionate about working with cutting-edge technology to drive impactful results.

Requirements

  • Proven experience as a Cassandra Administrator or in a similar database administration role
  • 3+ years of hands-on experience administering Apache Cassandra in production environments
  • Experience with tools like nodetool, Medusa, Reaper, and other Cassandra management utilities
  • Hands-on experience with monitoring and observability tools such as Prometheus, Grafana, or similar solutions
  • Implement robust security practices, including user authentication, authorization, and encryption for data in transit and at rest
  • Experience with automation and managing infrastructure as code: terraform / Ansible / chef / puppet
  • Strong communication skills and the ability to work across engineering teams
  • Strong troubleshooting skills
  • Demonstrated ability to generate and maintain technical documentation
  • Prior exposure to some programming languages like Python, Perl, Java
  • Solid knowledge of Shell Scripting and command line management
  • Familiarity with cloud platforms (AWS, Azure, GCP) and deploying Cassandra in cloud environments
  • Strong understanding of Cassandra architecture (gossip, hinted handoff, replication, partitioning)
  • Proficiency in scripting (e.g., Bash, Python, or Perl)

Nice To Haves

  • Experienced with data processing tools
  • Experience with Kubernetes and running Cassandra in containerized environments
  • Apache Cassandra certificate

Responsibilities

  • Participate in the continuous efforts to improve the design and architecture of our application
  • Install, configure, upgrade, and maintain Apache Cassandra clusters (on-premises and/or cloud-based)
  • Monitor database health and performance using tools such as nodetool, JMX, Prometheus, Grafana, Medusa
  • Perform regular backup and restore operations using native and third-party tools
  • Manage compaction strategies (Size-Tiered, Leveled, Unified) and optimize read/write paths
  • Proactively identify performance bottlenecks and apply tuning strategies
  • Handle schema changes, keyspace/table design, and data modeling best practices
  • Troubleshoot and resolve database incidents and support high availability and disaster recovery strategies
  • Automate routine operations using shell scripts, Python, or Ansible
  • Collaborate with developers, SREs, and DevOps teams to support CI/CD integration and deployment pipelines
  • Proactively ensure the highest levels of systems and infrastructure availability
  • Work closely with the Application and Database teams to resolve issues and improve customer experience
  • Build and sustain high-performance databases on on-premises and Cloud infrastructure
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service