Software Engineer, Data Infrastructure (Hybrid)

CrowdStrike, Uniondale, NY
Hybrid

About The Position

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed: we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. We work on large-scale distributed systems that process almost 3 trillion events per day, and this traffic is growing daily. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you.

About the Role: CrowdStrike is looking to hire a Senior Distributed Systems Engineer to join the Data Engineering team. On this team, we are on a mission to create a hyperscale data lake that helps find bad actors and stop breaches. The team builds and operates systems to centralize all of the data the Falcon platform collects, making it easy for internal and external customers to transform and access the data for analytics, machine learning, and threat hunting. As a Software Engineer on this team, you will be responsible for building our Spark ecosystem in our data center, which includes, but is not limited to, Kubernetes, Spark, MinIO and HDFS. We are looking for candidates who have set up big-data-scale clusters in the data center or cloud (AWS), with PB-scale experience a plus, and who are passionate about solving problems at high scale. This role involves leading efforts to build a state-of-the-art Spark on Kubernetes platform.
This role can be based out of Sunnyvale/Silicon Valley Metro, Austin Metro or New York City Metro and will likely include 1-3 days in-office per week on average (the NYC office is in development, so workers here will likely remain fully remote for 18-24 months).

Requirements

  • Strength in one or both fields: the Spark ecosystem or the Kubernetes ecosystem
  • Strong analytical skills and a deep understanding of distributed systems
  • Strong programming skills in languages such as Go, Java or Python
  • Understanding of Apache Spark ecosystem technologies (Delta Lake/Iceberg, YARN, HDFS, Kafka, ZooKeeper, Hive, Jupyter notebooks)
  • Experience with large-scale, business-critical platforms running Spark on Kubernetes in the data center or cloud
  • Experience with continuous deployment on Kubernetes with Helm, Flux CD, etc.
  • Solid understanding of Spark (data) and Kubernetes storage systems (object stores such as S3/MinIO, Spark ephemeral storage, persistent volume mapping and claims)
  • Understanding of Spark memory management and experience with Spark internals
  • Familiarity with Chef is preferred.
  • Proven ability to work with both local and remote teams
  • Strong communication skills, both verbal and written
  • This role will require the candidate to periodically undergo and pass additional background and fingerprint check(s) consistent with government customer requirements.

Responsibilities

  • Design and implement a highly scalable, large-scale distributed Spark cluster on a Kubernetes platform optimized for the data center environment
  • Perform and oversee a variety of functions to ensure that our Spark infrastructure on Kubernetes is available, reliable, stable and secure
  • Apply judgment in analyzing and selecting technologies, and in installing and maintaining the software and hardware systems that allow multiple Engineering and Data Science teams to interact with this system
  • Design, write and own APIs; automate and generate reports to visualize cluster health
  • Build and maintain the Spark infrastructure so that software engineers, data analysts and data scientists can run jobs to gather data and insights
  • Work on query systems such as Presto or Trino, and on Jupyter notebooks

Benefits

  • Market leader in compensation and equity awards
  • Comprehensive physical and mental wellness programs
  • Competitive vacation and holidays to recharge
  • Paid parental and adoption leaves
  • Professional development opportunities for all employees regardless of level or role
  • Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections
  • Vibrant office culture with world class amenities
  • Great Place to Work Certified™ across the globe

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

No Education Listed

Number of Employees

5,001-10,000 employees
