Software Engineer, Data Infrastructure (Hybrid)

CrowdStrike•Uniondale, NY

17d•Hybrid

About The Position

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. We work on large scale distributed systems, processing almost 3 trillion events per day and this traffic is growing daily. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you. About the Role: CrowdStrike is looking to hire a Senior Systems Engineer to join the Data Engineering team as a Senior Distributed Systems Engineer. In this team, we are on a mission to create a hyper scale data lake, which helps finding bad actors and stopping breaches. The team builds and operates systems to centralize all of the data the falcon platform collects, making it easy for internal and external customers to transform and access the data for analytics, machine learning, and threat hunting. As a Software Engineer in this team you will be responsible for building our Spark ecosystem in our DataCenter that includes but not limited to Kubernetes, Spark, MinIO, HDFS etc. We are looking for candidates that have setup big data scale clusters in the DataCenter or Cloud (PB-scale would be a plus) in the DataCenter/AWS and are passionate about solving problems at high scale. This role involves leading efforts to build State-of-the-art Spark k8 platform. This role can be based out of Sunnyvale/Silicon Valley Metro, Austin Metro or New York City Metro and will likely include 1-3 days in-office per week on average (NYC office is in development, so workers her will remain fully remote for likely 18-24 months)

Requirements

Strong in one or both of field i.e., Spark ecosystem or Kubernetes ecosystem
Strong analytical skills and with deep understanding of Distributed Systems
Strong programming skills in languages as Go, Java or Python
Understanding of Apache Spark ecosystem technologies (Deltalake/Iceberg, YARN, HDFS, Kafka, Zookeeper, Hive, Jupyter notebooks)
Experience with large-scale business critical platforms with Spark on Kubernetes on data center or cloud.
Experience with continuous deployment on K8 with Helm, fluxCD etc
Solid understanding of Spark(data) and K8 storage systems (Object stores S3/Minio, Spark ephemeral storage, Persistent volumes mapping and claims)
Understanding or spark memory management and experience with spark internals.
Familiarity with Chef is preferred.
Proven ability to work with both local and remote teams
Strong communication skills both verbal and written
This role will require the candidate to periodically undergo and pass additional background and fingerprint check(s) consistent with government customer requirements.

Responsibilities

Design and implement extremely scalable large scale distributed Spark cluster on Kubernetes platform optimized for Data center environment.
Perform and oversee a variety of functions to ensure that our Spark infrastructure on Kubernetes is available, reliable, stable and secure
Applies judgment in analyzing and selecting technologies, installation and maintenance of software and hardware systems that allow multiple Engineering and DataScience teams to interact with this system
Design, writes and own APIs, automate and generate reports to visualize cluster health.
Has responsibility for building and maintaining the Spark infrastructure such that software engineers, data analysts and data scientists can run jobs to gather data and insights.
Work on Query systems, like Apache Presto or Trino, Jupyter Notebooks

Benefits

Market leader in compensation and equity awards
Comprehensive physical and mental wellness programs
Competitive vacation and holidays for recharge
Paid parental and adoption leaves
Professional development opportunities for all employees regardless of level or role
Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections
Vibrant office culture with world class amenities
Great Place to Work Certified™ across the globe

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume