Tiktok-posted 3 months ago
Hybrid • San Jose, CA
5,001-10,000 employees
Broadcasting and Content Providers

The Cyber Defense & Engineering team is missioned to run and operate security infrastructures, platforms and technologies, as well as to support cross-functional teams to protect our users, products and infrastructures. This team is responsible for enhancing security tools and identifying vulnerabilities, with a specific focus on content assurance and the application of large language models (LLMs). You'll collaborate cross-functionally with partners inside and outside TikTok to fortify our products and users' security, helping to establish TikTok as the most trusted platform. In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager/department. We regularly review our hybrid work model, and the specific requirements may change at any time.

  • Lead and perform hands-on technical work, including architecture design and code development for an on-premise, highly scalable, and parallelized infrastructure.
  • Architect, implement, and manage a high-performance compute cluster for LLM workloads, including the selection and configuration of specialized hardware like GPUs.
  • Oversee the end-to-end project lifecycle, from planning and requirements gathering to execution and delivery, ensuring alignment with business goals for deploying LLM-powered applications.
  • Develop and maintain automation scripts and configuration management to automate the deployment and management of the on-premise hardware and software stack.
  • Implement security best practices for a private data center environment, including configuring network firewalls and managing access controls.
  • Establish comprehensive monitoring and alerting systems to track the health and performance of the compute cluster and LLM workloads.
  • Collaborate with internal stakeholders to optimize resource utilization and improve the platform's efficiency.
  • Strong background in systems engineering, distributed infrastructure, and backend development.
  • Experience with technologies such as Apache Kafka, Apache Flink, Elasticsearch, PostgreSQL, Redis, or Kubernetes.
  • Ability to solve complex technical problems and write high-quality code.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service