Salesforce-posted 2 days ago
Full-time • Mid Level
San Francisco, CA
5,001-10,000 employees

Salesforce is the #1 AI CRM, where humans with agents drive customer success together. Here, ambition meets action. Tech meets trust. And innovation isn’t a buzzword — it’s a way of life. The world of work as we know it is changing and we're looking for Trailblazers who are passionate about bettering business and the world through AI, driving innovation, and keeping Salesforce's core values at the heart of it all. Ready to level-up your career at the company leading workforce transformation in the agentic era? You’re in the right place! Agentforce is the future of AI, and you are the future of Salesforce. We are seeking a highly seasoned and expert Software Engineering Architect to lead the design and scaling of one of the world's largest Kubernetes deployments. This critical role involves architecting a robust, secure, and highly reliable container platform that powers thousands of microservices and other services across diverse environments. The ideal candidate possesses a profound technical understanding of distributed systems, container orchestration, and infrastructure development, coupled with a passion for designing platforms that are easy for other software engineers to build, test, and operate on. You will work on real-world, massive scale problems, collaborate with top-tier engineers, and directly influence the strategic direction of our core container platform across multiple substrates.

  • Platform Strategy & Design: Lead the architectural design and evolution of our large-scale, enterprise-grade Kubernetes platform to ensure it meets requirements for scalability, reliability, security, and performance.
  • Software Development Lifecycle (SDLC) Integration: Define and implement platform tooling and APIs to optimize the SDLC for thousands of microservices, with a focus on application development and deployment pipelines.
  • Scale and Performance: Architect solutions to handle massive, ever-increasing service and infrastructure scale, ensuring high availability and low latency across the deployment, paying close attention to performance tuning.
  • Technical Leadership: Act as a subject matter expert and technical leader, guiding platform implementation teams and ensuring alignment with best practices in platform and software engineering.
  • Microservices Architecture: Define and evangelize resilient software design patterns and best practices for building, deploying, and managing thousands of microservices on the container platform.
  • Cross-Functional Partnership: Partner closely with infrastructure, security, and application development teams to integrate platform components seamlessly and define clear interfaces for engineering efficiency.
  • System Reliability: Design systems that are inherently resilient, self-healing, and easy to monitor and troubleshoot, driving down operational complexity for our application engineers.
  • Experience: 15+ years of progressive experience in hands-on software engineering and/or platform engineering, with a significant focus on building and scaling complex, high-volume distributed systems.
  • Deep Kubernetes Expertise: Expert-level understanding of Kubernetes internals, architecture, networking, security, and operation at extreme scale. Proven experience in designing and scaling Kubernetes deployments supporting thousands of services
  • Programming Skills: Deep proficiency in Golang (Go) for developing and extending infrastructure systems, APIs, and platform tooling (required for infrastructure development).
  • Infrastructure Systems: Extensive background in infrastructure development, including cloud environments, networking, storage, and infrastructure-as-code principles.
  • Microservices: Expert knowledge of microservices architecture, service mesh technologies, API design principles, and inter-service communication patterns.
  • Security & Reliability: A strong track record of designing platforms that prioritize security, observability (logging, metrics, tracing), and operational reliability for both the platform and the applications it hosts.
  • time off programs
  • medical, dental, vision, mental health support
  • paid parental leave
  • life and disability insurance
  • 401(k)
  • employee stock purchasing program
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service