Salesforce • Posted about 1 year ago
$204,400 - $296,400/Yr
Full-time • Senior
San Francisco, CA
Publishing Industries

Salesforce is seeking experienced engineers to join their team, focusing on solving complex engineering problems that support their products. The role involves complete service ownership, ensuring high availability and performance of mission-critical services, and leveraging AI for operational efficiency. Engineers will work on large-scale distributed systems, collaborating with various teams to enhance service reliability and performance.

  • Complete service ownership, from influencing product architecture to operating services seamlessly in production.
  • Analyze and remediate production incidents for the Core Application Server and asynchronous processing platform.
  • Develop deeper insights into platform incidents and influence engineering backlog to address repeat incidents and prevent incidents proactively.
  • Leverage AIOps platform to continuously improve anomaly detection, automate runbooks, and drive MTTD & MTTR goals.
  • Understand customer use cases leveraging our platform and services and collaborate with the engineering organization to identify opportunities to achieve availability goals.
  • Engage with engineers developing features on our platform and provide consultative support and onboarding guidance.
  • Collaborate with the Systems Engineering team on activities such as providing input for OS patching, JDK upgrades, and software configuration.
  • Collaborate with technical writers to create, update, and review documentation for users and operators.
  • Participate in the team's 24x7 on-call rotation to address complex problems in real-time and keep services operational and highly available.
  • Continuously raise standards of engineering perfection by implementing best DevOps practices.
  • Lead, collaborate, communicate, and mentor.
  • A related technical degree is required.
  • 10 years of work experience and 5+ years in a production engineering/DevOps/SRE or similar role working on high-scale distributed systems.
  • Knowledge of OO programming and concepts and experience coding in Java, C++ or Python.
  • Ability to debug complex distributed systems, understand system design with an eye for performance and scalability bottlenecks, and recommend code optimizations.
  • In-depth, hands-on experience with Linux, networking, server, and cloud architectures.
  • Exposure to container-related technologies such as Kubernetes and Docker.
  • Proficiency with source control, continuous integration, and testing pipelines.
  • Strong background in open source software is preferred.
  • Experience analyzing heap dumps.
  • Experience instrumenting code and profiling applications.
  • Experience evaluating and interpreting large volumes of production data to assess efficiency, latency, and memory and CPU utilization.
  • Experience with messaging platforms.
  • Experience with AWS or another cloud PaaS provider.
  • Experience in configuration management technologies such as Chef, Puppet or Ansible.
  • Strong problem-solving, troubleshooting and analytical skills clearly demonstrated in past projects.
  • Solid understanding of configuration, deployment, management, and maintenance of large cloud-hosted systems, including auto-scaling, monitoring, performance tuning, troubleshooting, and disaster recovery.
  • Understanding of Java Virtual Machine technology and ability to tune and debug issues related to compilers and garbage collectors.
  • Wellbeing reimbursement
  • Generous parental leave
  • Adoption assistance
  • Fertility benefits