Manager, Platform Engineering

Apex Fintech SolutionsAustin, TX
41dHybrid

About The Position

We are seeking a skilled and experienced Manager, Platforming Engineering to join our dynamic team. If you are passionate about distributed systems and have expertise in message queueing systems like Kafka, RabbitMQ, IBM MQ, or Google PubSub, we want to hear from you! You will help deploy, maintain, and support these technologies, enabling application teams to deliver their services efficiently and reliably. You’ll play a key role in driving service migration to managed offerings on Google Cloud Platform (GCP) and designing multi-region deployments to meet application requirements. This role requires not only strong technical and operational expertise but also excellent team leadership skills and the ability to navigate ambiguous requirements. A successful candidate will be deeply operationally focused, maintaining stability while balancing proactive planning with reactive incident resolution in a fast-paced environment.

Requirements

  • BA, BS, MS in Computer Science, Engineering or related technology field (or equivalent experience) required
  • 5+ years of prior work experience
  • Strong Linux administration experience, including troubleshooting, system tuning, and scripting.
  • Proven background managing operational teams in a technical or infrastructure-heavy environment.
  • Hands-on experience with configuration management tools like Salt or Ansible.
  • Experience implementing and supporting resilient, multi-regional infrastructure.
  • Knowledge of message queuing platforms like Kafka, RabbitMQ, IBM MQ, and Google Pub/Sub.
  • Robust knowledge of monitoring and observability tools to identify issues and prevent system downtime.
  • Ability to effectively balance proactive and reactive work.
  • Problem-solving and communication skills, with the ability to translate ambiguous requirements into actionable steps.
  • A passion for driving automation, streamlining operations, and modernizing legacy systems.

Responsibilities

  • Lead and mentor a team focused on delivering stable, secure, scalable, and highly available message queuing platforms and other operational services.
  • Work closely with stakeholders to align priorities, resolve roadblocks, and ensure your team continues to deliver high-value operational support.
  • Foster a culture of accountability, technical excellence, and continuous improvement among team members.
  • Oversee the and continuously improve the deployment, configuration, maintenance, and support of message queuing services like Kafka, RabbitMQ, IBM MQ, Google Pub/Sub, as well as MongoDB.
  • Ensure all systems are effectively monitored and that any incidents are triaged, resolved, and documented to continually improve operations.
  • Migrate from self-managed services to managed GCP offerings, ensuring minimal disruption and improved operational efficiency.
  • Define and implement effective multi-regional requirements and service deployment strategies.
  • Explore and evaluate emerging technologies to support operational modernization and scalability efforts.
  • Act as a technical liaison for application teams, helping understand the actual requirements that they may not fully understand themselves.
  • Deliver service and support to application teams balanced with operational improvements and project deliverables.
  • Improve system observability with robust monitoring, alerting, and incident management practices.
  • Conduct post-mortems to drive process improvements, reduce downtime, and improve responsiveness during incidents.
  • Proactively address technical debt and identify opportunities for automation to minimize manual effort.
  • Participate in Scaled Agile Framework quarterly planning sessions, ensuring your team’s priorities align with business goals.
  • Balance longer-term strategic projects with day-to-day operational needs, including reactive incident management.

Benefits

  • healthcare benefits (medical, dental and vision, EAP)
  • competitive PTO
  • 401k match
  • parental leave
  • HSA contribution match
  • paid subscription to the Calm app
  • generous external learning and tuition reimbursement benefits
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service