Principal Site Reliability Engineer

CotalityIrving, TX
2dHybrid

About The Position

At Cotality, we are driven by a single mission—to make the property industry faster, smarter, and more people-centric. Cotality is the trusted source for property intelligence, with unmatched precision, depth, breadth, and insights across the entire ecosystem. Our talented team of 5,000 employees globally uses our network, scale, connectivity and technology to drive the largest asset class in the world. Join us as we work toward our vision of fueling a thriving global property ecosystem and a more resilient society. Cotality is committed to cultivating a diverse and inclusive work culture that inspires innovation and bold thinking; it's a place where you can collaborate, feel valued, develop skills and directly impact the real estate economy. We know our people are our greatest asset. At Cotality, you can be yourself, lift people up and make an impact. By putting clients first and continuously innovating, we're working together to set the pace for unlocking new possibilities that better serve the property industry. Job Description: What is the role? This is a Principal-level role, making you a key contributor to the Site Reliability Engineering function within Cotality. You will be a hands-on practitioner and a technical leader, providing guidance and deep expertise to our SR Engineers and other engineering functions. Your work will help lay the foundation for all the promises we make to our customers regarding the reliability, performance, and security of our products. You will be responsible for driving operational excellence across the business. This includes designing and implementing technical solutions that reduce the operational overhead of keeping applications healthy, secure, and available for our customers. By curating observability data and insights, and finding opportunities for improvement through technical system analysis, you will create a compelling picture of the health of our production systems and gain influence across the business to collaborate on necessary changes. You will ensure our SRE team is a trusted partner to development and operational teams, helping them to understand and improve the reliability of the services they own and support.

Requirements

  • Bachelor's Degree or equivalent work experience.
  • 5+ years of experience.
  • Site Reliability Engineers need to be well-rounded in tech skills. Along with coding and AI/automation acumen, SREs should have strong knowledge of operating systems, networks, virtualization, and CI/CD pipeline tools. There’s no substitute for this level of expertise.
  • Drive for automation. You constantly consider, "How can I automate this manual process?"
  • AI-Forward. You have experience utilizing agentic and generative AI tooling to build solutions that reduce toil and/or generate efficiencies and savings in human efforts.
  • Extensive experience with cloud-based technologies and solutions (we are heavily invested in GCP and AWS).
  • Experience with docker containers and Container Orchestration and CaaS solutions (eg: Kubernetes, ECS).
  • Expert knowledge of scripting language (eg: Python).
  • Expert knowledge of Data Structures and SQL.
  • Knowledge of DevOps tools and mindset.
  • Experience using version control (eg: GitHub), Infrastructure Provisioning (eg: Terrafrom) and Configuration Management (eg: Ansible).
  • Ability to work alongside others and be a team player.
  • Desire to learn and adapt. Our team manages a diverse portfolio of active initiatives, giving you the chance to gain broad visibility across our entire codebase and feature set. You'll constantly be learning new areas and new technologies.
  • Passion. Our customers are passionate about real estate data, and we want the same from our engineers. We want you to actively own your work and be excited about your projects.
  • Operational excellence. Data excites you and you make decisions based on numbers rather than assumptions. If an issue arises, you strive to be alerted before our customers notice.
  • Skilled in identifying performance bottlenecks, and figuring out the root cause of incidents.

Responsibilities

  • Drive the Next-Gen Application Analysis Program: Direct the evaluation of critical applications for stability and scalability, driving the strategic shift from manual auditing to AI-driven automated analysis that proactively identifies risks, technical debt, and architectural anti-patterns before they cause incidents.
  • Principal-Level Remediation: Go beyond consultation by actively partnering with engineering teams to architect and implement resilient design patterns, harden deployment pipelines, and solve complex technical challenges that jeopardize product availability.
  • Strategic Reliability Roadmap: Deliver high-profile, strategic enterprise-scale reliability initiatives that drive global impact, simultaneously enhancing external customer transparency and internal operational efficiency.
  • Automation & Incident Intelligence: Champion the reduction of operational toil by building advanced tooling that empowers all facets of IT Service Management. Drive the shift from manual workflows to intelligent, automated remediation, ensuring that operational insights lead to permanent, automated fixes rather than recurring manual patches across our service landscape.
  • SRE Evangelism & Culture Building: Act as a technical authority and mentor across the organization, breaking down silos to drive ownership of production systems, transforming "tribal knowledge" into accessible documentation, and embedding SRE best practices into the daily workflows of product teams.

Benefits

  • Time off: Generous PTO and 11 paid holidays, plus well-being and volunteer time off.
  • Family Support: Up to 16 weeks of fully paid parental leave and a baby stipend.
  • Health: Multiple medical plan options with mental health and wellness support offerings.
  • Retirement: 401(k) with company match and vesting after one year.
  • Financial Perks: $400 annual well-being stipend and tuition assistance up to $5,250.
  • Extras: Recognition Rewards, Referral bonuses, exclusive discounts and more!
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service