Senior Backend Python Engineer

3MD Inc.Redmond, WA

About The Position

Senior Backend Python Engineer with strong systems thinking and API integration experience will design, build, and scale AI-driven network operations platform. Focus on backend services, data pipelines, and integrations with upstream tools, ensuring reliability, performance, and correctness in a production environment.

Requirements

  • Bachelor’s degree in computer science AI, Data Engineering, or related field
  • 5+ years of experience as a DevOps, Data Engineering, or AI infrastructure
  • Strong Python expertise
  • 5+ years professional software engineering experience (or equivalent).
  • Demonstrated “extreme competency” in Python: data structures, async patterns, error handling, testing, packaging, etc.
  • Backend / Systems Engineering
  • Experience building backend APIs and services (FastAPI, Flask, Django REST, or similar).
  • Experience with databases (SQL and/or NoSQL) and designing schemas for operational systems.
  • Solid understanding of I/O performance, concurrency, and reliability in distributed systems.
  • Integration Experience
  • Proven experience integrating with third-party and internal APIs.
  • Comfortable working with REST/JSON, authentication schemes, and potentially HTML scraping or direct DB connectivity when APIs aren’t available.
  • Software Engineering Fundamentals
  • Strong grasp of object-oriented programming, software design principles, and maintainable architecture.
  • Familiarity with CI/CD practices, logging, monitoring, and production support.
  • Communication & Ownership
  • Ability to work independently in a small, high-trust team, owning features end-to-end.
  • Strong communication skills to work directly with product and operational stakeholders.

Nice To Haves

  • Networking / NOC Domain Knowledge
  • Understanding of basic networking concepts (routes, subnets/CIDR, DNS, IPAM, firewalls, routers, alarms).
  • Experience with NOC environments, monitoring, alarming, maintenance windows, and change management.
  • AI & Agentic Coding Exposure
  • Experience working alongside LLM-based systems (e.g., ChatGPT) for coding, code review, or operations.
  • Familiarity with vector databases, embeddings, and retrieval-augmented generation (RAG).
  • Exposure to agentic coding patterns (AI factories, tool orchestration) is a plus but not a requirement.
  • Tooling & Infrastructure
  • Experience with Prefect or similar workflow orchestrators (Airflow, Dagster, etc.).
  • Experience with Kubernetes and containerized deployments.
  • Prior work integrating with observability/monitoring stacks (e.g., Splunk).

Responsibilities

  • Backend Services & APIs
  • Design, implement, and maintain Python-based backend services and APIs (e.g., FastAPI or similar).
  • Build robust integrations with systems such as ServiceNow, Splunk, IPAM, DNS, routing/device lookup tools, and other internal platforms.
  • Ensure services are built with high reliability, concurrency, and performance in mind.
  • Data Pipelines & Workflow Orchestration
  • Develop and maintain workflows using Prefect (or similar orchestration tools) for:
  • Document ingestion and preprocessing (e.g., manuals, internal docs).
  • Vectorization pipelines and blob storage management for retrieval.
  • Scheduled and event-driven workflows for ticket summarization, alarm correlation, compliance workflows, etc.
  • Implement automatic retries, logging, and observability for all workflows.
  • Integrations & Tooling
  • Integrate with a wide range of operational tools (e.g., device lookup, route lookup, trace route, change record lookup).
  • Help transition existing “agentic tools” and internal scripts into MCP-based tools and standardized interfaces over time.
  • Work with internal tool owners (e.g., networking automation platforms) using APIs, HTML scraping, or direct DB access where needed.
  • Quality, Reliability & Optimization
  • Write clean, well-structured, testable Python code.
  • Optimize for I/O, disk operations, and concurrency in a high-volume production environment.
  • Contribute to system design for redundancy, fault tolerance, and scalability.
  • Collaboration & Delivery
  • Work within 2-week sprints, owning tickets from design to deployment and validation with customer teams.
  • Collaborate closely with:
  • Product Owners/Managers on requirements and scope.
  • AI Specialists on model usage, prompting, and agentic coding patterns.
  • NOC and network engineers to deeply understand domain workflows (e.g., shift handover, maintenance windows, alarm correlation).
  • Participate in code reviews and knowledge sharing across lanes.

Benefits

  • U.S.-based employees have access to medical, dental, and vision insurance, a 401(k) plan and company contribution, short-term and long-term disability coverage, basic life insurance, and wellbeing benefits, among others.
  • U.S.-based employees also receive, per calendar year, up to 6 scheduled paid holidays.
  • Additionally, eligible hourly/non-exempt and exempt employees accrue up to 112 hours of PTO based on years of service and may annually take up to 8 hours of paid volunteer time.
  • Additional paid sick leave is also provided if required by state or local law.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service