About The Position

Oracle Cloud Infrastructure (OCI) is seeking a technically advanced and hands-on Senior Software Engineer to drive the design, implementation, and optimization of critical networking solutions for the world’s most demanding data center environments. In this role, you will combine expertise in congestion management, transport protocols (TCP, RDMA, and hybrids), real-time telemetry, network security, and reliable systems software to deliver efficient, secure, and resilient infrastructure at hyperscale.

Requirements

  • 5+ years of experience in designing and delivering production-grade transport protocol controllers (TCP, RDMA, RTTCC, DCQCN, TIMELY etc.) at data center scale.
  • Strong background in congestion control (loss/delay/hybrid), buffer management, telemetry systems, and large-scale data center networking concepts.
  • Demonstrated experience with network or transport-layer features and best practices.
  • Proficiency in systems programming languages such as C, C++, Go, or Rust, and experience with modern network monitoring and diagnostic tools.
  • Proven ability to analyze, troubleshoot, and optimize network performance at scale.
  • Excellent collaboration and communication skills; able to work effectively with engineering peers and cross-functional partners.
  • Track record of applying theoretical knowledge to solve real-world challenges in reliable and pragmatic ways.

Nice To Haves

  • Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or related discipline, with emphasis on computer networks, distributed systems, or security preferred.

Responsibilities

  • Transport and Congestion Control: Design, implement, and optimize protocols and controllers—spanning TCP, RDMA, and hybrid transports—to deliver high throughput, low latency, and fairness across massive data center networks.
  • Telemetry: Develop and integrate robust telemetry systems to provide continuous, fine-grained visibility into transport performance, congestion, and overall network health.
  • System Implementation: Contribute to the reliability, performance, and operational excellence of congestion management, telemetry, and security components within OCI’s stack.
  • Protocol Optimization: Advance algorithmic solutions for efficient transport and congestion control at both host and switch levels, ensuring seamless interplay between protocols and infrastructure.
  • Performance Troubleshooting: Use advanced diagnostic tools to analyze protocol behavior, buffer utilization, network bottlenecks, and implement corrective actions at scale.
  • Production Delivery: Validate and deploy new features and enhancements across OCI’s large-scale deployments, collaborating closely with hardware, host networking, telemetry, and SRE teams.
  • Collaboration and Best Practices: Share insights, review designs, and drive adoption of best practices for telemetry, security, and transport optimization across the engineering organization.
  • Continuous Learning: Stay current with developments in network transport, telemetry, security, and data center networking, rapidly applying new techniques for production benefit.

Benefits

  • Medical, dental, and vision insurance, including expert medical opinion
  • Short term disability and long term disability
  • Life insurance and AD&D
  • Supplemental life insurance (Employee/Spouse/Child)
  • Health care and dependent care Flexible Spending Accounts
  • Pre-tax commuter and parking benefits
  • 401(k) Savings and Investment Plan with company match
  • Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
  • 11 paid holidays
  • Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
  • Paid parental leave
  • Adoption assistance
  • Employee Stock Purchase Plan
  • Financial planning and group legal
  • Voluntary benefits including auto, homeowner and pet insurance
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service