Leidos has an exciting opening for you, our next TS/SCI cleared Senior Technical Operations Engineer working with a dynamic team to design, develop and deploy a state-of-the-art technology stack supporting the DOMEX Data Discovery Platform (D3P) Modernization program as well as our client’s mission to centralize and standardize Tasking, Collection, Processing, Exploitation and Dissemination (TCPED) of Open Source Intelligence (OSINT) across the Defense Intelligence Enterprise (DIE). You will have impact as part of a mission focused, solutions oriented, and adaptive team that values inclusion, innovation, collaboration, and professional development. While most work is conducted on-site at our client location in Bethesda, MD, we offer a flexible schedule and, occasionally, some tasks may be performed remotely. As a Senior Technical Operations Engineer you will be responsible for the availability and performance of a full stack containerized microservice software platform. You will participate in fostering a DevSecOps culture, building strong cross functional collaboration with systems engineering, architecture, development, security, operations, and integration teams, in a dynamic and fast paced environment. You will work closely with a multi-disciplinary team of systems engineers, developers, integrators, system administrators on the following key tasks: System Reliability & Performance – Maintaining system uptime, performance, and capacity planning for a big data production platform with a microservice architecture running on Kubernetes, Elasticsearch and PostgreSQL backends, Kafka messaging, and using Java, Python, and React as well as low code software technologies (e.g. Appian) Monitoring & Observability – Utilizing tools to monitor systems and detect issues proactively. Incident Response – Triage and troubleshooting issues, managing failures, performing root cause analysis, and conducting post-incident reviews. SLIs and SLOs – Defining and tracking Service-Level Indicators (SLIs) and Service-Level Objectives (SLOs) to measure reliability Management Oversight – Leading a team of system administrators who staff a help desk during core business hours; maintaining technical standards and mentoring staff on troubleshooting techniques Technical Leadership – Collaborating with system engineers to design solutions for new features and requirements and providing technical input to systems engineering documentation and diagrams/models working in coordination/collaboration with SE team members and architect team SAFe Agile – Participating in Agile release planning, scrum of scrums, bug triage, design sessions and other meetings You bring enthusiasm, the ability to work well with people from different disciplines with varying degrees of technical experience, and meet the following qualifications:
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Number of Employees
5,001-10,000 employees