Image & Computer Vision AI Engineer

Hatch IT•Reston, VA

65d•Hybrid

About The Position

As an Engineer on the Image & Computer Vision AI team, you will play a hands-on role in developing and deploying computer vision capabilities that support Babel Street’s intelligence applications. You will build systems that extract, analyze, and reason over visual data—enabling facial matching, object and scene understanding, geolocation and location inference from imagery, and multimodal intelligence workflows. This role is execution-focused and suited for engineers with strong foundations in computer vision, image processing, and machine learning who want to apply their skills to real-world, mission-driven problems. You will work closely with AI, Product, and Engineering teams to deliver reliable, scalable, and cost-efficient vision capabilities, including integration with multimodal LLM systems that allow users to search and reason over images using natural language. This is a hybrid role to be based out of either their Reston, VA/Washington DC office or their Somerville MA office.

Requirements

3+ years of experience in computer vision, image processing, or applied machine learning.
Hands-on experience with computer vision models and techniques (e.g., CNNs, transformers for vision, feature embeddings).
Experience building or integrating image analytics such as facial recognition, object detection, or image similarity.
Strong programming skills in Python; experience with common CV/ML libraries (PyTorch, TensorFlow, OpenCV, etc.).
Solid understanding of machine learning fundamentals, model evaluation, and performance tradeoffs.
Experience working with large image datasets and production ML pipelines.
Ability to work collaboratively in a fast-moving, mission-driven engineering environment.

Nice To Haves

Experience with facial matching or biometric systems in regulated or high-stakes environments.
Experience with image-based geolocation or scene/location inference.
Familiarity with multimodal AI systems, including combining vision models with LLMs or natural-language search.

Responsibilities

Build and maintain computer vision pipelines for image ingestion, preprocessing, inference, and evaluation.
Implement facial matching, and identity-related vision workflows in accordance with accuracy, safety, and compliance requirements.
Develop and support object detection, image similarity, and scene understanding models.
Contribute to image-based geolocation and location inference capabilities using visual features and contextual signals.
Support multimodal AI workflows that combine image embeddings with LLM-based search and reasoning.
Write clean, maintainable Python code and contribute to production services and APIs.
Assist with model evaluation, bias testing, and accuracy monitoring for vision systems.
Optimize inference pipelines for performance, scalability, and cost efficiency (GPU usage, batching, model selection).
Collaborate with Product and Engineering teams to integrate vision capabilities into user-facing intelligence applications.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume