Senior AI / Machine Learning Engineer

Absentia Labs•San Francisco, CA

2d•Hybrid

About The Position

Absentia Labs is building intelligent systems that sit at the intersection of AI, biology, chemistry, and large-scale engineering. Our goal is to translate complex scientific data into machine intelligence capable of reasoning, generalizing, and driving discovery. Biomedical data is fragmented, noisy, and deeply interconnected. Turning it into a useful signal requires not only strong data foundations but also carefully designed learning systems that can scale across modalities, tasks, and uncertainty regimes. This role focuses on building and training those systems.

Requirements

5+ years of industry experience in machine learning or applied AI roles.
Demonstrated experience training large-scale models in production settings, not just prototypes.
Hands-on expertise with LLMs, diffusion models, and/or GNNs.
Strong proficiency in PyTorch (or equivalent deep learning frameworks).
Deep understanding of distributed training, including parallelism strategies and performance optimization.
Experience working with large datasets and high-throughput data pipelines.
Strong software engineering fundamentals: clean code, testing, reproducibility, and debugging at scale.
Ability to clearly communicate technical trade-offs to both technical and non-technical stakeholders.

Nice To Haves

Experience with reinforcement learning, fine-tuning, or preference-based optimization (e.g., RLHF).
Familiarity with model compression, distillation, or inference optimization.
Experience deploying models in production inference systems.
Exposure to multimodal learning or foundation models.
Prior work in startups or fast-moving R&D environments.
Contributions to open-source ML frameworks or research codebases.
Prior experience with molecular or biomedical models is not required.

Responsibilities

Design, train, and evaluate large-scale models, including Large Language Models (LLMs), diffusion models, and Graph Neural Networks (GNNs).
Own end-to-end training pipelines, from dataset interfaces and batching strategies to distributed training and checkpointing.
Make principled decisions about model architecture, objective functions, optimization strategies, and scaling laws.
Build and optimize distributed training systems (data parallelism, model parallelism, sharding, mixed precision).
Collaborate closely with data engineers to define ML-ready datasets and streaming interfaces.
Translate ambiguous scientific or product requirements into robust ML solutions.
Drive model evaluation, ablation, and iteration with a focus on generalization, stability, and reproducibility.
Contribute to architectural decisions around model serving, inference efficiency, and lifecycle management.
Provide technical leadership through design reviews, mentorship, and cross-team collaboration.

Benefits

Competitive compensation, including meaningful equity participation
The opportunity to work on foundation-level ML systems applied to real scientific problems.
Ownership over model design and training strategy, not just implementation.
Close collaboration with data, infrastructure, and scientific teams.
High autonomy, low bureaucracy, and a culture that values technical depth.
Flexible remote or hybrid work arrangements.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume