This first cheminformatics hire will design and build the chemistry search engine: the system that exposes our accessible chemical space through query modes fit for how our partners actually design molecules and run their discovery campaigns. This engine will be searched by generative models, AI agents, and medicinal chemists alike. The engine is grounded in predictive models that capture the actual capabilities of our platform rather than a static catalog, which changes the underlying architecture in ways an experienced builder in this space will already have opinions about. You will take in constraints from our automation team, conversion and substrate-scope models from our ML team, and building blocks from our vendors to define the complete specification of Satomic's accessible chemical space. You will also translate insights from our partners’ discovery strategies into fit-for-purpose query modes, keeping the engine in sync as both Satomic's accessible chemistry expands and our partners' discovery workflows evolve. This role is a founding-engineer position within the Development team. You'll be the technical owner of the cheminformatics domain — defining its standards and its architecture — while working as part of a broader engineering group that owns the platform you're building on. You will partner closely with our data science / ML lead and work day-to-day with the rest of Development on the systems your engine integrates with. You will have exposure to end users at Satomic’s commercial partners to translate user behavior into the query types the engine should support. We believe being part of the chemistry-AI conversation externally (through publication, open source, and the conferences and forums where this field develops) is part of the job. This is a hands-on role for an engineer who thrives with extreme latitude, ownership, and judgment. You will decide what to build versus buy on cheminformatics-specific tooling (RDKit / OpenEye for chemistry primitives, AiZynthFinder / ASKCOS / Synthia / IBM RXN for retrosynthesis, pharmacophore tooling, IP / FTO filtering, vector index choice), choose the engine's core representations, and define the versioning rules that keep it correct as chemistry capabilities expand.Infrastructure-adjacent choices — vector index, storage layer, serving framework — are made jointly with the team. As Satomic’s first cheminformatics engineer, you will be the technical owner of this domain: you will be the sole individual contributor driving this work directly for the first several months, with the opportunity to build a team under you within year one as the surface area grows.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Principal
Education Level
No Education Listed