ApprovalMax is a fast-growing B2B SaaS company that helps businesses automate their approval workflows and financial controls. With a global team of over 150 people spanning the UK, Europe, North America, Australia, and South Africa, we build software that matters and we’re scaling quickly. The Role Our Capture product extracts structured data from hundreds of thousands of financial documents monthly - invoices, bills, POs - through an OCR pipeline that matches extracted fields against customer accounting systems. Your KPI is zero-touch rate: the percentage of documents where the system output requires zero manual correction. Your job is to move it up - systematically, measurably, and permanently. We’ve built the foundation, a validated accuracy measurement framework on full production data, a comprehensive error taxonomy of root causes, an error identification methodology, and the first shipped production fixes. You inherit the methodology and the backlog. We need a dedicated owner to execute and scale it. The work splits roughly 70% forensic data investigation / 30% ML engineering, shifting toward 50/50 as models go to production. Four error origins drive the roadmap: Entity matching (~50% of fixable errors). OCR extracts field values correctly, but the pipeline matches them to the wrong account, supplier, or tax code. Planned: embedding-based similarity search, recommender systems, consensus-based coding prediction - a standalone ML service the core pipeline calls. Pipeline logic (~25%). Our post-processing pipeline introduces errors through its own deterministic logic - tax treatment misclassification, rounding, spurious adjustment lines. Planned: forensic investigation per pattern, tracing data through processing steps, designing and validating rule-based fixes. OCR extraction (~25%). The OCR engine misreads the document - wrong currency, phantom line items, structural parsing failures. Planned: build an OCR correction layer - the right approach may be LLM with guardrails, an alternative OCR engine, a correction model from HuggingFace, or a combination. Freedom to choose; rigour required to validate. User overrides (~equal to the above combined, lower priority). Users change correct values for business reasons. Future: learn organisation/vendor correction patterns, build recommendation systems from historical data. Remote - applicants must be based in the UK, Serbia, or Moldova.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Education Level
No Education Listed