Data is at the center of everything we do. As a startup, we disrupted the credit card industry by individually personalizing every credit card offer using statistical modeling and the relational database, cutting edge technology in 1988! Fast-forward a few years, and this little innovation and our passion for data has skyrocketed us to a Fortune 200 company and a leader in the world of data-driven decision-making. As a Data Scientist at Capital One, you’ll be part of a team that’s leading the next wave of disruption at a whole new scale, using the latest in computing and machine learning technologies and operating across billions of customer records to unlock the big opportunities that help everyday people save money, time and agony in their financial lives. Team Description: The Servicing Intelligence team delivers data science solutions to capture value from unstructured, multi-modal data sources — text, image, and audio data. We operate as an applied data science team, building with open source generative AI models and tooling, but prioritizing application over research to scale the adoption of AI with in-market solutions. You will sit on a team of data scientists that collaborates daily with product, tech, and business teams to embed AI in varied domains, including frontline agent servicing, back office document processing, AI for regulatory compliance, and overall customer experience. Your work will apply generative AI on millions of inputs, spanning from extracting key information from unstructured documents to analyzing call transcripts to resolve the root cause of customer friction. Role Description: In this role, you will: Apply expertise in unstructured data (text, image) to harness the power of open source large language models (LLMs) and visual language models (VLMs) Leverage a broad stack of technologies — LangGraph, LlamaIndex, Weights and Biases Weave, Hugging Face, PyTorch, AWS, and more — to automate workflows using huge volumes of text and vision data Build machine learning and NLP models through all phases of development, from design through training, evaluation, and validation; partnering with engineering teams to operationalize them in scalable and resilient production systems that serve 80+ million customers. Assessing GenAI or LLM-Powered application architectures in production, including best practices for Generative AI development and deployments. Define requirements for AI observability, focusing on the traceability of autonomous decisions and comprehensive system audit trails. Evaluate the dynamic behavior of AI systems and oversee the development of key continuous monitoring controls and testing, ensuring that non-deterministic outputs and autonomous actions remain within risk appetite. Get into the weeds of internal business processes and data operations by guiding annotators to curate high quality, consistent datasets for model training, evaluation, and ongoing AI monitoring. Collaborate on a team of data scientists through all phases of project development, from design through training, evaluation, validation, implementation, and maintenance. Interact with a variety of internal stakeholders to ensure the alignment of data science solutions with business outcomes.
Stand Out From the Crowd
Upload your resume and get instant feedback on how well it matches this job.
Job Type
Full-time
Career Level
Mid Level
Number of Employees
5,001-10,000 employees