Senior Associate, Data Scientist - US Card (Applied GenAI)

Capital One•McLean, VA

74d

About The Position

Data is at the center of everything we do. As a startup, we disrupted the credit card industry by individually personalizing every credit card offer using statistical modeling and the relational database, cutting edge technology in 1988! Fast-forward a few years, and this little innovation and our passion for data has skyrocketed us to a Fortune 200 company and a leader in the world of data-driven decision-making. As a Data Scientist at Capital One, you’ll be part of a team that’s leading the next wave of disruption at a whole new scale, using the latest in computing and machine learning technologies and operating across billions of customer records to unlock the big opportunities that help everyday people save money, time and agony in their financial lives. Team Description: The Servicing Intelligence team delivers data science solutions to capture value from unstructured, multi-modal data sources — text, image, and audio data. We operate as an applied data science team, building with open source generative AI models and tooling, but prioritizing application over research to scale the adoption of AI with in-market solutions. You will sit on a team of data scientists that collaborates daily with product, tech, and business teams to embed AI in varied domains, including frontline agent servicing, back office document processing, AI for regulatory compliance, and overall customer experience. Your work will apply generative AI on millions of inputs, spanning from extracting key information from unstructured documents to analyzing call transcripts to resolve the root cause of customer friction. Role Description: In this role, you will: Apply expertise in unstructured data (text, image) to harness the power of open source large language models (LLMs) and visual language models (VLMs) Leverage a broad stack of technologies — LangGraph, LlamaIndex, Weights and Biases Weave, Hugging Face, PyTorch, AWS, and more — to automate workflows using huge volumes of text and vision data Build machine learning and NLP models through all phases of development, from design through training, evaluation, and validation; partnering with engineering teams to operationalize them in scalable and resilient production systems that serve 80+ million customers. Assessing GenAI or LLM-Powered application architectures in production, including best practices for Generative AI development and deployments. Define requirements for AI observability, focusing on the traceability of autonomous decisions and comprehensive system audit trails. Evaluate the dynamic behavior of AI systems and oversee the development of key continuous monitoring controls and testing, ensuring that non-deterministic outputs and autonomous actions remain within risk appetite. Get into the weeds of internal business processes and data operations by guiding annotators to curate high quality, consistent datasets for model training, evaluation, and ongoing AI monitoring. Collaborate on a team of data scientists through all phases of project development, from design through training, evaluation, validation, implementation, and maintenance. Interact with a variety of internal stakeholders to ensure the alignment of data science solutions with business outcomes.

Requirements

Currently has, or is in the process of obtaining one of the following with an expectation that the required degree will be obtained on or before the scheduled start date:
A Bachelor's Degree in a quantitative field (Statistics, Economics, Operations Research, Analytics, Mathematics, Computer Science, or a related quantitative field) plus 2 years of experience performing data analytics
A Master's Degree in a quantitative field (Statistics, Economics, Operations Research, Analytics, Mathematics, Computer Science, or a related quantitative field) or an MBA with a quantitative concentration

Nice To Haves

Master’s Degree in “STEM” field (Science, Technology, Engineering, or Mathematics), or PhD in “STEM” field (Science, Technology, Engineering, or Mathematics)
Experience working with AWS
At least 2 years’ experience in Python, Scala, or R
At least 2 years’ experience with machine learning
At least 2 years’ experience with SQL
At least 2 years’ experience AI/ML tools and ecosystems, such as LangGraph, LlamaIndex, Weights and Biases Weave, Pytorch, or Hugging Face

Responsibilities

Apply expertise in unstructured data (text, image) to harness the power of open source large language models (LLMs) and visual language models (VLMs)
Leverage a broad stack of technologies — LangGraph, LlamaIndex, Weights and Biases Weave, Hugging Face, PyTorch, AWS, and more — to automate workflows using huge volumes of text and vision data
Build machine learning and NLP models through all phases of development, from design through training, evaluation, and validation; partnering with engineering teams to operationalize them in scalable and resilient production systems that serve 80+ million customers.
Assessing GenAI or LLM-Powered application architectures in production, including best practices for Generative AI development and deployments.
Define requirements for AI observability, focusing on the traceability of autonomous decisions and comprehensive system audit trails.
Evaluate the dynamic behavior of AI systems and oversee the development of key continuous monitoring controls and testing, ensuring that non-deterministic outputs and autonomous actions remain within risk appetite.
Get into the weeds of internal business processes and data operations by guiding annotators to curate high quality, consistent datasets for model training, evaluation, and ongoing AI monitoring.
Collaborate on a team of data scientists through all phases of project development, from design through training, evaluation, validation, implementation, and maintenance.
Interact with a variety of internal stakeholders to ensure the alignment of data science solutions with business outcomes.

Benefits

This role is also eligible to earn performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI).
Incentives could be discretionary or non discretionary depending on the plan.
Capital One offers a comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume