Data Scientist

S&P GlobalNew York, NY
Onsite

About The Position

The Collection Platforms & AI team is building ML-powered products and capabilities to power natural language understanding, data extraction, information retrieval, and data sourcing solutions for S&P Global and its clients. In this role, you will spearhead the development of production-ready AI products and pipelines while leading by example in a highly engaging work environment. You will work in a global team and be encouraged for thoughtful risk-taking and self-initiative. The team has already delivered breakthrough products and significant business value. In this role, you will be developing our next generation of new products while enhancing existing ones, aiming at solving high-impact business problems. You will be part of a dynamic team that solves diverse problems using applied machine learning and web development with an end-to-end implementation of the solution: inception, prototyping, development, and productionizing. You will be part of a global company and build solutions at enterprise scale, growing with a highly skilled, hands-on technical team. You will contribute to solving high-complexity, high-impact problems end-to-end and build end-to-end production-ready pipelines from ideation to deployment.

Requirements

  • Strong grasp of statistics, probability, and the mathematics underpinning modern AI.
  • Linear programming and optimization.
  • Multi-dimensional optimizers, such as Adam, SGD, Gradient Descent …
  • Ability to adjust weights for full/partial tuning of LLMs.
  • Hands-on experience with any large language models (e.g., OpenAI, Anthropic, Llama), prompt engineering, fine-tuning/customization, and embedding-based retrieval.
  • Intermediate proficiency in Python (NumPy, Pandas, SpaCy, scikit-learn, PyTorch/TF 2, Hugging Face Transformers).
  • Understanding of ML & Deep Learning models, including architectures for NLP (e.g., transformers), GNNs, and multimodal systems.
  • Solid understanding of database structures and SQL.
  • Ability to perform independent research and synthesize current AI/ML research, with a track record of applying new methods in production.
  • Experience in end-to-end GenAI or advanced NLP projects, such as NER, table extraction, OCR integrations, or GNN solutions.
  • Familiarity with orchestration and deployment tools: Airflow, Redis, Flask/Django/FastAPI, SQL, R-Shiny/Dash/Streamlit.
  • Openness to evaluate and adopt emerging technologies and programming languages as needed.
  • Public contributions or demos on GitHub, Kaggle, StackOverflow, technical blogs, or publications.
  • Indefinite right to work within the USA.

Responsibilities

  • Develop and deploy large-scale ML and GenAI-powered products and pipelines.
  • Own all stages of the data science project lifecycle, including: Develop, deploy, monitor, and scale models through the full Software Development Life Cycle into production (including both ML and GenAI services).
  • Perform exploratory data analysis, proof-of-concepts, model benchmarking, and validation experiments for both ML and GenAI approaches.
  • Partner with business leaders, domain experts, and end-users to gather requirements and align on success metrics.
  • Follow coding standards, perform code reviews, and optimize data science workflows.
  • Evaluation, interpretation, and communication of results to executive stakeholders.

Benefits

  • Health care coverage designed for the mind and body.
  • Generous time off.
  • Access to a wealth of resources to grow your career and learn valuable new skills.
  • Competitive pay.
  • Retirement planning.
  • Continuing education program with a company-matched student loan contribution.
  • Financial wellness programs.
  • Perks for partners and little ones.
  • Retail discounts.
  • Referral incentive awards.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

No Education Listed

Number of Employees

5,001-10,000 employees

© 2026 Teal Labs, Inc
Privacy PolicyTerms of Service