Object Technology Solutions, Inc. (OTSI) has an immediate opening for an IBM MDM ARCHITECT.

IBM MDM ARCHITECT (REMOTE)

MAJOR RESPONSIBILITIES:
· Own data quality, matching logic, and golden record creation
· Strong experience in Master Data Management (MDM)
· Hands-on with:
  · Matching & deduplication strategies
  · Golden record creation
· Experience tuning:
  · False positives / false negatives
· Knowledge of:
  · Phonetic matching (Soundex, etc.)
  · Data quality rules

Key Responsibilities
· Define and tune matching algorithms
· Identify and fix:
  · Duplicate records
  · Data inconsistencies
· Work with business teams to:
  · Define matching rules
  · Handle edge cases
· Validate new data sources and ingestion logic
· Define data quality metrics & SLAs

Good to Have
· Experience with tools like:
  · Informatica MDM / Reltio / IBM MDM
· Exposure to SQL

QUALIFICATIONS AND EXPERIENCE
· The right fit is an IBM MDM (virtual) specialist who can move on-prem MDM to the cloud with Kubernetes; Java knowledge is required.
· Conduct comprehensive reviews of the current MDM ecosystem, including matching and linking algorithm performance, duplicate and suspect processing, and custom extensions.
· Evaluate and enhance logging, monitoring, backup, and recovery processes to ensure system reliability and resiliency.
· Assess and optimize transaction/query SLAs and index performance, identifying and addressing known issues such as bucket overflows and bulk-load mechanism gaps (including DataStage constraints).
· Design, develop, and maintain Java code for MDM algorithms, data processing, and integration components, ensuring code is performant, scalable, and maintainable.
· Refactor and optimize existing Java code, and write unit, integration, and system tests for all Java components.
· Adjust matching algorithm thresholds to minimize false positives/negatives, and add and test new matching attributes (e.g., email + last name, phone combinations).
· Assess and implement phonetic matching techniques (e.g., Soundex), and lead feasibility studies to determine the need for single or multi-algorithm approaches for different business lines.
· Perform sample pairing for new data sources, collaborating with business teams to identify edge cases and refine algorithm behavior, including store-identifier-based auto-matching where applicable.
· Review and enhance frequency-based and dynamic bucketing strategies, assess passive execution options to minimize production impact, and validate interactions with STR tables and existing buckets.
· Develop and document bulk load strategies, including ingestion options with and without entity management/event handlers; explore disabling matching for high-volume ingestion scenarios (e.g., 200M+ records) and evaluate alternative ingestion patterns beyond API-only approaches.
· Capture and report on metrics, SLAs, and trade-offs for each ingestion strategy.
· Target a matching response time under 2 seconds through iterative tuning, must-gather data analysis, and load testing tools (e.g., JMeter); review reconciliation SLAs as part of ongoing optimization.
· Address deployment challenges such as outages and unpredictable environment behaviors; explore and implement CI/CD or improved deployment automation; review and optimize configuration, code, and deployment steps for efficiency.
· Plan and implement Auth/PingFed-based authentication, reduce reliance on process IDs and passwords, automate credential rotation, and maintain a dedicated 'Security' category in the scope documentation.
· Ensure all code adheres to coding standards and guidelines.

Job Type: Full-time
Career Level: Mid Level
Education Level: No Education Listed
Number of Employees: 501-1,000 employees