Back to search
100
MCP live v1.4.5 streamable-http

GoldenMatch

io.github.benzsevern/goldenmatch

Find duplicate records in 30 seconds. Zero-config entity resolution, 97.2% F1 out of the box.

Uptime
35.3%
17 probes
Response
842ms
last probe
Tools
42
callable

Tools · 42

analyze_data

Profile data, detect domain, recommend ER strategy

auto_configure

Run AutoConfigController on a CSV; return the committed GoldenMatchConfig (incl. negative_evidence / Path Y when chosen) plus telemetry — stop_reason, health, decision trace, indicator column priors. …

controller_telemetry

Return the AutoConfigController telemetry from the most recent `auto_configure` or `agent_deduplicate` call in this MCP session. Same JSON shape as the web /api/v1/controller/telemetry endpoint.

agent_deduplicate

Run full ER pipeline with confidence gating and reasoning

agent_match_sources

Match two files with intelligent strategy selection

agent_explain_pair

Natural language explanation for a record pair

agent_explain_cluster

Explain why records are in the same cluster

agent_review_queue

Get borderline pairs awaiting approval

agent_approve_reject

Approve or reject a review queue pair

agent_compare_strategies

Compare ER strategies on your data

suggest_pprl

Check if data needs privacy-preserving matching

scan_quality

Run GoldenCheck data quality scan on a CSV file. Returns issues found (encoding errors, Unicode problems, format violations) without applying fixes. Requires goldencheck: pip install goldenmatch[quali…

fix_quality

Run GoldenCheck scan and apply fixes to a CSV file. Returns the fixed data summary and a manifest of all fixes applied. Requires goldencheck: pip install goldenmatch[quality]

run_transforms

Run GoldenFlow data transforms on a CSV file. Normalizes phone numbers (E.164), dates (ISO), categorical spelling, and Unicode issues. Returns a manifest of transforms applied. Requires goldenflow: pi…

list_corrections

List stored Learning Memory corrections, optionally filtered by dataset. Returns id_a, id_b, decision, source, trust, reason, matchkey_name, dataset, original_score, created_at.

add_correction

Add a pair correction to Learning Memory. Source is set to 'agent' with trust=0.5 (lower than human steward decisions which are 1.0). Pair (id_a, id_b) is canonicalized to (min, max) before storage.

learn_thresholds

Force a MemoryLearner pass over accumulated corrections. Returns the list of LearnedAdjustments produced (matchkey_name, threshold, sample_size, learned_at). Requires >= 10 corrections per matchkey be…

memory_stats

Return Learning Memory status: total correction count, last learn time, and current learned adjustments. Cheap; safe for status checks.

memory_export

Return all corrections as a list of dicts (CSV-shaped). Caller is responsible for writing the file. Optionally filter by dataset.

identity_resolve

Resolve a record_id to its durable identity. Returns the full identity view (members, evidence edges, recent events) or null when no identity exists for that record.

identity_list

List identities, optionally filtered by dataset/status.

identity_history

Return the temporal event log for an identity.

identity_conflicts

List evidence edges marked `conflicts_with`.

identity_merge

Manually merge two identities. All records from `absorb_entity_id` are reassigned to `keep_entity_id`.

identity_split

Split a subset of records off an identity into a brand-new identity. The original keeps the remaining records.

get_stats

Get dataset statistics: record count, cluster count, match rate, cluster sizes.

find_duplicates

Find duplicate matches for a record. Provide field values to search against the loaded dataset.

explain_match

Explain why two records match or don't match. Shows per-field score breakdown.

list_clusters

List duplicate clusters found in the dataset. Returns cluster IDs, sizes, and member counts.

get_cluster

Get details of a specific cluster: all member records and their field values.

get_golden_record

Get the merged golden (canonical) record for a cluster.

match_record

Match a single record against the loaded dataset in real-time. Paste a record's fields and instantly see if it matches any existing record. Uses the configured matchkeys, scorers, and thresholds. Exam…

unmerge_record

Remove a record from its cluster. The record becomes a singleton. Remaining cluster members are re-clustered using stored pair scores. Use this to fix bad merges.

shatter_cluster

Break an entire cluster into individual records. All members become singletons. Use when a cluster is completely wrong.

suggest_config

Analyze bad merges and suggest config changes. Provide examples of incorrect merges (pairs that should NOT have matched) and GoldenMatch will identify which fields/thresholds to tighten. Example: [{"r…

profile_data

Get data quality profile: column types, null rates, unique counts, sample values.

export_results

Export matching results to a file (CSV or JSON).

list_domains

List available domain extraction rulebooks (built-in + user-defined).

create_domain

Create a custom domain extraction rulebook. Define patterns for a specific data domain (medical devices, automotive parts, real estate, etc.).

test_domain

Test a domain extraction rulebook against sample records. Shows what features would be extracted from the loaded data.

pprl_auto_config

Analyze the loaded dataset and recommend optimal PPRL (privacy-preserving record linkage) configuration. Returns recommended fields, bloom filter parameters, threshold, and explanation.

pprl_link

Run privacy-preserving record linkage between two parties' data. Computes bloom filters, matches records without sharing raw data. Specify fields, threshold, and security level.

Similar MCP servers embedding-nearest

GoldenCheck live
Auto-discover validation rules from data — scan, profile, health-score. No rules to write.
19 tools · streamable-http
GoldenFlow live
Data transformation toolkit — standardize, reshape, and normalize messy data
10 tools · streamable-http
io.github.Deesmo/arch-tools-mcp live
116 AI tools in one MCP server. Web search, crypto data, image gen, news.
64 tools · sse
GoldenMatch
Entity resolution toolkit for deduplication, record matching, golden records, and PPRL.
0 tools
Ground Truth live
Live fact-checks for AI agents: endpoints, security headers, pricing, claims, compliance, markets.
9 tools · streamable-http
com.metricspot/seo-mcp live
Six tools for SEO and AI-readability audits. 91 checks, 11 score modules.
6 tools · streamable-http

How to use

Add to your Claude Desktop / Cursor / Cline MCP config:

{
  "mcpServers": {
    "goldenmatch": {
      "url": "https://goldenmatch-mcp-production.up.railway.app/mcp/",
      "transport": "streamable-http"
    }
  }
}