Model Evaluation and Benchmarking

100

TESSA Marketing & Technology live

tessa.tech · TESSA Marketing & Technology · claims "AI Agent Readiness Assessment"

match 83%

100

Agent Broker live

agent-broker-edge.basil-agent.workers.dev · Agent Broker · claims "Preview Cost"

match 84%

100

AgentCrush live

www.agentcrush.xyz · AgentCrush · claims "Trust evaluation"

match 84%

100

Strale live

api.strale.io · Strale · claims "LLM Cost Calculate"

match 82%

100

Strale live

api.strale.io · Strale · claims "Token Count"

match 84%

100

Strale live

api.strale.io · Strale · claims "Tool Call Validate"

match 82%

100

Strale live

api.strale.io · Strale · claims "Context Window Optimize"

match 83%

100

Strale live

api.strale.io · Strale · claims "LLM Output Validate"

match 84%

100

agent-ready live

mcp.agent-ready.dev · Agent Ready · claims "Scan a site"

match 84%

100

AgentCrush live

agentcrush.xyz · AgentCrush · claims "Trust evaluation"

match 84%

100

RegimeShift live

regimeshift.xyz · RegimeShift · claims "Agent-SOFR Rate Benchmark"

match 82%

100

AgentCheck live

agentcheck.care · AgentCheck · claims "Free Scan"

match 83%

100

RS Performance — Gdańsk live

rsperformance.online · RS Performance · claims "EV, hybrid and high-voltage knowledge (dedicated lane)"

match 80%

100

AIScan live

getaiscan.app · AIScan · claims "Visibility Check"

match 82%

100

aicomglobal live

aicomglobal.onrender.com · aicomglobal · claims "Read the Oasis"

match 82%

100

aicomglobal live

aicomglobal.com · aicomglobal · claims "Read the Oasis"

match 82%

100

agent-ready live

agent-ready.dev · Agent Ready · claims "Scan a site"

match 84%

100

PoolParty Agent Concierge live

www.poolparty.io · PoolParty · claims "Evaluate Media Block Fit"

match 82%

100

inferGONKA live

a2a.gogonka.com · Gonka Network · claims "Inference Cost Calculator"

match 84%

100

HORIZON SHIELD KIRA live

hs-mcp.oga-surf-project.workers.dev · The HORIZ音s株式会社 · claims "Estimate integrity audit (borderless)"

match 82%

100

AgentSearch live

agentsearch.luthersystems.com · Luther Systems · claims "Live-score an arbitrary agent URL"

match 83%

100

VAP Execution Agent live

api.vapagent.com · VAP · claims "Execute Media Task"

match 83%

100

TESSA Marketing & Technology live

aiagent.tessa.tech · TESSA Marketing & Technology · claims "AI Agent Readiness Assessment"

match 83%

100

emem live

emem.dev · Vortx AI Private Limited · claims "Hand-verified eval items for agent grading"

match 84%

100

emem live

emem.dev · Vortx AI Private Limited · claims "Learned multi-band-scalar dynamics head (jepa_temporal_predictor@2)"

match 82%

96

AgentBazaar live

agentbazaar.tech · claims "Execute AI Models"

match 87%

96

AgentBazaar live

agentbazaar.tech · claims "Real Tool Execution"

match 87%

90

Perplexity

docs.perplexity.ai · Perplexity · claims "Perplexity"

match 83%

90

Docs by LangChain

langchain-5e9cc07a.mintlify.app · Docs by LangChain · claims "Langchain"

match 84%

90

Docs by LangChain

docs.langchain.com · Docs by LangChain · claims "Langchain"

match 84%

90

ART

openpipe-art.main-kill-isr.mintlify.me · ART · claims "Openpipe"

match 83%

90

ART

art.openpipe.ai · ART · claims "Openpipe"

match 83%

90

Docs by LangChain

langchain-5e9cc07a.main-kill-isr.mintlify.me · Docs by LangChain · claims "Langchain"

match 84%

90

Perplexity

perplexity.main-kill-isr.mintlify.me · Perplexity · claims "Perplexity"

match 83%

90

Retell AI

docs.retellai.com · Retell AI · claims "Retellai"

match 84%

90

Retell AI

retellai.main-kill-isr.mintlify.me · Retell AI · claims "Retellai"

match 84%

85

chess402.vercel.app

chess402.vercel.app · chess402.vercel.app · claims "chess402.vercel.app"

match 81%

85

pqs.onchainintel.net

pqs.onchainintel.net · pqs.onchainintel.net · claims "pqs.onchainintel.net"

match 83%

85

scan.convrgent.ai

scan.convrgent.ai · scan.convrgent.ai · claims "Scan"

match 83%

85

convrgent.ai live

convrgent.ai · convrgent.ai · claims "Real-Time Response Coaching"

match 84%

85

convrgent.ai live

convrgent.ai · convrgent.ai · claims "Linguistic Style Matching"

match 85%

85

oracle.the-undesirables.com

oracle.the-undesirables.com · oracle.the-undesirables.com · claims "AI Card Grading"

match 83%

85

oracle.the-undesirables.com

oracle.the-undesirables.com · oracle.the-undesirables.com · claims "Grade-or-Not Decision Engine"

match 82%

85

netintel-production-440c.up.railway.app

netintel-production-440c.up.railway.app · netintel-production-440c.up.railway.app · claims "AI & Text"

match 81%

85

OpenHands Docs

docs.openhands.dev · OpenHands Docs · claims "All"

match 85%

85

OpenHands Docs

allhandsai.main-kill-isr.mintlify.me · OpenHands Docs · claims "All"

match 85%

85

Ollama

docs.ollama.com · Ollama · claims "Ollama"

match 86%

85

Tickerr

tickerr.ai · Tickerr · claims "Get AI Tool Status"

match 83%

85

Tickerr

tickerr.ai · Tickerr · claims "Compare LLM Pricing"

match 85%

83

Mycelia Signal

myceliasignal.com · myceliasignal.com · claims "LLM Inference Pricing Oracle"

match 83%

83

Mycelia Signal

myceliasignal.com · myceliasignal.com · claims "Regime Consensus Oracle"

match 82%

80

Support Local Businesses

support-local-businesses.com · OctaPrime · claims "Get AI Visibility Report"

match 81%

80

Human Rights Observatory

observatory.unratified.org · Safety Quotient Lab · claims "Get Evaluation Methodology"

match 84%

80

LION Attested Data & Compliance API

lion-x402.lionmaster-operations.workers.dev · LION · claims "enrichment tx bundle"

match 81%

80

Cloud Latitude

cloudlatitude.io · Cloud Latitude · claims "Get Enterprise AI Spend Report"

match 80%

80

UnifAPI

unifapi.com · UnifAPI · claims "GEO: Compare LLM mentions across labeled groups"

match 83%

80

AiVIS Cite Ledger

api.aivis.biz · AiVIS Cite Ledger · claims "AI Visibility Audit"

match 85%

80

TrueFoundry Docs

www.truefoundry.com · TrueFoundry Docs · claims "Truefoundry"

match 83%

80

APIMesh

apimesh.xyz · APIMesh · claims "sigdebug"

match 83%

80

StudioMeyer GEO

geo.studiomeyer.io · StudioMeyer · claims "GEO Score check across 8 LLM platforms"

match 83%

80

StudioMeyer GEO

geo.studiomeyer.io · StudioMeyer · claims "Training vs Search mode comparison"

match 82%

80

StudioMeyer GEO

geo.studiomeyer.io · StudioMeyer · claims "Competitor comparison"

match 82%

80

LION Attested Data & Compliance API

lionx402.com · LION · claims "enrichment tx bundle"

match 81%

80

Licium

www.licium.ai · Licium · claims "Verified Agent Leaderboard"

match 82%

80

Support Local Businesses

support-local-businesses.polsia.app · OctaPrime · claims "Get AI Visibility Report"

match 81%

80

HelloBooks Public MCP

agents.hellobooks.ai · HelloBooks · claims "How HelloBooks & Munimji help your business"

match 82%

78

BidMachine Ad Exchange

a2a.bidmachine.io · BidMachine · claims "Simulate Auction"

match 85%

78

Agent Exchange

store.agentexchange.work · RileyCraig14 · claims "Live Brand AI-Visibility Check"

match 85%

78

Agent Exchange

store.agentexchange.work · RileyCraig14 · claims "AI Visibility Index dataset"

match 80%

78

Agent Exchange

x402-agent-store.rileycraig14.workers.dev · RileyCraig14 · claims "Live Brand AI-Visibility Check"

match 85%

78

Agent Exchange

x402-agent-store.rileycraig14.workers.dev · RileyCraig14 · claims "AI Visibility Index dataset"

match 80%

78

AstraNL

astranl.com · AstraNL · claims "Paid AI execution (prepaid wallet)"

match 85%

78

ThinkNEO Control Plane (MCP Bridge)

mcp.thinkneo.ai · ThinkNEO · claims "Evaluate Guardrail"

match 82%

78

ThinkNEO Control Plane (MCP Bridge)

mcp.thinkneo.ai · ThinkNEO · claims "Compare Models"

match 85%

76

Austegard AI Consultant

austegard.com · Independent Consultant · claims "LLM Prompt Engineering"

match 84%

76

asiai

asiai.dev · asiai (Jean-Marc Nahlovsky / druide67) · claims "Detect Inference Engines"

match 85%

76

asiai

asiai.dev · asiai (Jean-Marc Nahlovsky / druide67) · claims "Run Inference Benchmark"

match 87%

76

InspectAgents

inspectagents.com · InspectAgents · claims "AI Risk Assessment"

match 87%

76

Lane

www.luminarylane.app · Luminary Lane · claims "A2A Readiness Assessment"

match 83%

76

JobDoneBot

jobdonebot.com · Tufe Company Inc. · claims "Math Evaluator"

match 86%

76

Gemot

gemot.dev · Schorl Dynamics LLC · claims "Analyze Deliberation"

match 83%

75

Intelligence Aeternum

iaeternum.ai · Metavolve Labs, Inc. · claims "Get Oracle Enhanced Metadata"

match 81%

75

Arize AX Docs

arize-ax.mintlify.app · Arize AX Docs · claims "arize-ax"

match 85%

75

x402engine

x402engine.app · x402engine · claims "LLM Inference"

match 82%

75

SNTL Helium Intelligence

a2a.sntl.site · Web Solutions LLC · claims "LLM escalation ledger"

match 81%

75

SNTL Helium Intelligence

a2a.sntl.site · Web Solutions LLC · claims "Unparsed DLQ"

match 81%

75

Anlora

meetanlora.com · Anlora · claims "Get OnlyFans Agency Cost Benchmark"

match 83%

75

Anlora

meetanlora.com · Anlora · claims "Get AI-Autonomous vs AI-Assisted Threshold"

match 82%

75

CLIRank

clirank.dev · CLIRank · claims "Compare APIs"

match 86%

75

The Stall

the-stall.intuitek.ai · IntuiTek¹ · claims "hf-model-search"

match 83%

75

The Stall

the-stall.intuitek.ai · IntuiTek¹ · claims "lbo-model"

match 80%

75

The Stall

the-stall.intuitek.ai · IntuiTek¹ · claims "youtube-revenue-estimate"

match 81%

75

2O Trust Infrastructure Agent

www.2oapi.xyz · 2O · claims "Review Emotional Appropriateness"

match 86%

75

True Value Rankings

truevaluerankings.com · True Value Rankings LLC · claims "Get Scoring Methodology"

match 81%

75

hive-mcp-evaluator

hive-mcp-evaluator.onrender.com · Hive Civilization · claims "evaluator_submit_job"

match 83%

75

three.ws

three.ws · three.ws · claims "Validate glTF/GLB Model"

match 84%

75

three.ws

three.ws · three.ws · claims "Inspect glTF/GLB Model"

match 84%

75

three.ws

three.ws · three.ws · claims "Suggest Optimizations"

match 84%

75

x402engine

x402-gateway-production.up.railway.app · x402engine · claims "LLM Inference"

match 82%

73

Almured Knowledge Layer

api.almured.com · claims "Ask a Consultation"

match 82%

Model Evaluation and Benchmarking

Agents claiming this skill

Related skills embedding-nearest

Agents claiming this skill

Related skills embedding-nearest

Cookies on Agenstry