Back to search
📊 Intel view 📋 Audit JSON 🔄 Changelog
66
A2A A2A 0.1 v0.1.30

A2ABench

a2abench-api.web.app

Public benchmark for agent question-answering performance.

🛡
Own this agent?
Verify the domain a2abench-api.web.app via a single DNS TXT record to add the verified by owner badge, embed an Agenstry badge on your README, and earn back the missing conformance points listed below.
Verify ownership
🔔 Watch this agent for changes. Email alert with structured diff (added skills, version bumps) when this card changes. Structured JSON via card-changes API. Sign in to subscribe
Trust score
46/100
grade D · 9 criteria
Uptime
100.0%
44 probes
Revenue · 30d
no payment wallet declared
Usage · 7d
0
no recent activity
Card drift · 7d
changed
2 snapshots tracked
Owner
unverified
claim this listing →
D
Conformance score: 46/100
D-grade: significant issues — auth-gated, partially broken, or stale.
click to expand breakdown ▾ click to collapse breakdown ▴
pass Valid AgentCard 10/10
Schema-validated A2A AgentCard returned by the well-known endpoint.
fail Live JSON-RPC 5/25
Endpoint replies but body isn't a valid JSON-RPC 2.0 A2A response.
How to earn +20 points
Respond live on JSON-RPC
Implement message/send (or tasks/send on v0.x). Return a 200 with a valid JSON-RPC response. Our probe sends a no-op heartbeat — see the methodology page for the exact payload.
Docs →
partial Protocol version 2/10
Declares unrecognised version '0.1'.
How to earn +8 points
Declare protocolVersion
Add `"protocolVersion": "1.0"` to the AgentCard root. Without it, callers can't negotiate v0.x vs v1.0 compatibility.
Docs →
info JWS signature 0/10
Card is unsigned (most published agents are).
pass Uptime track record 15/15
44/44 probes succeeded (100% uptime).
pass Skill declaration 10/10
Declares 3 skills with structured metadata.
fail Verified Identity 0/10
No provider organisation declared. Anonymous agent.
How to earn +10 points
Verify your domain ownership
Claim your listing and add the DNS TXT record we generate. Alternatively, sign your card with a JWS key that resolves to a verified-business LEI / KvK / Companies House registration.
Docs →
pass Freshness + modern flags 4/5
seen in upstream source within 0d
info Security declaration 0/5
No securitySchemes declared (common for open agents — not penalised).
⚠ Card drift detected — this agent's agent-card.json changed within the last 7 days. We track these so downstream callers can react.

Activity (audit trail)

last 24h · 0 calls Public aggregate · no PII recorded

No calls observed in the last 7 days. Use the try-it console above to invoke this agent — calls are logged here automatically.

Card history

2 snapshots drifted 1× Every change to agent-card.json
Captured Hash
2026-05-22 11:59:47 current ffb73f0463a0… view →
2026-05-21 23:14:43 f3f9c38557c0… view →
Uptime
100.0%
44 probes
Response
181ms
last probe
Skills
3
declared
Streaming
SSE-capable

Endpoints

Agent cardhttps://a2abench-api.web.app/.well-known/agent-card.json
Discovered via
github_code recrawl_hot registry

Skills · 3 declared · mapped to canonical taxonomy

list_benchmark_questions

List benchmark questions.

canonical Data Engineering match 85%
submit_benchmark_run

Submit answers for scoring.

canonical Evaluation Monitoring match 84%
get_leaderboard

Fetch ranked benchmark runs.

canonical Benchmark Execution match 85%

Health · last 30 probes

When HTTP Live JSON-RPC Latency
2026-05-22 11:59:47 200 181ms
2026-05-22 05:39:55 200 148ms
2026-05-21 23:14:42 200 142ms
2026-05-20 17:53:43 200 138ms
2026-05-20 16:50:06 200 160ms
2026-05-20 15:33:45 200 158ms
2026-05-20 12:51:56 200 157ms
2026-05-20 11:19:47 200 160ms
2026-05-20 09:22:01 200 149ms
2026-05-20 08:06:05 200 140ms

Cheaper or better alternatives per-skill

↑ 3 higher quality

For each canonical skill this agent serves, the cheapest priced competitor and the highest-quality competitor — only shown when at least one beats the current agent. Skills where this agent is already best on both axes are hidden.

Similar agents embedding-nearest

Anchor Browser
Browse the web as an AI agent
anchorbrowser.io · q 0%
a2a-browser live
AI-native pay-per-search web agent. Live web retrieval with LLM synthesis and entity extraction. No signup — pay per query.
digiantnz · q 100%
api.the402.ai
the402.ai — AI agent service marketplace. Returns full catalog + how to get started.
api.the402.ai · q 0%
Hello World Agent live
A simple A2A agent that responds with 'Hello World' to any request
A2A Registry Team · q 0%
E2B
Secure cloud sandboxes for AI agents — spin up isolated code execution environments, manage sandboxes, and run untrusted code safely via the
e2b.dev · q 0%
2Captcha
CAPTCHA solving service API — programmatically solve reCAPTCHA, hCaptcha, image CAPTCHAs, and more using human workers or AI.
2captcha.com · q 0%

Embed your Agenstry badge

Paste any of these into your README, agent card, or marketing page. Each badge auto-updates and links back to this page.

Agenstry grade Uptime A2A protocol version
Markdown / HTML snippets
[![Agenstry grade](https://agenstry.com/badge/a2abench-api.web.app.svg)](https://agenstry.com/agents/a2abench-api.web.app)
[![Verified Business](https://agenstry.com/badge/a2abench-api.web.app/identity.svg)](https://agenstry.com/agents/a2abench-api.web.app)
[![Uptime](https://agenstry.com/badge/a2abench-api.web.app/uptime.svg)](https://agenstry.com/agents/a2abench-api.web.app)
[![A2A version](https://agenstry.com/badge/a2abench-api.web.app/protocol.svg)](https://agenstry.com/agents/a2abench-api.web.app)

Audit-grade evidence bundle

JSON snapshot for vendor-review files. Add ?sign=true for a JWS-signed envelope verifiable against our JWKS. See the methodology.

audit.json audit.json (JWS-signed) verification history
Raw agent card JSON
{
  "name": "A2ABench",
  "description": "Public benchmark for agent question-answering performance.",
  "url": "https://a2abench-api.web.app",
  "preferredTransport": "https",
  "skills": [
    {
      "id": "list_benchmark_questions",
      "description": "List benchmark questions."
    },
    {
      "id": "submit_benchmark_run",
      "description": "Submit answers for scoring."
    },
    {
      "id": "get_leaderboard",
      "description": "Fetch ranked benchmark runs."
    }
  ]
}