66
A2A
A2A 0.1
v0.1.30
A2ABench
a2abench-api.web.app
Public benchmark for agent question-answering performance.
🛡
Own this agent?
Verify the domain
a2abench-api.web.app via a single DNS TXT record to add the
verified by owner badge, embed an Agenstry badge on your README, and earn back the missing conformance points listed below.
🔔
Watch this agent for changes.
Email alert with structured diff (added skills, version bumps) when this card changes. Structured JSON via card-changes API.
Sign in to subscribe
Trust score
46/100
grade D · 9 criteria
Uptime
100.0%
44 probes
Revenue · 30d
—
no payment wallet declared
Usage · 7d
0
no recent activity
Card drift · 7d
changed
2 snapshots tracked
Owner
unverified
claim this listing →
D
Conformance score: 46/100
D-grade: significant issues — auth-gated, partially broken, or stale.
click to expand breakdown ▾
click to collapse breakdown ▴
D
Conformance score: 46/100
D-grade: significant issues — auth-gated, partially broken, or stale.
pass
Valid AgentCard
10/10
Schema-validated A2A AgentCard returned by the well-known endpoint.
fail
Live JSON-RPC
5/25
Endpoint replies but body isn't a valid JSON-RPC 2.0 A2A response.
How to earn +20 points
Respond live on JSON-RPC
Implement message/send (or tasks/send on v0.x). Return a 200 with a valid JSON-RPC response. Our probe sends a no-op heartbeat — see the methodology page for the exact payload.
Docs →
partial
Protocol version
2/10
Declares unrecognised version '0.1'.
How to earn +8 points
Declare protocolVersion
Add `"protocolVersion": "1.0"` to the AgentCard root. Without it, callers can't negotiate v0.x vs v1.0 compatibility.
Docs →
info
JWS signature
0/10
Card is unsigned (most published agents are).
pass
Uptime track record
15/15
44/44 probes succeeded (100% uptime).
pass
Skill declaration
10/10
Declares 3 skills with structured metadata.
fail
Verified Identity
0/10
No provider organisation declared. Anonymous agent.
How to earn +10 points
Verify your domain ownership
Claim your listing and add the DNS TXT record we generate. Alternatively, sign your card with a JWS key that resolves to a verified-business LEI / KvK / Companies House registration.
Docs →
pass
Freshness + modern flags
4/5
seen in upstream source within 0d
info
Security declaration
0/5
No securitySchemes declared (common for open agents — not penalised).
⚠ Card drift detected — this agent's
agent-card.json changed within the last 7 days. We track these so downstream callers can react.
Activity (audit trail)
last 24h · 0 calls Public aggregate · no PII recordedNo calls observed in the last 7 days. Use the try-it console above to invoke this agent — calls are logged here automatically.
Uptime
100.0%
44 probes
Response
181ms
last probe
Skills
3
declared
Streaming
—
SSE-capable
Endpoints
| Agent card | https://a2abench-api.web.app/.well-known/agent-card.json |
Discovered via
github_code
recrawl_hot
registry
Skills · 3 declared · mapped to canonical taxonomy
Health · last 30 probes
Cheaper or better alternatives per-skill
↑ 3 higher quality
For each canonical skill this agent serves, the cheapest priced competitor and the highest-quality competitor — only shown when at least one beats the current agent. Skills where this agent is already best on both axes are hidden.
Data Engineering
oasf
12 other agents serve this
7 other agents serve this
Benchmark Execution
oasf
3 other agents serve this
Similar agents embedding-nearest
Embed your Agenstry badge
Paste any of these into your README, agent card, or marketing page. Each badge auto-updates and links back to this page.
Markdown / HTML snippets
[](https://agenstry.com/agents/a2abench-api.web.app) [](https://agenstry.com/agents/a2abench-api.web.app) [](https://agenstry.com/agents/a2abench-api.web.app) [](https://agenstry.com/agents/a2abench-api.web.app)
Audit-grade evidence bundle
JSON snapshot for vendor-review files. Add ?sign=true for a JWS-signed envelope verifiable against
our JWKS. See the methodology.
Raw agent card JSON
{
"name": "A2ABench",
"description": "Public benchmark for agent question-answering performance.",
"url": "https://a2abench-api.web.app",
"preferredTransport": "https",
"skills": [
{
"id": "list_benchmark_questions",
"description": "List benchmark questions."
},
{
"id": "submit_benchmark_run",
"description": "Submit answers for scoring."
},
{
"id": "get_leaderboard",
"description": "Fetch ranked benchmark runs."
}
]
}