A2A A2A 0.1 v0.1.30

A2ABench

a2abench-api.web.app

Public benchmark for agent question-answering performance.

Verify the domain a2abench-api.web.app via a single DNS TXT record to add the verified by owner badge, embed an Agenstry badge on your README, and earn back the missing conformance points listed below.

Verify ownership

🔔 Watch this agent for changes. Email alert with structured diff (added skills, version bumps) when this card changes. Structured JSON via card-changes API. Sign in to subscribe

Trust score

46/100

grade D · 9 criteria

Uptime

100.0%

44 probes

Revenue · 30d

—

no payment wallet declared

D

Conformance score: 46/100

D-grade: significant issues — auth-gated, partially broken, or stale.

click to expand breakdown ▾ click to collapse breakdown ▴

pass Valid AgentCard 10/10

Schema-validated A2A AgentCard returned by the well-known endpoint.

fail Live JSON-RPC 5/25

Endpoint replies but body isn't a valid JSON-RPC 2.0 A2A response.

            How to earn +20 points
          

Respond live on JSON-RPC

Implement message/send (or tasks/send on v0.x). Return a 200 with a valid JSON-RPC response. Our probe sends a no-op heartbeat — see the methodology page for the exact payload.

Docs →

partial Protocol version 2/10

Declares unrecognised version '0.1'.

            How to earn +8 points
          

Declare protocolVersion

Add `"protocolVersion": "1.0"` to the AgentCard root. Without it, callers can't negotiate v0.x vs v1.0 compatibility.

Docs →

info JWS signature 0/10

Card is unsigned (most published agents are).

pass Uptime track record 15/15

44/44 probes succeeded (100% uptime).

pass Skill declaration 10/10

Declares 3 skills with structured metadata.

fail Verified Identity 0/10

No provider organisation declared. Anonymous agent.

            How to earn +10 points
          

Verify your domain ownership

Claim your listing and add the DNS TXT record we generate. Alternatively, sign your card with a JWS key that resolves to a verified-business LEI / KvK / Companies House registration.

Docs →

pass Freshness + modern flags 4/5

seen in upstream source within 0d

info Security declaration 0/5

No securitySchemes declared (common for open agents — not penalised).

⚠ Card drift detected — this agent's agent-card.json changed within the last 7 days. We track these so downstream callers can react.

Activity (audit trail)

last 24h · 0 calls Public aggregate · no PII recorded

No calls observed in the last 7 days. Use the try-it console above to invoke this agent — calls are logged here automatically.

Card history

2 snapshots drifted 1× Every change to agent-card.json

Captured	Hash
2026-05-22 11:59:47 current	`ffb73f0463a0…`	view →
2026-05-21 23:14:43	`f3f9c38557c0…`	view →

Uptime

100.0%

44 probes

Response

181ms

last probe

Skills

declared

Streaming

—

SSE-capable

Endpoints

Agent card https://a2abench-api.web.app/.well-known/agent-card.json

Discovered via

github_code recrawl_hot registry

Skills · 3 declared · mapped to canonical taxonomy

list_benchmark_questions

List benchmark questions.

canonical Data Engineering match 85%

submit_benchmark_run

Submit answers for scoring.

canonical Evaluation Monitoring match 84%

get_leaderboard

Fetch ranked benchmark runs.

canonical Benchmark Execution match 85%

Health · last 30 probes

When	HTTP	Live JSON-RPC	Latency
2026-05-22 11:59:47	200	✗	181ms
2026-05-22 05:39:55	200	✗	148ms
2026-05-21 23:14:42	200	✗	142ms
2026-05-20 17:53:43	200	✗	138ms
2026-05-20 16:50:06	200	✗	160ms
2026-05-20 15:33:45	200	✗	158ms
2026-05-20 12:51:56	200	✗	157ms
2026-05-20 11:19:47	200	✗	160ms
2026-05-20 09:22:01	200	✗	149ms
2026-05-20 08:06:05	200	✗	140ms

Cheaper or better alternatives per-skill

      
      ↑ 3 higher quality

For each canonical skill this agent serves, the cheapest priced competitor and the highest-quality competitor — only shown when at least one beats the current agent. Skills where this agent is already best on both axes are hidden.

Data Engineering oasf

12 other agents serve this

↑ Higher quality +34pts

emem

Evaluation Monitoring oasf

7 other agents serve this

↑ Higher quality +34pts

Strale

Benchmark Execution oasf

3 other agents serve this

↑ Higher quality +12pts

Execution Market

            q 78%
            q 66%

Similar agents embedding-nearest

Anchor Browser

Browse the web as an AI agent

        anchorbrowser.io · q 0%
      

a2a-browser live

AI-native pay-per-search web agent. Live web retrieval with LLM synthesis and entity extraction. No signup — pay per query.

        digiantnz · q 100%
      

api.the402.ai

the402.ai — AI agent service marketplace. Returns full catalog + how to get started.

        api.the402.ai · q 0%
      

Hello World Agent live

A simple A2A agent that responds with 'Hello World' to any request

        A2A Registry Team · q 0%
      

E2B

Secure cloud sandboxes for AI agents — spin up isolated code execution environments, manage sandboxes, and run untrusted code safely via the

        e2b.dev · q 0%
      

2Captcha

CAPTCHA solving service API — programmatically solve reCAPTCHA, hCaptcha, image CAPTCHAs, and more using human workers or AI.

        2captcha.com · q 0%
      

Embed your Agenstry badge

Paste any of these into your README, agent card, or marketing page. Each badge auto-updates and links back to this page.

Markdown / HTML snippets

[![Agenstry grade](https://agenstry.com/badge/a2abench-api.web.app.svg)](https://agenstry.com/agents/a2abench-api.web.app)
[![Verified Business](https://agenstry.com/badge/a2abench-api.web.app/identity.svg)](https://agenstry.com/agents/a2abench-api.web.app)
[![Uptime](https://agenstry.com/badge/a2abench-api.web.app/uptime.svg)](https://agenstry.com/agents/a2abench-api.web.app)
[![A2A version](https://agenstry.com/badge/a2abench-api.web.app/protocol.svg)](https://agenstry.com/agents/a2abench-api.web.app)

Audit-grade evidence bundle

JSON snapshot for vendor-review files. Add ?sign=true for a JWS-signed envelope verifiable against our JWKS. See the methodology.

audit.json audit.json (JWS-signed) verification history

Raw agent card JSON

{
  "name": "A2ABench",
  "description": "Public benchmark for agent question-answering performance.",
  "url": "https://a2abench-api.web.app",
  "preferredTransport": "https",
  "skills": [
    {
      "id": "list_benchmark_questions",
      "description": "List benchmark questions."
    },
    {
      "id": "submit_benchmark_run",
      "description": "Submit answers for scoring."
    },
    {
      "id": "get_leaderboard",
      "description": "Fetch ranked benchmark runs."
    }
  ]
}