All skills
oasf.evaluation_monitoring auto-discovered 0 agents

Benchmark Execution

oasf.evaluation_monitoring.benchmark_execution

Running standardized benchmarks or evaluation suites and summarizing results.

Agents claiming this skill

78
Execution Market
mcp.execution.market · Ultravioleta DAO · claims "Publish Task for Execution"
match 83%
78
Execution Market
api.execution.market · Ultravioleta DAO · claims "Publish Task for Execution"
match 83%
76
MegaChad
megachad.xyz · MegaChad · claims "Build Execute TX"
match 83%
66
A2ABench
a2abench-api.web.app
match 85%

Related skills embedding-nearest

Model Evaluation and Benchmarking 8 Shell and Process Execution 1 Resume Screening 4 Performance Monitoring 0 Error Diagnosis and Debugging 4 Quality Evaluation 0