Kymata Labs/The Living IndexesBuilt by tekvisions ↗
The Agent Index / Agent Frameworks / #244
THUDM

THUDM/AgentBench

by THUDM · Agent Frameworks · updated 4mo ago

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

47
momentum
3,492
stars
262
forks
#244
rank
chatgptgpt-4llmllm-agent
View on GitHub →