THUDM/AgentBench
by THUDM · Agent Frameworks · updated 4mo ago
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Momentum: 47 · Stars: 3,492 · Forks: 262 · Rank: #244
Tags: chatgpt · gpt-4 · llm · llm-agent