Kymata Labs/The Living IndexesBuilt by tekvisions ↗

← All frameworks

The Agent Index / Agent Frameworks / #244

THUDM

THUDM/AgentBench

by THUDM · Agent Frameworks · updated 4mo ago

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

47

momentum

3,492

stars

262

forks

#244

rank

chatgptgpt-4llmllm-agent

View on GitHub →