AgentBench

Organization

by THUDM

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

View on GitHub

#chatgpt #gpt-4 #llm #llm-agent

3.5k

Stars

256

Forks

3.5k

Watchers

Issues

Repository Details

Created

Jul 28, 2023

Last Updated

May 26, 2026

Primary Language

Python

License

Repository Size

30.4k KB

Quick Actions

Open in GitHub

Project Overview

About this repository

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Technologies & Topics

chatgpt gpt-4 llm llm-agent

Default Branch

main

README

AgentBench

Organization

by THUDM

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

View on GitHub

#chatgpt #gpt-4 #llm #llm-agent

3.5k

Stars

256

Forks

3.5k

Watchers

Issues

Repository Details

Created

Jul 28, 2023

Last Updated

May 26, 2026

Primary Language

Python

License

Repository Size

30.4k KB

Quick Actions

Open in GitHub

Project Overview

About this repository

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Technologies & Topics

chatgpt gpt-4 llm llm-agent

Default Branch

main

README

Command Palette

AgentBench

About this repository

Technologies & Topics

Default Branch

AgentBench

About this repository

Technologies & Topics

Default Branch