ChatGPT and Claude are ‘becoming capable of tackling real-world missions,’ say scientists By Cointelegraph - Modern Digital India
Saturday, January 17, 2026

Nearly two dozen researchers from Tsinghua University, Ohio State University and the University of California at Berkeley collaborated to create a method for measuring the capabilities of large language models (LLMs) as real-world agents.

LLMs such as OpenAI’s ChatGPT and Anthropic’s Claude have taken the technology world by storm over the past year, as cutting-edge “chatbots” have proven useful for a variety of tasks, including coding, cryptocurrency trading and text generation.

Flowchart of AgentBench’s evaluation method. Source: Liu et al.