Forgot password
 Register account
View 4|Reply 0

MathArena评估GPT-5的数学解题能力

[Copy link]

3290

Threads

7914

Posts

62

Karma

Show all posts

hbghlyj posted 2025-8-13 00:22 |Read mode
8月11日 MathArena.ai 发布了针对GPT-5的最新评估结果。该模型在最终答案型竞赛(如AIME)、Project Euler问题集以及IMO 2025中,均获得第一名。

评估方法:针对每个问题运行模型四次,计算平均得分和美元成本。详细模型输出可点击表格中的彩色单元格查看,评估代码已开源于github.com/eth-sri/matharena,所有输出和问题集上传至huggingface.co/MathArena

Quick Reply

Advanced Mode
B Color Image Link Quote Code Smilies
You have to log in before you can reply Login | Register account

$\LaTeX$ formula tutorial

Mobile version

2025-8-14 05:09 GMT+8

Powered by Discuz!

Processed in 0.012483 seconds, 22 queries