@PSP posted in "MRCR v2 (needle in a haystack, 8 rounds) update: 1M results; GLM 5.2 beats V4 Pro but trails Gemini 3.5 Flash"

Leaderboard – Context Arena 
1M-context results for selected models
AUC@1M scores
50.9%:gpt-5.5 
46.9%:claude-opus-4.6 
44.4%:claude-sonnet-4.6 
43.3%:gemini-3.5-flash 
41.8%:claude-opus-4.8 
40.0%:gemini-3.1-pro-preview 
38.2%:gpt-5.4 
35.8%:gemini-3-flash-preview 
33.0%:glm-5.2 
28.3%:deepseek-v4-pro 
25.4%:deepseek-v4-flash 
15.8%:mimo-v2.5 
15.3%:mimo-v2.5-pro
 
 