@vsatWhy benchmark scores can be misleading 中发帖

[image] 
I was reading this Qwen3-Coder-Next先到 Sonnet-5还会远么? post about Qwen-3-Coder-Next being able to beat out Sonnet, and a comment there pertained to something that I’ve been thinking about for a while, so I wanted to make a post. Here is the comment; 

in which @sd_d states that the actual user experience matters more than the benchmarks. I happen to agree, and I see this rising sentiment ove...
 
 
Back to Top