GPT-5 剛剛完成了《精靈寶可夢 紅版》!6,470 步對比 o3 的 18,184 步!查看統計網站進行比較! 這是一個巨大的進步!幹得好,@OpenAI,你用 GPT-5 做得很好。真是一個令人難以置信的模型。 接下來:GPT-5 對戰《精靈寶可夢 水晶版》(16 個徽章 + 紅版)。比賽很快將在 Twitch 上開始。
Clad3815
Clad38158月14日 14:39
GPT-5 has reached Victory Road! This is the last challenge before the Elite Four. GPT-5 reached this part almost three times faster than o3 (6105 steps for GPT-5 vs 16882 steps for o3). Here are my observations as to why: - GPT-5 hallucinates far less than o3. This is the main reason for the speed increase. - GPT-5 has better spatial reasoning. o3 often tried to brute-force through walls and had a hard time navigating complex areas. GPT-5 can plan long input sequences with few mistakes, which saves a lot of time. - GPT-5 is better at planning its own objectives and following them. Let's see how it handle this last challenge!
224.91K