GPT-o1 - yuyan

How Is OpenAI o1 Built? (Detailed Edition)

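I read the paper (Quiet-STaR) that is said to underlie o1's core techniques. The method actually used has not been published, so what follows is speculation, but here are my impressions.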

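With the scaling of training data and the scaling of model parameters both hitting a ceiling, this work is groundbreaking in that it points to a new direction to explore: scaling inference time.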

Learning to reason

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

O1 Replication Journey: A Strategic Progress Report – Part 1

Trying out o1 Pro for everything from product ideation to implementation!

A Small Step Towards Reproducing OpenAI o1: Progress Report on the Steiner Open Source Models

OpenAI, the large language model company, develops "test-time compute," a new AI training technique that breaks through the limits of conventional methods