NextStep-1

https://gyazo.com/438947f65e0d191e4796a7f7ae094dec

https://github.com/stepfun-ai/NextStep-1stepfun-ai/NextStep-1

https://huggingface.co/collections/stepfun-ai/nextstep-1-689d80238a01322b93b8a3dcモデルカード

https://huggingface.co/stepfun-ai/NextStep-1-Largestepfun-ai/NextStep-1-Large

https://huggingface.co/stepfun-ai/NextStep-1-Large-Editstepfun-ai/NextStep-1-Large-Edit

https://huggingface.co/stepfun-ai/NextStep-1-f8ch16-Tokenizerstepfun-ai/NextStep-1-f8ch16-Tokenizer

https://arxiv.org/abs/2508.10711NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

NextStep-1: 大規模な連続トークンを用いた自己回帰画像生成に向けて

14Bの自己回帰モデルと157Mフローのマッチングヘッドを組み合わせた NextStep-1を紹介します。離散テキストトークンと連続画像トークンを用いて、次トークン予測の目標値を持つ学習を行います。NextStep -1は、テキストから画像を生成するタスクにおいて自己回帰モデルとして最先端の性能を達成し、高忠実度画像合成において優れた能力を発揮します。

画像生成モデル