Robot Foundation Models

Physical Intelligence is bringing general-purpose AI into the physical world. We are a group of engineers, scientists, roboticists, and company builders developing foundation models and learning algorithms to power the robots of today and the physically-actuated devices of the future.

Team

Kevin Black, Noah Brown, Danny Driess, Adnan Esmail, Michael Equi, Chelsea Finn, Nick Fusai, Dibya Ghosh, Lachy Groom, Karol Hausman, Brian Ichter, Szymon Jakubczak, Tim Jones, Kay Ke, Sergey Levine, Adrian Li-Bell, Mohith Mothukuri, Suraj Nair, Karl Pertsch, Lucy Shi, Laura Smith, James Tanner, Quan Vuong, Anna Walling, Haohuan Wang, Charles Xu, Ury Zhilinsky, and growing!

We are in the early stages of building, and plan on sharing more information soon! If you are interested in joining, please get in touch.

Investors

We are grateful for the support and partnership of Khosla Ventures, Lux Capital, OpenAI, Sequoia Capital, and Thrive Capital.

You can follow us on Twitter at @physical_int

Intrinsic

NVIDIA and Alphabet-owned Intrinsic bring next-generation robotics to reality

https://blogs.nvidia.co.jp/blog/alphabet-intrinsic-robotics-isaac-manipulator/

A Vision-Language-Action Flow Model for General Robot Control

https://www.physicalintelligence.company/download/pi0.pdf

Nikkei Robotics, January issue

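Large language model approach (discrete values)

Diffusion model approach (continuous values)

π0 draws on the vast knowledge held by large language models while also delivering the dexterity and smoothness of diffusion-model-based approaches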
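
The contrast between the discrete and continuous approaches can be illustrated with a toy sketch. This is not π0's implementation: the function names (`discretize`, `undiscretize`, `flow_sample`, `oracle_velocity`), the 256-bin resolution, the [-1, 1] action range, and the 10 integration steps are all assumptions chosen for illustration, and the hand-written velocity field stands in for what would be a learned network. The first half turns continuous actions into integer tokens, as a language-model-style policy might, and loses detail inside each bin. The second half generates a continuous action by integrating a velocity field starting from noise, in the spirit of diffusion and flow-based policies.

```python
import random

def discretize(action, low=-1.0, high=1.0, bins=256):
    """Map each continuous action dimension to an integer token in [0, bins - 1]."""
    tokens = []
    for a in action:
        a = min(max(a, low), high)                 # clip to the assumed range
        idx = int((a - low) / (high - low) * bins)
        tokens.append(min(idx, bins - 1))          # a == high belongs to the last bin
    return tokens

def undiscretize(tokens, low=-1.0, high=1.0, bins=256):
    """Map tokens back to bin centers; detail inside a bin is not recoverable."""
    width = (high - low) / bins
    return [low + (t + 0.5) * width for t in tokens]

def flow_sample(velocity, dim, steps=10, seed=0):
    """Euler-integrate dx/dt = velocity(x, t) from Gaussian noise (t=0) to t=1."""
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    dt = 1.0 / steps
    for k in range(steps):
        t = k * dt
        v = velocity(x, t)
        x = [xi + dt * vi for xi, vi in zip(x, v)]
    return x

# Hypothetical target action, e.g. three joint commands in [-1, 1].
target = [0.1234, -0.5678, 0.9]

def oracle_velocity(x, t):
    """Toy stand-in for a learned network: always points straight at the target."""
    return [(g - xi) / (1.0 - t) for g, xi in zip(target, x)]

tokens = discretize(target)
reconstructed = undiscretize(tokens)
sampled = flow_sample(oracle_velocity, dim=len(target))

print("tokens:       ", tokens)
print("reconstructed:", reconstructed)
print("flow sample:  ", sampled)
```

With these settings the discrete path can only return bin centers, so its error is bounded by half a bin width, while the continuous path is limited only by floating-point precision because the toy velocity field is exact. A real policy would replace `oracle_velocity` with a trained model conditioned on images and language, so its accuracy would depend on that model.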

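To advance robots, thinking only within the confines of robotics is pointless; it is necessary to keep up continuously with the mathematical developments in the fast-moving AI field

Today, robot software does not use AI technology. People hard-code it case by case to fit each market. Once a GPT-4-class general-purpose model exists, this hard-coding work on the software side will disappear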

We have released "RoboManipBaselines", software that makes it easy to reproduce and verify the latest imitation learning methods (SARNN, ACT, DiffusionPolicy) in simulation and on real-world robots!

https://github.com/isri-aist/RoboManipBaselines