見た 2026/6 - pokutuna

見た 2026/6

<< 見た 2026/5 | 見た 2026/7 >>

2026/6/18

XユーザーのItaru Tomita / 冨田到さん: 「動画説明AIが「どのフレームが重要か」を映像だけで判断できる軽量セレクターPEEK（https://arxiv . org/abs/2605.31029）が公開された。 https://t.co/d2f0f04uHW」 / X

2605.31029 PEEK: Picking Essential frames via Efficient Knowledge distillation

対話システムの評価を LLM にどこまで任せられるか（前編）：5 つの落とし穴

Friends Don't Let Friends Use Ollama | Sleeping Robots

そりゃそう

2026/6/15

How the Open Knowledge Format can improve data sharing | Google Cloud Blog

SPEC.md knowledge-catalog/okf/SPEC.md at main · GoogleCloudPlatform/knowledge-catalog

knowledge-catalog/okf at main · GoogleCloudPlatform/knowledge-catalog

/nishio/OKFとLLM Wiki

鮮度や信頼性(LLM が集めた情報 < 目を通した考察 < 検証したこと) 的な階層つけて運用してみるといいかねえ

XユーザーのItaru Tomita / 冨田到さん: 「LoRAが「どれだけ記憶できるか」を1本の数式で予測できることを証明した研究（https://arxiv . org/html/2605.30260v1）。 https://t.co/yQV06ZV5rJ」 / X

2605.30260v1 How LoRA Remembers? A Parametric Memory Law for LLM Finetuning

2026/6/10

XユーザーのItaru Tomita / 冨田到さん: 「LLMが論理推論をどうやって解いているかが、Attention Headレベルで解剖された（https://arxiv . org/html/2605.27824v1）。 https://t.co/DuVKDvv65k」 / X

2605.27824v1 Revealing Algorithmic Deductive Circuits for Logical Reasoning

Causal Mediation Analysis とは

因果推論の DAG 的なやつか!

2026/6/9

anthropics/defending-code-reference-harness: Skills for threat modeling, scanning, triage, patching, plus an autonomous scanning harness you can /customize

defending-code-reference-harness/docs/blog-post.md at main · anthropics/defending-code-reference-harness

→ Agent Harness

→ Terminal-Bench

Introducing the Google Colab CLI - Google Developers Blog

便利そうではあるが

→ Ideogram 4.0

2026/6/8

【AI論文解説】アーキテクチャ改良の変遷から見た最新の軽量モデル: MobileNet V4 - YouTube

Gemma 3n は MobileNet V5 らしい

Gemma 3n の紹介: デベロッパーガイド - Google Developers Blog

2026/6/5

示量性と示強性 - Wikipedia

足し算できるできない話から

2605.27734 Learn from your own latents and not from tokens: A sample-complexity theory

自分自身の潜在表現を予測する学習 (data2vec/JEPA 系) が、なぜ・どれだけデータ効率を改善するかを、階層的な合成データ RHM 上で理論的に解明した論文。

2026/6/4

AIエージェントを支える次世代データ基盤 - Snowflake / Databricks / Google が挑む仮想グラフ戦略

Introducing Gemma 4 12B

Multi-token-prediction in Gemma 4

MAI-Thinking-1: Building a Hill-Climbing Machine

蒸留なし・合成データなしという話、開発プロセスやデータが参考になる?

MAI-Thinking-1は、Microsoft AI(MAI)が外部モデルからの蒸留を一切使わず、クリーンな「企業グレード」データのみでフルスクラッチ学習した推論モデルです。アーキテクチャは35Bアクティブ/1TトータルのスパースMoE(8/512エキスパート)。事前学習30Tトークン+中間学習3.55Tトークンで構築され、最大256Kコンテキストに対応します。

Cloud Run での AI のコールドスタートに関するガイド | Google Cloud 公式ブログ

2026/6/3

ハーネスエンジニアなら知っておきたい Claude Code Plugin の落とし穴

CLAUDE_PLUGIN_DATA

XユーザーのLukas Aichbergerさん: 「We unlocked the working memory of LLMs 💥 Reasoning in Memory (RiM) replaces autoregressive "thinking out loud" with fixed memory blocks that form a task-specific workspace for latent reasoning. The key idea is simple: reasoning should happen inside the LLM, not in its output! https://t.co/MeaIj7KLpI」 / X

2605.30343 Unlocking the Working Memory of Large Language Models for Latent Reasoning

LLM の推論を「中間トークンの自己回帰生成」から切り離し、固定された特殊トークン列（メモリブロック）を作業記憶として使う潜在推論手法 RiM (Reasoning in Memory) を提案。2段階カリキュラムでメモリブロックに計算機能を学習させ、単一フォワードパスで処理する。GSM8K 系ベンチマークで Coconut や SFT を上回りつつ、直接回答と同等の低レイテンシを達成した。

Claude Code Dynamic Workflows 入門：公式情報から整理する仕組みと注意点

2026/6/2

sandcastle/docs/adr at main · mattpocock/sandcastle

ADR 書いてるよという人の例、シンプル

2026/6/1

jasperai/monet · Datasets at Hugging Face

画像生成データセット #データセット

2605.21272 MONET: A Massive, Open, Non-redundant and Enriched Text-to-image dataset