Guardrails

Trying out NeMo-Guardrails

SafeRoute: Adaptive Model Selection for Efficient and Accurate Safety Guardrails in Large Language Models

Public beta released for "chakoshi", which prevents leaks of confidential information and promotes safe enterprise use of generative AI

The story of releasing "chakoshi", a tool for using generative AI more casually and safely

Exploiting Partial Compliance: The Redact-and-Recover Jailbreak