マルチモーダルRAG
視覚文書理解
Vision Language Model
RAG
マルチモーダル
Large Language Model
Ask in Any Modality: A Comprehensive Survey on Multimodal Retrieval-Augmented Generation
https://arxiv.org/abs/2502.08826