Applying OpenAI's RAG Strategies - nikkie-memos

Applying OpenAI's RAG Strategies

https://blog.langchain.dev/applying-openai-rag/

#LangChain_Blog #RAG

02 RAGで取り上げられた98%の事例で紹介された手法の整理

we expand on each method mention and show how you can implement each one for yourself.

リンク集としても有用（サーベイっぽい）

日本語訳ないかなーと思ってたやつ、あった。さすがnpakaさん > LangChain への OpenAIのRAG戦略の適用で知った

https://note.com/npaka/n/n62cd25213679

https://blog.langchain.dev/content/images/size/w1000/2023/11/image-12.png

Baseline

The base-case retrieval method used in the OpenAI study mentioned cosine similarity.

LangChainのVector Stores

distance metricsへの参考

Distance Metrics in Vector Search

Vector Similarity Explained

Query Transformations (1)

Query transformations are a set of approaches focused on modifying the user input in order to improve retrieval.

Query Transformations (LangChain Blog)

Query expansion

👉 MultiQueryRetriever

hyde (LangChain Templates)

OpenAIによって報告されなかったメソッド

Step back prompting

A New Prompt Engineering Technique Has Been Introduced Called Step-Back Prompting

Step-Back Prompting (Question-Answering)

Rewrite-Retrieve-Read

👉 Rewrite-Retrieve-Read (LangChain cookbook)

Routing (2)

The OpenAI presentation reported that they needed to route question between two vectorstores and single SQL database.

SQLデータベースはtool利用の話ではないかと思われる

Dynamically route logic based on input

Query Construction (3)

valid SQL needed to be generated from the user input in order to extract the necessary information.

Query Construction (LangChain Blog)

OpenAIによって報告されなかったメソッド

Text-to-metadata filter for vectorstores

Text-to-Cypher for graph databases

Text-to-SQL+semantic for semi-structured data in Postgres with Pgvector

Building the Index (4)

OpenAI reported an notable boost in performance simply from experimenting with the chunk size during document embedding.

👉Text Splitting Playground

報告されなかったメソッドに「embedding fine-tuning」

How to build an AI assistant for the enterprise

Using LangSmith to Support Fine-tuning

HuggingFaceのtutorial

Getting Started With Embeddings

Train and Fine-Tune Sentence Transformers Models

Post-Processing (5)

We can use post-processing to enforce diversity or recency among our retrieved documents, which can be especially important when we are pooling documents from multiple sources.

Re-rank

Cohere Reranker

RAG-fusion

Forget RAG, the Future is RAG-Fusion

RAG Fusion (LangChain cookbook)

Classification

This marries two ideas

tagging of text

Using OpenAI functions

logical routing

Dynamically route logic based on input

OpenAIによって報告されなかったメソッド

MMR

例 https://python.langchain.com/docs/integrations/vectorstores/pinecone#maximal-marginal-relevance-searches

論文 The Use of MMR, Diversity-Based Reranking for Reordering Documents and Producing Summaries

Maximal Marginal Relevance to Re-rank results in Unsupervised KeyPhrase Extraction

クラスタリング

LOTR (Merger Retriever)

For RAG evaluation, LangSmith offers a great deal of support

👉Advanced RAG Eval