Faiss
Author: Ilgyun Jeong
Design:
Peer Review:
This is a part of LangChain Open Tutorial
Overview
This tutorial covers how to use Faiss with LangChain .
Faiss (Facebook AI Similarity Search) is a library for efficient similarity search and clustering of dense vectors. It contains algorithms that search in sets of vectors of any size, up to ones that possibly do not fit in RAM. It also includes supporting code for evaluation and parameter tuning.
This tutorial walks you through using CRUD operations with the Faiss storing , updating , deleting documents, and performing similarity-based retrieval .
Table of Contents
References
Environment Setup
Set up the environment. You may refer to Environment Setup for more details.
[Note]
langchain-opentutorialis a package that provides a set of easy-to-use environment setup, useful functions and utilities for tutorials.You can checkout the
langchain-opentutorialfor more details.
You can alternatively set API keys such as OPENAI_API_KEY in a .env file and load them.
[Note] This is not necessary if you've already set the required API keys in previous steps.
What is Faiss?
Faiss (Facebook AI Similarity Search) is a library for efficient similarity search and clustering of dense vectors.
Core Concepts
Similarity search: Finding vectors that are closest to a query vectorScaling: Handles vector sets of any size, including those exceeding RAMEfficiency: Optimized for memory usage and search speed
Vector Operations
Nearest neighbor: Finding k vectors closest to a query vectorMaximum inner product: Finding vectors with highest dot productClustering: Grouping similar vectors together
Index Types
Flat: Exact search with exhaustive comparisonIVF: Inverted file structure for faster approximate searchHNSW: Hierarchical navigable small world graphs for high-quality searchPQ: Product quantization for memory compressionOPQ: Optimized product quantization for better accuracy
Performance Metrics
Speed: Query time for finding similar vectorsMemory: RAM requirements for index storageAccuracy: How well results match exhaustive search (recall)
Technical Features
GPU support: State-of-the-art GPU implementations with 5-20x speedupMulti-threading: Parallel processing across CPU coresSIMD optimization: Vectorized operations for faster computationHalf-precision: Float16 support for better performance
Applications
Image similarity: Finding visually similar imagesText embeddings: Semantic search in document collectionsRecommendation systems: Finding similar items for usersClassification: Computing maximum inner-products for classification
Data
This part walks you through the data preparation process .
This section includes the following components:
Introduce Data
Preprocessing Data
Introduce Data
In this tutorial, we will use the fairy tale 📗 The Little Prince in PDF format as our data.
This material complies with the Apache 2.0 license .
The data is used in a text (.txt) format converted from the original PDF.
You can view the data at the link below.
Preprocessing Data
In this tutorial section, we will preprocess the text data from The Little Prince and convert it into a list of LangChain Document objects with metadata.
Each document chunk will include a title field in the metadata, extracted from the first line of each section.
Initial Setting Faiss
This part walks you through the initial setup of Faiss .
This section includes the following components:
Load Embedding Model
Load
FaissClient
Load Embedding Model
In the Load Embedding Model section, you'll learn how to load an embedding model.
This tutorial uses OpenAI's API-Key for loading the model.
💡 If you prefer to use another embedding model, see the instructions below.
Load Faiss Client
In the Load Faiss Client section, we cover how to load the database client object using the Python SDK for Faiss .
Document Manager
To support the Langchain-Opentutorial , we implemented a custom set of CRUD functionalities for VectorDBs.
The following operations are included:
upsert: Update existing documents or insert if they don’t existupsert_parallel: Perform upserts in parallel for large-scale datasimilarity_search: Search for similar documents based on embeddingsdelete: Remove documents based on filter conditions
Each of these features is implemented as class methods specific to each VectorDB.
In this tutorial, you can easily utilize these methods to interact with your VectorDB.
We plan to continuously expand the functionality by adding more common operations in the future.
Create Instance
First, we create an instance of the faiss helper class to use its CRUD functionalities.
This class is initialized with the faiss Python SDK client instance, index name and the embedding model instance , both of which were defined in the previous section.
Now you can use the following CRUD operations with the crud_manager instance.
These instance allow you to easily manage documents in your faiss .
Upsert Document
Update existing documents or insert if they don’t exist
✅ Args
texts: Iterable[str] – List of text contents to be inserted/updated.metadatas: Optional[List[Dict]] – List of metadata dictionaries for each text (optional).ids: Optional[List[str]] – Custom IDs for the documents. If not provided, IDs will be auto-generated.**kwargs: Extra arguments for the underlying vector store.
🔄 Return
None
Upsert Parallel Document
Perform upserts in parallel for large-scale data
✅ Args
texts: Iterable[str] – List of text contents to be inserted/updated.metadatas: Optional[List[Dict]] – List of metadata dictionaries for each text (optional).ids: Optional[List[str]] – Custom IDs for the documents. If not provided, IDs will be auto-generated.batch_size: int – Number of documents per batch (default: 32).workers: int – Number of parallel workers (default: 10).**kwargs: Extra arguments for the underlying vector store.
🔄 Return
None
Similarity Search
Search for similar documents based on embeddings .
This method uses "cosine similarity" .
✅ Args
query: str – The text query for similarity search.k: int – Number of top results to return (default: 10).
**kwargs : Additional search options (e.g., filters).
🔄 Return
results: List[Document] – A list of LangChain Document objects ranked by similarity.
Delete Document
Remove documents based on filter conditions
✅ Args
ids: Optional[List[str]] – List of document IDs to delete. If None, deletion is based on filter.filters: Optional[Dict] – Dictionary specifying filter conditions (e.g., metadata match).**kwargs: Any additional parameters.
🔄 Return
Boolean
Last updated