Supported Models

  1. %load_ext autoreload
  2. %autoreload 2
  1. The autoreload extension is already loaded. To reload it, use:
  2. %reload_ext autoreload
  1. import pandas as pd
  2. from fastembed import (
  3. SparseTextEmbedding,
  4. TextEmbedding,
  5. LateInteractionTextEmbedding,
  6. ImageEmbedding,
  7. )
  8. from fastembed.rerank.cross_encoder import TextCrossEncoder

Supported Text Embedding Models

  1. supported_models = (
  2. pd.DataFrame(TextEmbedding.list_supported_models())
  3. .sort_values("size_in_GB")
  4. .drop(columns=["sources", "model_file", "additional_files"])
  5. .reset_index(drop=True)
  6. )
  7. supported_models
modeldimdescriptionlicensesize_in_GB
0BAAI/bge-small-en-v1.5384Text embeddings, Unimodal (text), English, 512…mit0.067
1BAAI/bge-small-zh-v1.5512Text embeddings, Unimodal (text), Chinese, 512…mit0.090
2snowflake/snowflake-arctic-embed-xs384Text embeddings, Unimodal (text), English, 512…apache-2.00.090
3sentence-transformers/all-MiniLM-L6-v2384Text embeddings, Unimodal (text), English, 256…apache-2.00.090
4jinaai/jina-embeddings-v2-small-en512Text embeddings, Unimodal (text), English, 819…apache-2.00.120
5BAAI/bge-small-en384Text embeddings, Unimodal (text), English, 512…mit0.130
6snowflake/snowflake-arctic-embed-s384Text embeddings, Unimodal (text), English, 512…apache-2.00.130
7nomic-ai/nomic-embed-text-v1.5-Q768Text embeddings, Multimodal (text, image), Eng…apache-2.00.130
8BAAI/bge-base-en-v1.5768Text embeddings, Unimodal (text), English, 512…mit0.210
9sentence-transformers/paraphrase-multilingual-…384Text embeddings, Unimodal (text), Multilingual…apache-2.00.220
10Qdrant/clip-ViT-B-32-text512Text embeddings, Multimodal (text&image), Engl…mit0.250
11jinaai/jina-embeddings-v2-base-de768Text embeddings, Unimodal (text), Multilingual…apache-2.00.320
12BAAI/bge-base-en768Text embeddings, Unimodal (text), English, 512…mit0.420
13snowflake/snowflake-arctic-embed-m768Text embeddings, Unimodal (text), English, 512…apache-2.00.430
14nomic-ai/nomic-embed-text-v1.5768Text embeddings, Multimodal (text, image), Eng…apache-2.00.520
15jinaai/jina-embeddings-v2-base-en768Text embeddings, Unimodal (text), English, 819…apache-2.00.520
16nomic-ai/nomic-embed-text-v1768Text embeddings, Multimodal (text, image), Eng…apache-2.00.520
17snowflake/snowflake-arctic-embed-m-long768Text embeddings, Unimodal (text), English, 204…apache-2.00.540
18mixedbread-ai/mxbai-embed-large-v11024Text embeddings, Unimodal (text), English, 512…apache-2.00.640
19jinaai/jina-embeddings-v2-base-code768Text embeddings, Unimodal (text), Multilingual…apache-2.00.640
20sentence-transformers/paraphrase-multilingual-…768Text embeddings, Unimodal (text), Multilingual…apache-2.01.000
21snowflake/snowflake-arctic-embed-l1024Text embeddings, Unimodal (text), English, 512…apache-2.01.020
22thenlper/gte-large1024Text embeddings, Unimodal (text), English, 512…mit1.200
23BAAI/bge-large-en-v1.51024Text embeddings, Unimodal (text), English, 512…mit1.200
24intfloat/multilingual-e5-large1024Text embeddings, Unimodal (text), Multilingual…mit2.240

Supported Sparse Text Embedding Models

  1. (
  2. pd.DataFrame(SparseTextEmbedding.list_supported_models())
  3. .sort_values("size_in_GB")
  4. .drop(columns=["sources", "model_file", "additional_files"])
  5. .reset_index(drop=True)
  6. )
modelvocab_sizedescriptionlicensesize_in_GBrequires_idf
0Qdrant/bm25NaNBM25 as sparse embeddings meant to be used wit…apache-2.00.010True
1Qdrant/bm42-all-minilm-l6-v2-attentions30522.0Light sparse embedding model, which assigns an…apache-2.00.090True
2prithivida/Splade_PP_en_v130522.0Independent Implementation of SPLADE++ Model f…apache-2.00.532NaN
3prithvida/Splade_PP_en_v130522.0Independent Implementation of SPLADE++ Model f…apache-2.00.532NaN

Supported Late Interaction Text Embedding Models

  1. (
  2. pd.DataFrame(LateInteractionTextEmbedding.list_supported_models())
  3. .sort_values("size_in_GB")
  4. .drop(columns=["sources", "model_file"])
  5. .reset_index(drop=True)
  6. )
modeldimdescriptionlicensesize_in_GBadditional_files
0answerdotai/answerai-colbert-small-v196Text embeddings, Unimodal (text), Multilingual…apache-2.00.13NaN
1colbert-ir/colbertv2.0128Late interaction modelmit0.44NaN
2jinaai/jina-colbert-v2128New model that expands capabilities of colbert…cc-by-nc-4.02.24[onnx/model.onnx_data]

Supported Image Embedding Models

  1. (
  2. pd.DataFrame(ImageEmbedding.list_supported_models())
  3. .sort_values("size_in_GB")
  4. .drop(columns=["sources", "model_file"])
  5. .reset_index(drop=True)
  6. )
modeldimdescriptionlicensesize_in_GB
0Qdrant/resnet50-onnx2048Image embeddings, Unimodal (image), 2016 yearapache-2.00.10
1Qdrant/clip-ViT-B-32-vision512Image embeddings, Multimodal (text&image), 202…mit0.34
2Qdrant/Unicom-ViT-B-32512Image embeddings, Multimodal (text&image), 202…apache-2.00.48
3Qdrant/Unicom-ViT-B-16768Image embeddings (more detailed than Unicom-Vi…apache-2.00.82

Supported Rerank Cross Encoder Models

  1. (
  2. pd.DataFrame(TextCrossEncoder.list_supported_models())
  3. .sort_values("size_in_GB")
  4. .drop(columns=["sources", "model_file"])
  5. .reset_index(drop=True)
  6. )
modelsize_in_GBdescriptionlicense
0Xenova/ms-marco-MiniLM-L-6-v20.08MiniLM-L-6-v2 model optimized for re-ranking t…apache-2.0
1Xenova/ms-marco-MiniLM-L-12-v20.12MiniLM-L-12-v2 model optimized for re-ranking …apache-2.0
2jinaai/jina-reranker-v1-tiny-en0.13Designed for blazing-fast re-ranking with 8K c…apache-2.0
3jinaai/jina-reranker-v1-turbo-en0.15Designed for blazing-fast re-ranking with 8K c…apache-2.0
4BAAI/bge-reranker-base1.04BGE reranker base model for cross-encoder re-r…mit
5jinaai/jina-reranker-v2-base-multilingual1.11A multi-lingual reranker model for cross-encod…cc-by-nc-4.0