CatsuCatsu Docs

Overview

Overview of all 11 supported embedding providers

Catsu supports 11 major embedding providers with a unified, consistent API.

Supported Providers

ProviderModelsKey FeaturesEnvironment Variable
OpenAI3Industry standard, MatryoshkaOPENAI_API_KEY
Voyage AI11Domain-specific, multimodal, quantizationVOYAGE_API_KEY
Cohere5Multilingual, truncationCOHERE_API_KEY
Gemini1Long context, MatryoshkaGEMINI_API_KEY
Jina AI6Multimodal, code-specific, long contextJINA_API_KEY
Mistral AI2Code-optimized, quantizationMISTRAL_API_KEY
Nomic2Long text handling, MatryoshkaNOMIC_API_KEY
Cloudflare7Edge inference, BGE modelsCLOUDFLARE_API_TOKEN
DeepInfra16Open-source models, Qwen3DEEPINFRA_API_KEY
Mixedbread4Multilingual, quantizationMIXEDBREAD_API_KEY
Together AI7Open-source, long contextTOGETHER_API_KEY

For detailed model information including pricing and benchmarks, visit the Models Catalog.

Quick Examples

Using Different Providers

from catsu import Client

client = Client()

# OpenAI
openai_response = client.embed("openai:text-embedding-3-small", ["Text"])

# Voyage AI
voyage_response = client.embed("voyageai:voyage-3", ["Text"])

# Cohere
cohere_response = client.embed("cohere:embed-v4.0", ["Text"])

Provider-Specific Features

# OpenAI with custom dimensions
response = client.embed(
    "openai:text-embedding-3-small",
    ["Text"],
    dimensions=256
)

# Voyage AI with input_type
response = client.embed(
    "voyageai:voyage-3",
    ["Search query"],
    input_type="query"
)

# Gemini with long context (2048 tokens)
response = client.embed(
    "gemini:gemini-embedding-001",
    ["Very long document..."]
)

Choosing a Provider

Consider these factors:

  • Use case: General retrieval, code search, multilingual, etc.
  • Features needed: input_type, dimensions
  • Cost: varies significantly by provider
  • Performance: latency and throughput requirements
  • Context length: maximum input tokens

See Best Practices: Model Selection for detailed guidance.

Explore detailed documentation for each provider:

On this page