Welcome to the Mixpeek documentation! Use our API to ingest multimodal data, extract features, enrich documents, and retrieve results with hybrid search.
Mixpeek turns raw media (text, images, video, audio, PDFs) into structured documents and optimized feature stores you can query with flexible, staged retrievers. The API provides clear primitives for isolation, ingestion, enrichment, and retrieval.

Unified Multimodal Platform

Programmable Feature Extraction

Advanced Retrieval

  • Compose Retrievers from stages (KNN, filters, reranking, LLM gen, etc.)
  • Execute with filters, grouping, and URL generation via Retrievers
  • Learn about stage building blocks on Retrievers

Organization & Isolation

  • Multi-tenant isolation via Namespaces and the X-Namespace header
  • Manage API keys under Organizations (see Quickstart for setup)
  • Notify external systems with webhooks (see Quickstart for setup)

Data Organization & Enrichment

Taxonomies (Similarity Joins)

Attach or validate Taxonomies to enrich documents with versioning support.

Clustering (Grouping)

Build Clusters over features with multiple algorithms and apply enrichments.
API authentication uses the Authorization: Bearer <api_key> header. For tenant isolation, include X-Namespace where supported.

How It Works

1

Create a Namespace (optional)

Use Namespaces to establish isolation boundaries for data and queries. Pass X-Namespace on subsequent calls.
2

Upload Objects to Buckets

Create a Bucket and add Objects single or batch. Group work with Batches and submit for processing.
3

Define Collections & Feature Extractors

Create a Collection with input/output schemas and Feature Extractors. Describe available features to understand addresses and indexes.
4

Enrich with Taxonomies & Clustering

Configure Taxonomies and Clusters to enrich and organize documents.
5

Retrieve & Analyze

Use Retrievers to combine vector search, filters, grouping, and ranking, optionally generating presigned URLs.

Headers You’ll Use

Authorization: Bearer sk_your_api_key
X-Namespace: your_namespace_or_id  # optional but recommended for isolation

Getting Started

The fastest path is the Quickstart Guide: create a namespace, bucket, collection, upload objects, and run your first retriever. For platform concepts, see Core Concepts.

Common Use Cases

For ready-to-use multimodal patterns, see Recipes.
Mixpeek’s API gives you typed resources—Namespaces, Buckets/Objects/Batches, Collections/Documents/Feature Extractors, Taxonomies, Clusters, Retrievers, Tasks, and Webhooks—to build reliable multimodal applications end‑to‑end.
Ready to get started? Create your first project →