Retrievers are schema‑validated, multi‑stage search pipelines. They combine vector search, metadata filters, grouping, sorting, and optional post‑processing to return ranked results from one or more collections.
Overview
Pipelines
Compose ordered stages (KNN, hybrid, filters, reranking, generation)
Schema
Define an input schema for query‑time
inputs
Collections
Target one or more collections for search
Execution
Single
execute
endpoint with filters, sorts, grouping, selection, paginationCapabilities
Vector & Hybrid
KNN over specific indexes; hybrid fusion across multiple vectors
Filters & Logic
AND/OR/NOT with typed operators and case sensitivity controls
Sorting & Paging
Sort by score or fields; stable
limit
/offset
Grouping & Selection
Group by a field; return only requested fields via
select
Presigned URLs
Optional asset URLs with
return_urls=true
Introspection
List stages and describe features to understand searchable fields
Create a retriever
- API: Create Retriever
- Method: POST
- Path:
/v1/retrievers
- Reference: API Reference
Execute a retriever
- API: Execute Retriever
- Method: POST
- Path:
/v1/retrievers/{retriever_identifier}/execute
- Reference: API Reference
How it works
1
Inputs
Validated against the retriever
input_schema
2
Stage execution
Each stage refines candidates; hybrid and filters can combine modalities and metadata
3
Response shaping
Apply optional grouping, sorting, pagination, and
select
Enable
return_urls=true
to include presigned URLs for assets in results.Inputs and filters
Manage retrievers
Best practices
1
Start simple
Begin with a single KNN stage; add hybrid, filters, and reranking as needed
2
Pre‑filter early
Narrow candidates with metadata filters before expensive vector ops
3
Select fields
Use
select
to minimize payload size and latency4
Group wisely
Keep per‑group caps small to avoid skew
5
Version retrievers
Create new retrievers for breaking changes to inputs or stages