General
What is Mixpeek?
What is Mixpeek?
What's the difference between Objects and Documents?
What's the difference between Objects and Documents?
Object → Batch → Engine → DocumentDo I need to know machine learning to use Mixpeek?
Do I need to know machine learning to use Mixpeek?
Can I bring my own models?
Can I bring my own models?
Ingestion & Processing
How long does batch processing take?
How long does batch processing take?
- Object count: 100 objects typically process in 1-5 minutes
- File size: Large videos (>1GB) take longer
- Extractors: Video/audio extractors are slower than text
- Ray cluster size: More workers = faster throughput
What happens if processing fails mid-batch?
What happens if processing fails mid-batch?
__fully_enriched: false__missing_features: ["list", "of", "failed", "features"]
- Query partial results (filter by
__fully_enriched: true) - Reprocess failed objects individually
- Inspect Engine logs for failure reasons
Can I update documents after creation?
Can I update documents after creation?
How do I delete documents?
How do I delete documents?
What file formats are supported?
What file formats are supported?
| Type | Formats |
|---|---|
| Video | MP4, MOV, AVI, MKV, WebM |
| Image | JPEG, PNG, GIF, WebP, TIFF |
| Audio | MP3, WAV, FLAC, OGG, M4A |
| Document | PDF, DOCX, TXT, Markdown |
| Structured | JSON, CSV |
Search & Retrieval
What's the difference between KNN search and hybrid search?
What's the difference between KNN search and hybrid search?
- Best for: Semantic search, natural language queries
- Example: “Find articles about climate change”
- Best for: Queries mixing semantics + exact terms
- Example: “React hooks API reference” (needs both concept and keyword matching)
How many collections can a retriever query?
How many collections can a retriever query?
Can I search across multiple namespaces?
Can I search across multiple namespaces?
X-Namespace header enforces hard isolation. Each request operates within a single namespace.Workaround: Make separate requests per namespace and merge results client-side.How do I search by image?
How do I search by image?
- Create a collection with
image_extractor - Build a retriever with KNN search on
clip_embedding - Pass image URL in
inputs:
What's the max result limit?
What's the max result limit?
limit parameter (e.g., retrieve 100 candidates, rerank top 20)Best practice: Use filters and sorts to narrow results before pagination.Can I use Mixpeek for real-time search?
Can I use Mixpeek for real-time search?
- Simple KNN: 20-50ms
- Hybrid search: 50-150ms
- With reranking: 100-300ms
- With LLM generation: 500-2000ms
Taxonomies & Enrichment
What's the difference between flat and hierarchical taxonomies?
What's the difference between flat and hierarchical taxonomies?
- Example: Product categories (Electronics, Clothing, Home)
- Example: Animal taxonomy (Kingdom → Phylum → Class → Order)
When should I use materialized vs on-demand taxonomies?
When should I use materialized vs on-demand taxonomies?
- Taxonomy stable, changes infrequently
- Enrichment cost amortized across many queries
- Low-latency retrieval required
- Taxonomy updates frequently
- Personalized enrichment per query
- Cost-sensitive (only pay when enrichment used)
Can I update a taxonomy without reprocessing documents?
Can I update a taxonomy without reprocessing documents?
- Create a new taxonomy version
- Update collection’s
taxonomy_applicationsto reference new version - New queries use updated taxonomy; existing documents unchanged
Namespaces & Multi-Tenancy
Do I need multiple namespaces?
Do I need multiple namespaces?
- Multi-tenancy: Isolate customer data
- Environments: dev, staging, production
- Access control: Restrict team/service access
Can namespaces share collections?
Can namespaces share collections?
What's the performance impact of many namespaces?
What's the performance impact of many namespaces?
Cost & Billing
How are credits calculated?
How are credits calculated?
- Document creation: 1 credit per document
- Inference: 1-500 credits depending on model (embeddings, LLMs, OCR)
- Search: 0.1-10 credits per query (vector search, web search)
- Storage: 100 credits per GB/month
Do cached results consume credits?
Do cached results consume credits?
- Retriever-level caching
- Stage-level caching
- Document reads (after initial creation)
What happens if I exceed my credit quota?
What happens if I exceed my credit quota?
- Monthly quota resets (1st of month)
- You upgrade tier
- You purchase additional credits
Can I get a refund for unused credits?
Can I get a refund for unused credits?
Security & Compliance
Is my data encrypted?
Is my data encrypted?
- In transit: TLS 1.3 for all API calls
- At rest: AES-256 encryption for Qdrant, MongoDB, Redis, S3
Can I control data residency?
Can I control data residency?
- US East (Virginia)
- US West (Oregon)
- EU (Frankfurt)
- Custom (contact us)
Is Mixpeek SOC 2 compliant?
Is Mixpeek SOC 2 compliant?
How long is data retained?
How long is data retained?
- Objects: Indefinitely (or until deleted)
- Documents: Indefinitely
- Tasks: 24 hours in Redis, 90 days in MongoDB
- Cache: TTL-based (default 5 minutes)
Advanced Use Cases
Can I fine-tune models on my data?
Can I fine-tune models on my data?
- Fine-tuning embedding models (text, image)
- Custom classification heads
- Domain-specific NER models
Can I run Mixpeek on-premises?
Can I run Mixpeek on-premises?
- Docker Compose deployment
- Kubernetes Helm charts
- Full source code access (with license)
Does Mixpeek support GraphQL?
Does Mixpeek support GraphQL?
Can I export data to my own vector database?
Can I export data to my own vector database?
Support & Community
How do I get help?
How do I get help?
- Documentation: You’re here! Start with Quickstart
- Talk to Engineers: Use CTA in top bar for 1:1 support
- GitHub Issues: For bug reports and feature requests
- Discord: Community support and discussions (link in footer)
What's the SLA for support response?
What's the SLA for support response?
| Tier | Response Time |
|---|---|
| Free | Best effort, 48-72 hours |
| Pro | <24 hours business days |
| Enterprise | <4 hours, 24/7 |
Do you offer onboarding or consulting?
Do you offer onboarding or consulting?
- Pro: 2-hour onboarding call included
- Enterprise: Dedicated solutions architect, quarterly reviews
- Custom integration development
- Performance optimization audits
- Training for internal teams

