Taxonomies
Taxonomies in Mixpeek allow you to classify, organize, and enrich your content with structured metadata, functioning as the multimodal analogue to SQL JOIN operations.
Overview
A taxonomy attaches structured metadata from a reference collection to your documents. Like a JOIN in a traditional database, it combines two sets of records, but it matches on feature similarity rather than exact key equality, which makes it suitable for multimodal content where no shared key exists.
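To make the JOIN analogy concrete, the following is a minimal sketch of a similarity-based join in plain Python. It is not Mixpeek's implementation: the `embedding` field, the `similarity_join` helper, and the `0.8` threshold are assumptions chosen for illustration.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def similarity_join(documents, reference, enrich_fields, threshold=0.8):
    """Copy enrich_fields from the most similar reference record onto each
    document, but only when the similarity reaches the threshold."""
    enriched = []
    for doc in documents:
        out = dict(doc)
        best = max(
            reference,
            key=lambda ref: cosine(doc["embedding"], ref["embedding"]),
            default=None,
        )
        if best is not None and cosine(doc["embedding"], best["embedding"]) >= threshold:
            for name in enrich_fields:
                out[name] = best[name]
        enriched.append(out)
    return enriched


# Hypothetical reference data and documents with toy 2-d embeddings.
reference = [
    {"label": "dog", "category": "animal", "embedding": [1.0, 0.0]},
    {"label": "car", "category": "vehicle", "embedding": [0.0, 1.0]},
]
documents = [
    {"id": "doc-1", "embedding": [0.9, 0.1]},  # close to "dog"
    {"id": "doc-2", "embedding": [0.5, 0.5]},  # not close enough to either
]

result = similarity_join(documents, reference, ["label", "category"])
```

Here `doc-1` picks up `label` and `category` from the nearest reference record, while `doc-2` is left unenriched because neither reference record clears the threshold. An exact-key JOIN would have matched neither document.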
Select Taxonomy Type
Choose between flat or hierarchical taxonomy structure based on your organizational needs and data relationships.
Select Source Collection
Select the collection containing the reference data that will be used to enrich your documents.
Select Enrichment Fields
Choose which fields from the source collection will be added to enrich your target documents.
Select Retriever
Choose the retriever that will match documents from your target collection with the appropriate reference data.
Configure Input Fields
Select the input fields from the source collection for the retriever. These fields define what data is used for matching, and can include constants.
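The five choices above can be pictured as a single configuration object. The sketch below is hypothetical: `TaxonomyConfig`, its field names, and the `{"field": ...}` / `{"constant": ...}` input convention are assumptions for illustration and are not taken from the Mixpeek API.

```python
from dataclasses import dataclass, field


@dataclass
class TaxonomyConfig:
    """Hypothetical container for the five setup choices."""
    taxonomy_type: str        # "flat" or "hierarchical"
    source_collection: str    # collection holding the reference data
    enrichment_fields: list   # fields copied onto target documents
    retriever: str            # retriever used for matching
    input_fields: dict = field(default_factory=dict)  # retriever inputs

    def validate(self):
        """Return a list of problems; an empty list means the config is usable."""
        errors = []
        if self.taxonomy_type not in ("flat", "hierarchical"):
            errors.append("taxonomy_type must be 'flat' or 'hierarchical'")
        if not self.source_collection:
            errors.append("source_collection is required")
        if not self.enrichment_fields:
            errors.append("select at least one enrichment field")
        if not self.retriever:
            errors.append("retriever is required")
        for name, spec in self.input_fields.items():
            # Each input is either mapped from a field or set to a constant.
            if set(spec) not in ({"field"}, {"constant"}):
                errors.append(f"input '{name}' must set exactly one of 'field' or 'constant'")
        return errors


config = TaxonomyConfig(
    taxonomy_type="flat",
    source_collection="product_catalog",
    enrichment_fields=["category", "brand"],
    retriever="image_similarity",
    input_fields={
        "query_image": {"field": "thumbnail"},
        "top_k": {"constant": 1},
    },
)
problems = config.validate()
```

Validating the choices together, before a taxonomy is created, surfaces mistakes such as an unknown taxonomy type or an empty list of enrichment fields early.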
Content Classification
Create organized classification systems to categorize your multimodal content.
Data Enrichment
Enrich documents with additional metadata based on their characteristics.
Types of Taxonomies
Mixpeek supports two main types of taxonomies: flat and hierarchical.
Creating a Taxonomy
Flat Taxonomy
Hierarchical Taxonomy
Applying Taxonomies
Once you’ve created a taxonomy, you can apply it to enrich documents in your collections.
Materialization Options
When applying taxonomies, you have two main materialization options:
Materialized
Creates enriched documents in a specified output collection.
Benefits:
- Faster query performance
- Pre-computed enrichments
- Historical enrichment tracking
Considerations:
- Requires additional storage
- Needs re-application when taxonomy changes
On-Demand
Computes enrichments during query execution.
Benefits:
- Always uses latest taxonomy
- No duplicate storage required
- Automatic updates with taxonomy changes
Considerations:
- Higher query-time compute costs
- Potentially slower query performance
- Cannot track historical enrichment changes
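The trade-off between the two options can be illustrated with a small, self-contained sketch. A keyword lookup stands in for the retriever here, and `enrich`, `materialize`, and `query_on_demand` are hypothetical helpers, not Mixpeek functions.

```python
def enrich(doc, taxonomy):
    """Attach the label of the first taxonomy keyword found in the text."""
    out = dict(doc)
    for keyword, label in taxonomy.items():
        if keyword in doc["text"]:
            out["label"] = label
            break
    return out


def materialize(docs, taxonomy):
    """Materialized: compute enrichments once and store the results."""
    return [enrich(doc, taxonomy) for doc in docs]


def query_on_demand(docs, taxonomy, label):
    """On-demand: compute enrichments at query time, then filter."""
    enriched = (enrich(doc, taxonomy) for doc in docs)
    return [doc for doc in enriched if doc.get("label") == label]


docs = [{"id": 1, "text": "red sneaker"}, {"id": 2, "text": "oak table"}]
taxonomy = {"sneaker": "footwear", "table": "furniture"}

# Materialized output is computed now and stored separately.
output_collection = materialize(docs, taxonomy)

# The taxonomy changes after materialization.
taxonomy["sneaker"] = "shoes"

# The stored copy still carries the old label until it is re-applied ...
stale = [d["id"] for d in output_collection if d.get("label") == "shoes"]
# ... while the on-demand path sees the new label immediately.
fresh = [d["id"] for d in query_on_demand(docs, taxonomy, "shoes")]
```

After the taxonomy change, `stale` is empty and `fresh` contains document 1: the materialized collection answers queries without recomputation but must be re-applied to pick up the change, whereas the on-demand path pays the enrichment cost on every query.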
Taxonomy Node Structure
In a hierarchical taxonomy, each node is a document in a collection and follows a defined structure.
Property Inheritance
In hierarchical taxonomies, properties are inherited from parent to child nodes:
- Child nodes inherit all properties from their parent nodes
- Child nodes can override inherited properties with their own values
- Inheritance follows the collection hierarchy defined in the taxonomy
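The inheritance rules above can be sketched as a merge from the root down to the node, so that values set closer to the node win. The node layout (`parent` and `properties` keys) and the `resolve_properties` helper are assumptions for illustration, not Mixpeek's node schema.

```python
def resolve_properties(node_id, nodes):
    """Merge properties from the root ancestor down to node_id.

    Later (deeper) nodes overwrite earlier ones, so a child's own value
    overrides anything it inherited.
    """
    chain = []
    seen = set()
    current = node_id
    while current is not None:
        if current in seen:
            raise ValueError(f"cycle detected at node '{current}'")
        seen.add(current)
        node = nodes[current]
        chain.append(node)
        current = node.get("parent")

    resolved = {}
    for node in reversed(chain):  # root first, requested node last
        resolved.update(node.get("properties", {}))
    return resolved


# Hypothetical three-node chain: root -> shoes -> clearance_shoes.
nodes = {
    "root": {"parent": None, "properties": {"department": "apparel", "returnable": True}},
    "shoes": {"parent": "root", "properties": {"size_system": "EU"}},
    "clearance_shoes": {"parent": "shoes", "properties": {"returnable": False}},
}

resolved = resolve_properties("clearance_shoes", nodes)
```

In this example `clearance_shoes` inherits `department` from the root and `size_system` from `shoes`, and overrides the inherited `returnable` value with its own.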
Using Enriched Documents
Once you’ve applied a taxonomy, the enriched fields behave like any other document field, so you can use them in searches and filters.
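As a rough illustration of filtering and faceting on enriched fields, the sketch below works on plain dictionaries. The field names (`category`, `brand_tier`) and the helpers `filter_documents` and `facet_counts` are hypothetical, and the code does not call the Mixpeek search API.

```python
from collections import Counter


def filter_documents(docs, **criteria):
    """Keep documents whose fields equal every given criterion."""
    return [
        doc for doc in docs
        if all(doc.get(name) == value for name, value in criteria.items())
    ]


def facet_counts(docs, field_name):
    """Count how many documents carry each value of field_name."""
    return Counter(doc[field_name] for doc in docs if field_name in doc)


# Documents after enrichment; "category" and "brand_tier" are assumed to
# have been added by a taxonomy, "title" was there from the start.
enriched_docs = [
    {"title": "Trail runner", "category": "footwear", "brand_tier": "premium"},
    {"title": "Canvas slip-on", "category": "footwear", "brand_tier": "budget"},
    {"title": "Oak side table", "category": "furniture", "brand_tier": "premium"},
    {"title": "Unmatched item"},  # no taxonomy match, so no enriched fields
]

premium_footwear = filter_documents(enriched_docs, category="footwear", brand_tier="premium")
by_category = facet_counts(enriched_docs, "category")
```

Documents that the taxonomy did not match simply lack the enriched fields, so they drop out of filters on those fields and are not counted in the facets.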
Example Use Cases
Create a product taxonomy to automatically classify products and inherit category properties:
This creates a three-level product hierarchy where each level inherits properties from its parent.
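A three-level product hierarchy of this kind might be sketched as follows. The category names, the keyword sets, and the token-overlap matching are all assumptions for illustration: a real taxonomy would use a retriever for matching, and this is not the Mixpeek taxonomy definition format.

```python
# Hypothetical three-level tree: Electronics -> Phones/Laptops -> leaf categories.
TREE = {
    "name": "Electronics",
    "keywords": {"electronic", "device"},
    "properties": {"warranty_months": 12},
    "children": [
        {
            "name": "Phones",
            "keywords": {"phone", "mobile"},
            "properties": {"shipping_class": "small"},
            "children": [
                {"name": "Smartphones", "keywords": {"smartphone", "android", "ios"},
                 "properties": {"warranty_months": 24}, "children": []},
                {"name": "Feature Phones", "keywords": {"keypad", "basic"},
                 "properties": {}, "children": []},
            ],
        },
        {
            "name": "Laptops",
            "keywords": {"laptop", "notebook"},
            "properties": {"shipping_class": "large"},
            "children": [],
        },
    ],
}


def classify(text, root):
    """Walk down the tree, choosing the child with the largest keyword overlap
    at each level, and accumulate inherited properties along the way."""
    tokens = set(text.lower().split())
    path = [root["name"]]
    properties = dict(root["properties"])
    node = root
    while node["children"]:
        best = max(node["children"], key=lambda child: len(tokens & child["keywords"]))
        if not tokens & best["keywords"]:
            break  # no child matches; stop at the current level
        path.append(best["name"])
        properties.update(best["properties"])  # child values override inherited ones
        node = best
    return path, properties


path, properties = classify("Android smartphone mobile phone with 128GB storage", TREE)
```

For this input the walk ends at `Smartphones`, which keeps the `shipping_class` inherited from `Phones` and replaces the root's `warranty_months` with its own value.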
Best Practices
Design Thoughtful Hierarchies
For hierarchical taxonomies, carefully plan the levels and inheritance relationships to avoid redundancy and maximize usability.
Optimize Retriever Performance
Create efficient retrievers for taxonomy matching, focusing on the most distinctive features for each node type.
Consider Materialization Strategy
Choose between materialized and on-demand enrichment based on your query patterns, update frequency, and storage constraints.
Test with Representative Data
Validate taxonomy performance with representative test data before applying to your full collection.
Taxonomies with many nodes or complex hierarchies can impact application performance. Optimize your taxonomy structure and retriever configuration for the best balance of accuracy and performance.
Implementation Patterns
Dynamic Classification
Apply taxonomies to automatically classify new content as it’s ingested, using a pipeline hook to trigger taxonomy application.
Enriched Search
Use taxonomies to enrich documents with additional metadata that can be leveraged for more precise filtering and faceting in search.
Hierarchical Navigation
Create user interfaces that leverage hierarchical taxonomy structures for browsing and navigating content collections.
Compliance Tagging
Use taxonomies to automatically apply compliance or policy tags to content based on its characteristics.
API Reference
For complete details on working with taxonomies, see our Taxonomies API Reference.