GET
/
v1
/
collections
/
{collection_identifier}
/
documents
/
{document_id}
Get a document by ID.
curl --request GET \
  --url https://api.mixpeek.com/v1/collections/{collection_identifier}/documents/{document_id}
{
  "document_id": "<string>",
  "collection_id": "<string>",
  "object_id": "<string>",
  "enrichments": {
    "clusters": [
      {
        "cluster_id": "cluster_001",
        "coordinates": [
          0.12,
          0.23,
          0.34,
          0.45,
          0.56
        ],
        "distance_to_centroid": 0.15,
        "document_id": "doc_123",
        "features": {
          "category": "electronics",
          "product_name": "Widget A"
        },
        "source_details": {
          "source_id": "col_products_v1",
          "type": "collection"
        }
      }
    ],
    "taxonomies": [
      {
        "label": "Mobile Phones",
        "node_id": "node_electronics_phones",
        "path": [
          "products",
          "electronics",
          "phones"
        ],
        "score": 0.87,
        "taxonomy_id": "tax_products"
      }
    ]
  },
  "source_blobs": [
    {}
  ],
  "internal_metadata": {},
  "metadata": {},
  "vector": [
    123
  ],
  "presigned_url": "<string>"
}

Headers

Authorization
string | null

Bearer token authentication using your API key. Format: 'Bearer your_api_key'. To get an API key, create an account at mixpeek.com/start and generate a key in your account settings. Example: 'Bearer sk_1234567890abcdef'

X-Namespace
string | null

Optional namespace for data isolation. This can be a namespace name or namespace ID. Example: 'netflix_prod' or 'ns_1234567890'. To create a namespace, use the /namespaces endpoint.

Path Parameters

collection_identifier
string
required

The ID of the collection.

document_id
string
required

The ID of the document to retrieve.

Query Parameters

return_url
boolean | null
default:false
return_vectors
boolean | null
default:false

Response

Successful Response

Response model for a single document.

document_id
string
required

ID of the document

collection_id
string
required

ID of the collection

object_id
string
required

ID of the source object for this document

enrichments
object | null

Enrichments from clusters, taxonomies, joins, etc. Container for document enrichments from various sources.

Uses existing models where appropriate to ensure consistency and reuse (e.g., ClusterMember for cluster assignments).

source_blobs
Source Blobs · object[]

A list of the source blobs for the object.

internal_metadata
object

Internal metadata calculated during processing.

metadata
object

Metadata associated with the document

vector
number[] | null

Vector embedding for the document

presigned_url
string | null

Presigned URL for the document