Available to Enterprise customers only. This is also a very hands on activity and typically we manage it end-to-end.

1

Send Annotated Data

Send your annotated data to an S3 bucket. Ensure the data is well-organized and labeled according to the model’s requirements (we’ll provide specs)

2

Initiate Fine-Tuning

Use the Mixpeek Dashboard (and soon to be API) to schedule a fine-tuning job. Specify the base model_id, the S3 bucket path, and specs that tell the fine-tuner how to run:

mixpeek.models.tune(
  model_id="vuse-generic-v1",
  annotation={
    "connection_id": "conn_123",
    "specs": "specs.json"
  }
)

This will return a new model_id, for example: model_1askdh2390, which you’ll then use in your methods like:

  mixpeek.embed.text(model_id="model_1askdh2390", input="hello")
3

Version Control

It’s recommended to store the version_id of each model state as it is fine-tuned.

If you append a model_version in the metadata of the pipeline it’ll automatically be added to your database so you can do query-time filtering:

mixpeek.pipelines.create(
  ...
  destination={
    ...
    "metadata": {
      "model_version": "model_1askdh2390"
    }
  }
)