1. Create a Bucket
2. Create a Collection
sentence– Best for Q&Aparagraph– Best for long-form contentfixed– Predictable token windows
3. Ingest Documents
4. Create a Retriever
5. Search
Hybrid Search (Vector + BM25)
Combine semantic and keyword matching:Pre-Filter by Metadata
Filter before vector search for efficiency:Reranking
Use a cross-encoder for better accuracy:Model Options
| Model | Speed | Use Case |
|---|---|---|
multilingual-e5-base | Fast | High-volume |
multilingual-e5-large-instruct | Medium | General-purpose |
bge-large-en-v1.5 | Medium | English-only |
openai/text-embedding-3-large | Slow | Premium |

