Endpoints Overview
| Endpoint | Method | Returns |
|---|---|---|
/analytics/retrievers/{id}/performance | GET | Latency percentiles (P50/P95/P99), query counts, trends |
/analytics/retrievers/{id}/stages | GET | Per-stage execution times and document flow |
/analytics/retrievers/{id}/signals | GET | Operational signals (cache hits, rerank scores, filter reduction) |
/analytics/retrievers/{id}/cache-performance | GET | Cache hit/miss rates, latency savings |
/analytics/retrievers/{id}/slow-queries | GET | Slowest queries with stage-level breakdown |
/analytics/retrievers/{id}/analyze-tuning | POST | AI-powered parameter tuning recommendations |
Performance Metrics
Get latency percentiles and query volume over time:| Parameter | Type | Default | Description |
|---|---|---|---|
start_date | datetime | — | Start of time range (UTC) |
end_date | datetime | — | End of time range (UTC) |
group_by | string | "hour" | Time grouping: hour, day, week |
Stage Breakdown
Understand which stages consume the most time:| Parameter | Type | Default | Description |
|---|---|---|---|
hours | integer | 24 | Hours of history (1–720) |
Slow Queries
Find the queries that take the longest to execute:| Parameter | Type | Default | Description |
|---|---|---|---|
limit | integer | 10 | Number of slow queries to return |
hours | integer | 24 | Hours of history (1–720) |
Cache Performance
Monitor how effectively caching reduces latency:Retriever Signals
Get raw operational signals for debugging:| Signal | Description |
|---|---|
cache_hit | Query served from cache |
cache_miss | Cache miss, full execution |
rerank_scores | Score distribution from rerank stage |
filter_reduction | How much the filter stage reduced the candidate set |
expansion_results | Query expansion output |
AI-Powered Tuning
Get automated recommendations for improving your retriever:| Parameter | Type | Default | Description |
|---|---|---|---|
days | integer | 7 | Days of history to analyze (1–90) |
Identifying Relevance Issues
Use analytics to spot relevance problems:| Symptom | Analytics Signal | Likely Cause | Action |
|---|---|---|---|
| High latency, normal results | Stage breakdown shows slow rerank | Too many candidates entering rerank | Reduce top_k or add a limit stage before rerank |
| Low click-through rate | Interaction signals show high skip rate | Poor ranking or irrelevant results | Check fusion strategy, consider learned fusion |
| Cache hit rate dropping | Cache performance shows increasing misses | Query diversity increasing or TTL too short | Adjust cache strategy, review in caching best practices |
| Inconsistent latency | Slow queries show specific patterns | Certain query types trigger expensive paths | Add pre-filters or query-specific optimization |
Monitoring Cadence
| Frequency | Check | Tools |
|---|---|---|
| Daily | Slow queries, P95 latency | /slow-queries, /performance |
| Weekly | Stage breakdown, cache efficiency, interaction trends | /stages, /cache-performance, /signals |
| Monthly | AI tuning analysis, full evaluation run | /analyze-tuning, Evaluations |
Related
- Evaluations — offline quality measurement
- Benchmarks — historical session replay
- Caching Strategies — optimizing cache performance
- Interaction Signals — capturing user behavior data

