View extractor details at api.mixpeek.com/v1/collections/features/extractors/face_identity_extractor_v1 or fetch programmatically with
GET /v1/collections/features/extractors/{feature_extractor_id}.When to Use
| Use Case | Description |
|---|---|
| Face verification | 1:1 matching to verify identity |
| Face identification | 1:N search to identify a person in a database |
| Face clustering | Group photos by person automatically |
| Employee verification | Workplace identity systems |
| Photo organization | Organize photo libraries by people |
| Surveillance | Security and monitoring applications |
When NOT to Use
| Scenario | Recommended Alternative |
|---|---|
| General image search | image_extractor |
| Object/scene detection | multimodal_extractor |
| Video content analysis | multimodal_extractor |
| Non-face biometrics | Specialized extractors |
Supported Input Types
| Input | Type | Description | Processing |
|---|---|---|---|
image | string | URL or S3 path | Detect and embed all faces |
video | string | URL or S3 path | Sample frames, detect faces, deduplicate |
video_frame | string | URL or S3 path | Treated as image |
- Image: JPEG, PNG, WebP, BMP
- Video: MP4, MOV, AVI, MKV, WebM
Input Schema
Provide one of the following inputs:| Field | Type | Description |
|---|---|---|
image | string | Image URL or S3 path containing faces |
video | string | Video URL or S3 path. Subject to max_video_length limit |
video_frame | string | Single video frame URL or S3 path (treated as image) |
Output Schema
Each detected face produces one document with the following fields:| Field | Type | Description |
|---|---|---|
face_identity_extractor_v1_embedding | float[512] | ArcFace embedding, L2 normalized |
face_index | integer | Index of this face in source image (0-based) |
bbox | object | Bounding box {x1, y1, x2, y2, width, height} |
detection_score | number | SCRFD detection confidence (0.0-1.0) |
landmarks | object | 5 facial landmarks for alignment |
quality_score | number | Face quality score (0.0-1.0) |
quality_components | object | Quality component scores (blur, size, etc.) |
aligned_face_crop | string | Base64 aligned 112x112 face crop (optional) |
frame_number | integer | Frame number in source video |
timestamp | number | Timestamp in source video (seconds) |
embedding_model | string | Embedding model used |
detection_model | string | Detection model used |
processing_time_ms | number | Processing time (milliseconds) |
Parameters
Detection Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
detection_model | string | "scrfd_2.5g" | SCRFD model variant |
min_face_size | integer | 20 | Minimum face size in pixels to detect |
detection_threshold | float | 0.5 | Confidence threshold (0.0-1.0) |
max_faces_per_image | integer | null | Maximum faces to process per image |
Detection Models
| Model | Speed | Accuracy | Best For |
|---|---|---|---|
scrfd_500m | 2-3ms | Good | Real-time applications |
scrfd_2.5g | 5-7ms | Better | Recommended - balanced |
scrfd_10g | 10-15ms | Best | Maximum accuracy |
Embedding Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
embedding_model | string | "arcface_r100" | Face embedding model |
normalize_embeddings | boolean | true | L2-normalize to unit vectors |
Embedding Models
| Model | Accuracy (LFW) | Speed | Notes |
|---|---|---|---|
arcface_r100 | 99.8%+ | Standard | Recommended - highest accuracy |
arcface_r50 | 99.5%+ | Faster | Slightly lower accuracy |
magface_r100 | 99.7%+ | Standard | Includes built-in quality score |
Quality Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
enable_quality_scoring | boolean | true | Compute quality scores (adds ~5ms per face) |
quality_threshold | float | null | Minimum quality to index (null = index all) |
null- Index all detected faces0.5- Moderate filtering (removes low quality)0.7- High quality only
Video Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
max_video_length | integer | 60 | Maximum video length in seconds |
video_sampling_fps | float | 1.0 | Frames per second to sample |
video_deduplication | boolean | true | Remove duplicate faces across frames |
video_deduplication_threshold | float | 0.8 | Cosine similarity for deduplication |
Output Parameters
| Parameter | Type | Default | Description |
|---|---|---|---|
output_mode | string | "per_face" | per_face or per_image |
include_face_crops | boolean | false | Include aligned 112x112 face crops as base64 |
store_detection_metadata | boolean | true | Store bbox, landmarks, detection scores |
Configuration Examples
Face Matching
Use cosine similarity to match faces:| Similarity Score | Interpretation |
|---|---|
| > 0.30 | Very likely same person |
| 0.25 - 0.30 | Likely same person |
| 0.20 - 0.25 | Possibly same person |
| < 0.20 | Different people |
Performance & Costs
| Metric | Value |
|---|---|
| Detection accuracy | 99%+ (WIDER FACE benchmark) |
| Verification accuracy | 99.8%+ (LFW benchmark) |
| Processing speed | Detection: 5-7ms, Embedding: 10-15ms per face |
| Cost per image | 30 credits base |
| Cost per face | 30 credits additional per detected face |
Video Processing
- Deduplication: Reduces 90-95% redundancy in video
- Sampling: 1 FPS recommended for most use cases
- Max length: 300 seconds (extraction only)
Vector Index
| Property | Value |
|---|---|
| Index name | face_identity_extractor_v1_embedding |
| Dimensions | 512 |
| Type | Dense |
| Distance metric | Cosine |
| Datatype | float32 |
| Inference model | face_identity_arcface_r100_v1 |
Pipeline Overview
- SCRFD Detection - Bounding boxes + 5 landmarks
- 5-Point Affine Alignment - 112x112 canonical face
- ArcFace Embedding - 512-d L2-normalized vector
- Quality Scoring (optional) - Filter low-quality faces
Limitations
- Face only: Does not identify age, gender, or expressions
- Pose sensitivity: Extreme angles may reduce accuracy
- Occlusion: Masks, glasses, hair may affect detection
- Resolution: Minimum 20px face size, 40px+ recommended
- Lighting: Poor lighting reduces quality scores
- Video length: Maximum 300 seconds per video

