Maximum — mixed-120-v2¶
Maximum temporal redundancy. Analyzes 120 frames (4 seconds of video) for the highest scrutiny scenarios and regulatory edge cases.
At a Glance¶
| Model ID | mixed-120-v2 |
| Frames required | 120 |
| Capture time | ~4 seconds at 30 FPS |
| Avg response time | ~10s |
| Security level | Maximum |
When to Use¶
- Highest scrutiny — regulatory edge cases, government services, high-value transactions
- Final escalation — when Extended (mixed-90-v2) is still borderline
- Maximum temporal redundancy — where 4 seconds of capture is acceptable
- Compliance mandates — when auditors or regulators require the longest temporal window
Performance¶
Based on evaluation across 1,960 test videos (9 attack types, held out from training):
| EER | 4.6% |
| AUC | 0.991 |
| Balanced accuracy | 94.7% |
| Avg response | ~10s |
What this means
The 120-frame model offers the highest temporal coverage in the Mixed V2 family. Strong AUC (0.991) and balanced accuracy (94.7%) — best for final-level escalation when latency is acceptable.
Tradeoffs¶
- Maximum redundancy — 4 seconds of video provides the most temporal signal
- Strong accuracy — 4.6% EER, 0.991 AUC (on par with Balanced and Thorough)
- Highest latency — ~4s capture + ~10s response; users must tolerate a longer check
- Final escalation — use as the last step when Extended is inconclusive
Integration¶
{
"session_id": "550e8400-e29b-41d4-a716-446655440000",
"model": "mixed-120-v2",
"source": "live",
"frames": [
{ "index": 0, "timestamp_ms": 0, "pixels": "..." },
...
]
}
Capture 120 frames at 30 FPS (4 seconds total). Credits: 2 per request.
See Overview for the full escalation pattern
Use Maximum when Extended returns borderline and you need the highest possible temporal coverage before making a final decision.