Maximum — mixed-120-v2¶

Maximum temporal redundancy. Analyzes 120 frames (4 seconds of video at 30 FPS) for the highest-scrutiny scenarios and regulatory edge cases.

At a Glance¶

Model ID	`mixed-120-v2`
Frames required	120
Capture time	~4 seconds at 30 FPS
Avg response time	~10s
Security level	Maximum

When to Use¶

Highest scrutiny — regulatory edge cases, government services, high-value transactions
Final escalation — when Extended (mixed-90-v2) is still borderline
Maximum temporal redundancy — where 4 seconds of capture is acceptable
Compliance mandates — when auditors or regulators require the longest temporal window

Performance¶

Based on evaluation across 1,960 test videos (9 attack types, held out from training):

EER	4.6%
AUC	0.991
Balanced accuracy	94.7%
Avg response	~10s

What this means

The 120-frame model offers the highest temporal coverage in the Mixed V2 family. With a strong AUC (0.991) and balanced accuracy (94.7%), it is best suited to final-level escalation when the added latency is acceptable.
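For readers less familiar with the EER figure: the equal error rate is the error level at the decision threshold where the rate of accepted attacks equals the rate of rejected genuine users. The sketch below illustrates that definition only. It assumes that a higher score means "more likely genuine" (a convention this page does not state), uses synthetic scores, and is not the evaluation procedure behind the numbers above. The function name `equal_error_rate` is hypothetical.

```python
def equal_error_rate(genuine_scores, attack_scores):
    """Approximate the EER by sweeping every observed score as a threshold.

    Assumption: a sample is accepted when score >= threshold,
    i.e. higher scores mean "more likely genuine".
    """
    thresholds = sorted(set(genuine_scores) | set(attack_scores))
    best_gap = None
    best_rate = None
    for t in thresholds:
        # False reject rate: genuine samples that fall below the threshold.
        frr = sum(s < t for s in genuine_scores) / len(genuine_scores)
        # False accept rate: attack samples that reach the threshold.
        far = sum(s >= t for s in attack_scores) / len(attack_scores)
        gap = abs(far - frr)
        if best_gap is None or gap < best_gap:
            best_gap = gap
            best_rate = (far + frr) / 2
    return best_rate


# Synthetic scores for illustration only.
genuine = [0.9, 0.8, 0.7, 0.4]
attacks = [0.1, 0.2, 0.3, 0.6]
eer = equal_error_rate(genuine, attacks)
```

A lower EER means the two error types can be balanced at a lower common rate; 4.6% is the figure reported for this model.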

Tradeoffs¶

Maximum redundancy — 4 seconds of video provides the most temporal signal
Strong accuracy — 4.6% EER, 0.991 AUC (on par with Balanced and Thorough)
Highest latency — ~4s capture + ~10s response; users must tolerate a longer check
Final escalation — use as the last step when Extended is inconclusive
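The latency tradeoff above comes down to simple arithmetic: capture time is the frame count divided by the frame rate, and the average response time is added on top. The sketch below uses the approximate figures from this page. The helper name `expected_wait_seconds` is hypothetical, and real timings will vary with network and device conditions.

```python
def expected_wait_seconds(frames=120, fps=30, avg_response_s=10.0):
    """Rough end-to-end wait: capture duration plus average response time."""
    capture_s = frames / fps  # 120 frames at 30 FPS is 4 seconds
    return capture_s + avg_response_s


total_wait = expected_wait_seconds()
```

With the defaults this comes to about 14 seconds per check, which is the budget a user flow needs to accommodate before it escalates to Maximum.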

Integration¶

{
  "session_id": "550e8400-e29b-41d4-a716-446655440000",
  "model": "mixed-120-v2",
  "source": "live",
  "frames": [
    { "index": 0, "timestamp_ms": 0, "pixels": "..." },
    ...
  ]
}

Capture 120 frames at 30 FPS (4 seconds total). Credits: 2 per request.
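As a rough illustration, a request body with the shape shown above could be assembled as follows. This is a sketch under stated assumptions, not a reference client. The field names come from the example payload, but the helper `build_request`, the idea that `timestamp_ms` is each frame's offset from the first frame at an even 30 FPS spacing, and the treatment of `pixels` as an opaque, already-encoded string are all assumptions. The pixel encoding is elided on this page and is left to the caller here.

```python
MODEL_ID = "mixed-120-v2"
FRAMES_REQUIRED = 120
CAPTURE_FPS = 30


def build_request(session_id, encoded_frames):
    """Assemble a request body for the 120-frame model.

    encoded_frames: 120 pixel payloads, already encoded by the caller
    (the encoding is not specified here, so it is treated as opaque).
    """
    if len(encoded_frames) != FRAMES_REQUIRED:
        raise ValueError(
            f"{MODEL_ID} needs {FRAMES_REQUIRED} frames, got {len(encoded_frames)}"
        )
    frame_interval_ms = 1000 / CAPTURE_FPS
    frames = [
        {
            "index": i,
            # Assumption: offset from the first frame, evenly spaced.
            "timestamp_ms": round(i * frame_interval_ms),
            "pixels": pixels,
        }
        for i, pixels in enumerate(encoded_frames)
    ]
    return {
        "session_id": session_id,
        "model": MODEL_ID,
        "source": "live",
        "frames": frames,
    }


# Placeholder payloads stand in for real encoded frames.
request_body = build_request(
    "550e8400-e29b-41d4-a716-446655440000",
    ["<encoded frame>"] * FRAMES_REQUIRED,
)
```

Validating the frame count before sending avoids spending credits on a request that does not carry the full 4-second window. In practice, timestamps taken from the capture device are preferable to the evenly spaced values computed here.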

See Overview for the full escalation pattern.

Use Maximum when Extended returns borderline and you need the highest possible temporal coverage before making a final decision.
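The final escalation step described here can be sketched as a small decision helper. The response format is not shown on this page, so the `"borderline"` verdict string and the function `next_model` are hypothetical. Only the two model IDs are taken from the text, and the earlier tiers covered in Overview are deliberately left out.

```python
EXTENDED = "mixed-90-v2"
MAXIMUM = "mixed-120-v2"


def next_model(current_model, verdict):
    """Return the model to escalate to, or None when no further check applies.

    verdict is a hypothetical label for the outcome of the previous check;
    "borderline" stands for a result too close to call.
    """
    if current_model not in (EXTENDED, MAXIMUM):
        # Earlier tiers are outside the scope of this sketch.
        raise ValueError(f"model not covered by this sketch: {current_model}")
    if verdict != "borderline":
        return None  # A clear result needs no escalation.
    if current_model == EXTENDED:
        return MAXIMUM  # Extended was inconclusive: run the final step.
    return None  # Maximum is the last step; the caller must decide.


follow_up = next_model(EXTENDED, "borderline")
```

Returning `None` after a borderline Maximum result makes it explicit that no longer model exists to fall back on, so the final decision rests with the integrating application.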