🎥 Real-Time Camera Captioning with Debug Logs
Model Size
Decoder Weights
CLIP Weights
Verbose Mode
CPU Threads (n_threads)
↺
1
16
Interval (ms)
↺
100
20000
System Prompt
Focus on key dramatic action…
User Prompt
Analyze the provided image and determine if any person is lying on the floor. Output "YES" only if at least one person is clearly lying down on a floor or flat surface (e.g., fully reclined, supine, prone, or in a fetal position). Output "NO" in all other cases, including if no person is present, if people are only standing, sitting, kneeling, crouching, or if the position is ambiguous (e.g., partially on the floor but not fully lying down). Respond with exactly "YES" or "NO" — no additional text, explanations, or punctuation.
Reset CLIP handler each frame
Webcam Feed
Drop Image Here
- or -
Click to Upload
Click to Access Webcam
Caption
Debug Log