TRACK EVERYMOVE
Detect a cat, hold its identity, and say exactly how far it is — in meters, with an honest confidence interval — from one ordinary camera. Then see where it will be, so you frame the shot before it lands.
- detector approaches
- 2
- every estimate
- ± CI
- FOV undistort
- 120°
detector approaches
every estimate
FOV undistort
Core capabilities
Three things, done honestly.
No magic numbers. Every distance carries its uncertainty, every prediction shows its lead time, and every robot move has a hard safety stop.
Distance ± CI
Pinhole geometry and metric depth are fused into a confidence-weighted median, so you get meters with a real uncertainty band — not a single hopeful number.
Where it will be
Per-track motion is extrapolated over an adjustable lead time. A ghost box shows the landing spot and its predicted distance, so the camera is ready before the cat is.
Robot follow
An anti-oscillation controller drives the chassis to hold a safe distance and keep the cat centered — smooth, with a hard ultrasonic stop in firmware.
How it works
One frame, four stages.
Pretrained perception plus classical camera geometry — no training required to run. The same pipeline drives the overlay, the JSON, and the robot.
Detect
YOLO11 or RT-DETR finds the cat (COCO class 15) on the undistorted frame.
Track
BoT-SORT + ReID keeps a stable identity through full occlusion.
Distance
Geometry ⊕ metric depth → meters ± a conformal confidence interval.
Predict & follow
Extrapolate the next position, then drive the robot to hold its distance.
Hardware
Fully wireless. One laptop brain.
The laptop talks to the camera over Wi-Fi and the robot over Bluetooth — no gateway. Connect it all from the admin view in a few clicks.
Tapo C211
EyesWi-Fi · RTSP
Live frames over the network, with optional pan/tilt.
Arduino Mega
MotionBluetooth · Serial
Drives the wheels and the camera-pan servo from the laptop brain.
HC-SR04
Ground truth±1 cm ultrasonic
Confirms the model live: “1.84 m” estimated, “1.86 m” measured.
Point a camera. Know the distance.
Run the console locally, connect your camera and robot, and start a session. The pretrained baseline runs out of the box — no training, no calibration to get a first reading.