TL; DR. We propose a novel perspective to regard the multiple object tracking task as an in-context ID prediction problem. Given a set of trajectories carried with ID information, MOTIP directly ...
Abstract: Multiple Instance Learning (MIL) has demonstrated promise in Whole Slide Image (WSI) classification. However, a major challenge persists due to the high computational cost associated with ...
Abstract: Weakly-supervised Video Anomaly Detection (wVAD) aims to detect frame-level anomalies using only video-level labels in training. Due to the limitation of coarse-grained labels, ...