AI models still lose track of who is who and what's happening in a movie. A new system orchestrates face recognition and staged summarization, keeping characters straight, and plots coherent across ...
Abstract: Modern railways continue to strive for remote and automated methods to improve the visual inspection procedures for their assets. In some cases, these inspections provide new information ...
Abstract: Vision Foundation Models (VFMs), such as DINOv2 and SAM, have demonstrated unprecedented generalizability in natural imaging and show strong promise in medical imaging due to their ...