Abstract: Event camera-based visual tracking has drawn more and more attention in recent years due to the unique imaging principle and advantages of low energy consumption, high dynamic range, and ...
Abstract: Object pose estimation is a core means for robots to understand and interact with their environment. For this task, monocular category-level methods are attractive as they require only a ...
A comprehensive repository for fine-tuning the Donut model for document image classification and parsing tasks. This project provides optimized training pipelines using Hugging Face Transformers with ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results