Robuta

https://arxiv.org/abs/2506.13589v2
Abstract page for arXiv paper 2506.13589v2: AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding
omnicontextualadaptiveretrievalaugmented
https://arxiv.org/abs/2601.16155
Abstract page for arXiv paper 2601.16155: HVD: Human Vision-Driven Video Representation Learning for Text-Video Retrieval
human visionrepresentation learninghvddrivenvideo
https://aclanthology.org/2024.parlaclarin-1.21/
Mikitaka Masuyama, Tatsuya Kawahara, Kenjiro Matsuda. Proceedings of the IV Workshop on Creating, Analysing, and Increasing Accessibility of Parliamentary...
automatic speech recognitionvideo retrievalthe japanesesystemusing