PDF] LoL-V2T: Large-Scale Esports Video Description Dataset

Por um escritor misterioso
Last updated 21 novembro 2024
PDF] LoL-V2T: Large-Scale Esports Video Description Dataset
The dataset, which the authors call LoL-V2T, is the largest video description dataset in the video game domain, and includes 9,723 clips with 62,677 captions, and the masking can significantly improve performance. Esports is a fastest-growing new field with a largely online-presence, and is creating a demand for automatic domain-specific captioning tools. However, at the current time, there are few approaches that tackle the esports video description problem. In this work, we propose a large-scale dataset for esports video description, focusing on the popular game "League of Legends". The dataset, which we call LoL-V2T, is the largest video description dataset in the video game domain, and includes 9,723 clips with 62,677 captions. This new dataset presents multiple new video captioning challenges such as large amounts of domain-specific vocabulary, subtle motions with large importance, and a temporal gap between most captions and the events that occurred. In order to tackle the issue of vocabulary, we propose a masking the domain-specific words and provide additional annotations for this. In our results, we show that the dataset poses a challenge to existing video captioning approaches, and the masking can significantly improve performance. Our dataset and code is publicly available1.
PDF] LoL-V2T: Large-Scale Esports Video Description Dataset
PDF] LoL-V2T: Large-Scale Esports Video Description Dataset
PDF] LoL-V2T: Large-Scale Esports Video Description Dataset
Fine-Grained Video Captioning for Sports Narrative
PDF] LoL-V2T: Large-Scale Esports Video Description Dataset
PDF] LoL-V2T: Large-Scale Esports Video Description Dataset
PDF] LoL-V2T: Large-Scale Esports Video Description Dataset
League of Legends
PDF] LoL-V2T: Large-Scale Esports Video Description Dataset
Video Captioning with Transferred Semantic Attributes
PDF] LoL-V2T: Large-Scale Esports Video Description Dataset
Edgar Simo-Serra's research works
PDF] LoL-V2T: Large-Scale Esports Video Description Dataset
PDF] MSR-VTT: A Large Video Description Dataset for Bridging Video
PDF] LoL-V2T: Large-Scale Esports Video Description Dataset
PDF) Fighting Game Commentator with Pitch and Loudness Adjustment
PDF] LoL-V2T: Large-Scale Esports Video Description Dataset
GitHub - google-research-datasets/Video-Timeline-Tags-ViTT: A

© 2014-2024 likytut.eu. All rights reserved.