RacingGame Commentary Dataset
Race game video, audio commentary, trext transcript and structured data
Public data
The dataset contains the following data (cca 700GB):- Game video (mp4 format with original game audio, two types: driver's perspective and aerial perspective)
- Audio commentary (mp3 format)
- Audio commentary subtitles (srt format)
- Structured data on race conditions (via Assetto Corsa API)
Sample
Notice
- Redistribution and commercial use are prohibited.
- AIST will not bear any responsibility whatsoever for any incidents caused by the use of this data.
- If you cannot find the email, please check your spam folder.
If you publish any results from research based on this dataset, please cite the following:
- Generating Racing Game Commentary from Vision, Language, and Structured Data, Tatsuya Ishigaki, Goran Topić, Yumi Hamazono, Hiroshi Noji, Ichiro Kobayashi, Yusuke Miyao, Hiroya Yakamura, Proceedings of the 14th International Conference on Natural Language Generation (INLG2021) [bibtex]