A New Dataset for Multimodal Intent Recognition
| Total number of coarse-grained intents | 2 |
| Total number of fine-grained intents | 20 |
| Total number of videos | 43 |
| Total number of video segments | 2,224 |
| Total number of words in text utterances | 15,658 |
| Total number of unique words in text utterances | 2,562 |
| Average length of text utterances | 7.04 |
| Maximum length of text utterances | 26 |
| Average length of video segments (s) | 2.38 |
| Maximum length of video segments (s) | 9.59 |
Please cite the following papers if you use this dataset in your work.
@inproceedings{10.1145/3503161.3547906,
author = {Zhang, Hanlei and Xu, Hua and Wang, Xin and Zhou, Qianrui and Zhao, Shaojie and Teng, Jiayan},
title = {MIntRec: A New Dataset for Multimodal Intent Recognition},
year = {2022},
doi = {10.1145/3503161.3547906},
booktitle = {Proceedings of the 30th ACM International Conference on Multimedia},
pages = {1688–1697},
numpages = {10}
}