Multimodal Emotion Classification

Title: Multimodal Emotion Classification
Publication Type: Conference Paper
Year of Publication: 2019
Authors: Anurag Illendula, Amit Sheth
Conference Name: 2019 World Wide Web Conference
Date Published: 05/2019
Conference Location: San Francisco, CA, USA
Keywords: Emoji Understanding, Emotion Classification, Multimodal Analysis
Abstract

Most NLP and Computer Vision tasks are limited by the scarcity of labelled data. In social media emotion classification and other related tasks, hashtags have been used as indicators to label data. With the rapid increase in emoji usage on social media, emojis are used as an additional feature for major social NLP tasks. However, this is less explored in the case of multimedia posts on social media, where posts are composed of both images and text. At the same time, we have seen a surge of interest in incorporating domain knowledge to improve machine understanding of text. In this paper, we investigate whether domain knowledge for emojis can improve the accuracy of the emotion classification task. We exploit the importance of different modalities from social media posts for the emotion classification task using state-of-the-art deep learning architectures. Our experiments demonstrate that the three modalities (text, emoji, and images) encode different information to express emotion and can therefore complement each other. Our results also demonstrate that emoji sense depends on the textual context, and that emoji combined with text encodes better information than either considered separately. The highest accuracy of 71.98% is achieved with a training set of 550k posts.
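The paper itself describes the architectures used; purely as an illustration of the idea that text, emoji, and image modalities can complement each other, the sketch below shows one common way to combine per-modality embeddings by late fusion (concatenation followed by a classifier). The feature dimensions, layer choices, and number of emotion classes are assumptions for the example, not the authors' model.

    import torch
    import torch.nn as nn

    class LateFusionEmotionClassifier(nn.Module):
        """Illustrative late-fusion classifier: each modality (text, emoji,
        image) is encoded separately, then the embeddings are concatenated
        and passed to a classifier. All sizes here are hypothetical."""

        def __init__(self, text_dim=300, emoji_dim=300, image_dim=2048,
                     hidden_dim=256, num_emotions=6):
            super().__init__()
            # Project each modality's pre-computed features to a shared size.
            self.text_proj = nn.Linear(text_dim, hidden_dim)
            self.emoji_proj = nn.Linear(emoji_dim, hidden_dim)
            self.image_proj = nn.Linear(image_dim, hidden_dim)
            # Classify over the concatenation of the three projections.
            self.classifier = nn.Sequential(
                nn.ReLU(),
                nn.Linear(3 * hidden_dim, num_emotions),
            )

        def forward(self, text_feats, emoji_feats, image_feats):
            fused = torch.cat([
                self.text_proj(text_feats),
                self.emoji_proj(emoji_feats),
                self.image_proj(image_feats),
            ], dim=-1)
            return self.classifier(fused)

    # Example with random stand-in features for a batch of 4 posts.
    model = LateFusionEmotionClassifier()
    logits = model(torch.randn(4, 300), torch.randn(4, 300), torch.randn(4, 2048))
    print(logits.shape)  # torch.Size([4, 6])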

Full Text

Citation:
Anurag Illendula and Amit Sheth. 2019. Multimodal Emotion Classification.
In Companion Proceedings of the 2019 World Wide Web Conference (WWW
’19 Companion), May 13–17, 2019, San Francisco, CA, USA. ACM, New York,
NY, USA, 11 pages. https://doi.org/10.1145/3308560.3316549
