References & Citations
Computer Science > Computation and Language
Title: Multi-modal Stance Detection: New Datasets and Model
(Submitted on 22 Feb 2024 (v1), last revised 17 May 2024 (this version, v2))
Abstract: Stance detection is a challenging task that aims to identify public opinion from social media platforms with respect to specific targets. Previous work on stance detection largely focused on pure texts. In this paper, we study multi-modal stance detection for tweets consisting of texts and images, which are prevalent in today's fast-growing social media platforms where people often post multi-modal messages. To this end, we create five new multi-modal stance detection datasets of different domains based on Twitter, in which each example consists of a text and an image. In addition, we propose a simple yet effective Targeted Multi-modal Prompt Tuning framework (TMPT), where target information is leveraged to learn multi-modal stance features from textual and visual modalities. Experimental results on our three benchmark datasets show that the proposed TMPT achieves state-of-the-art performance in multi-modal stance detection.
Submission history
From: Ang Li [view email][v1] Thu, 22 Feb 2024 05:24:19 GMT (3485kb,D)
[v2] Fri, 17 May 2024 13:36:48 GMT (3486kb,D)
Link back to: arXiv, form interface, contact.