Crossmark

Reading between modalities: multimodal hate speech detection in low-resource Indonesian social media

Crossref DOI link: https://doi.org/10.1007/s42001-026-00473-4

Published Online: 2026-04-07

Published Print: 2026-05

Update policy: https://doi.org/10.1007/springer_crossmark_policy

Authors

Pamungkas, Endang Wahyu https://orcid.org/0000-0003-0156-6754
Syafiandini, Arida Ferti

Purworini, Dian

Widayat, Widi

Putri, Divi Galih Prasetyo

Amal, Ikhlasul

Song, Min
Funding

Funding for this research was provided by:

Universitas Muhammadiyah Surakarta (125.41/A.3-III/LRI/IV/2024)
License Information

Text and Data Mining valid from 2026-04-07

Version of Record valid from 2026-04-07
More Information

Article History

Received: 15 August 2025

Accepted: 19 February 2026

First Online: 7 April 2026

Declarations

:

: On behalf of all authors, the corresponding author states that there is no Conflict of interest.

: This research addresses a pressing societal issue-hate speech on social media-which can significantly impact individual well-being and social harmony. By developing automated detection systems, particularly in low-resource and culturally specific contexts such as Indonesian multimodal content, this study contributes to advancing responsible AI applications that promote healthier online environments. All data utilized in this study were obtained from publicly accessible sources, specifically Twitter (X), and conform to the platform’s Developer Agreement and Policy. The dataset only includes tweet IDs to ensure adherence to data-sharing ethics and privacy regulations. Although the tweets were annotated for hate speech content, we make no claims or assumptions regarding the intent or identity of the original authors. The annotation process was conducted by a team of three trained annotators with prior experience in analyzing social media content. Annotators were fully briefed on the sensitive nature of the task and were explicitly informed that the content might contain harmful or offensive material. They were encouraged to pause or discontinue their participation if the labeling process became emotionally distressing. All annotators received appropriate financial compensation for their contributions. This work promotes transparency and accountability by documenting its methodology and dataset construction in detail. Furthermore, the study emphasizes the importance of cultural sensitivity when analyzing hate speech in multilingual and multimodal settings, particularly in underrepresented regions. No human subjects were directly involved in the study beyond publicly available social media data, and no personal identifiable information (PII) is retained or disclosed.

Document is current

Any future updates will be listed below