MIPD: Exploring Manipulation and Intention In a Novel Corpus of Polish Disinformation
Da San Martino, Giovanni
2024
Abstract
This study presents a novel corpus of 15,356 Polish web articles, including articles identified as containing disinformation. Our dataset enables a multifaceted understanding of disinformation. We present a distinctive multilayered methodology for annotating disinformation in texts. What sets our corpus apart is its focus on uncovering hidden intent and manipulation in disinformative content. A team of experts annotated each article with multiple labels indicating both disinformation creators' intents and the manipulation techniques employed. Additionally, we set new baselines for binary disinformation detection and two multiclass multilabel classification tasks: manipulation techniques and intention types classification.