"Approaches to sentiment analysis of Hungarian political news at the sentence level"

Automated sentiment analysis of textual data is one of the central and most challenging tasks in political communication studies. However, the toolkits available are primarily for English texts and require contextual adaptation to produce valid results—especially concerning morphologically rich lang...

Teljes leírás

Elmentve itt :
Bibliográfiai részletek
Szerzők: Ring Orsolya
Szabó Martina Katalin
Guba Csenge
Váradi Bendegúz
Üveges István
Dokumentumtípus: Cikk
Megjelent: 2024
Sorozat:LANGUAGE RESOURCES AND EVALUATION 58
Tárgyszavak:
doi:10.1007/s10579-023-09717-5

mtmt:34753193
Online Access:http://publicatio.bibl.u-szeged.hu/29963
Leíró adatok
Tartalmi kivonat:Automated sentiment analysis of textual data is one of the central and most challenging tasks in political communication studies. However, the toolkits available are primarily for English texts and require contextual adaptation to produce valid results—especially concerning morphologically rich languages such as Hungarian. This study introduces (1) a new sentiment and emotion annotation framework that uses inductive approaches to identify emotions in the corpus and aggregate these emotions into positive, negative, and mixed sentiment categories, (2) a manually annotated sentiment data set with 5700 political news sentences, (3) a new Hungarian sentiment dictionary for political text analysis created via word embeddings, whose performance was compared with other available sentiment dictionaries. (4) Because of the limitations of sentiment analysis using dictionaries we have also applied various machine learning algorithms to analyze our dataset, (5) Last but not least to move towards state-of-the-art approaches, we have fine-tuned the Hungarian BERT-base model for sentiment analysis. Meanwhile, we have also tested how different pre-processing steps could affect the performance of machine-learning algorithms in the case of Hungarian texts.
Terjedelem/Fizikai jellemzők:1233-1261
ISSN:1574-020X