Regularization of word embeddings for multi-word expression identification

In this paper we compare the effects of applying various state-of-the-art word representation strategies in the task of multi-word expression (MWE) identification. In particular, we analyze the strengths and weaknesses of the usage of `1-regularized sparse word embeddings for identifying MWEs. Our e...

Teljes leírás

Elmentve itt :
Bibliográfiai részletek
Szerző: Berend Gábor
Dokumentumtípus: Cikk
Megjelent: 2018
Sorozat:Acta cybernetica 23 No. 3
Kulcsszavak:Többszavas kifejezések, Nyelvi elemzés, Szóbeágyazások, Nyelvészet - számítógép alkalmazása
Tárgyszavak:
Online Access:http://acta.bibl.u-szeged.hu/55678
Leíró adatok
Tartalmi kivonat:In this paper we compare the effects of applying various state-of-the-art word representation strategies in the task of multi-word expression (MWE) identification. In particular, we analyze the strengths and weaknesses of the usage of `1-regularized sparse word embeddings for identifying MWEs. Our earlier study demonstrated the effectiveness of regularized word embeddings in other sequence labeling tasks, i.e. part-of-speech tagging and named entity recognition, but it has not yet been rigorously evaluated for the identification of MWEs yet.
Terjedelem/Fizikai jellemzők:801-814
ISSN:0324-721X