Nasrin Taghizadeh and Hesham Faili (2016) Automatic Wordnet Development for Low-Resource Languages using Cross-Lingual WSD

Nasrin Taghizadeh and Hesham Faili (2016) "Automatic Wordnet Development for Low-Resource Languages using Cross-Lingual WSD", Volume 56, pages 61-87

PDF | doi:10.1613/jair.4968

‎Wordnets are an effective resource for natural language processing and information retrieval‎, ‎especially for semantic processing and meaning related tasks‎. ‎So far‎, ‎wordnets have been constructed for many languages‎. ‎However‎, ‎the automatic development of wordnets for low-resource languages has not been well studied‎. ‎In this paper‎, ‎an Expectation-Maximization algorithm is used to create high quality and large scale wordnets for poor-resource languages‎. ‎The proposed method benefits from possessing cross-lingual word sense disambiguation and develops a wordnet by only using a bi-lingual dictionary and a mono-lingual corpus‎. ‎The proposed method has been executed with Persian language and the resulting wordnet has been evaluated through several experiments‎. ‎The results show that the induced wordnet has a precision score of 90% and a recall score of 35%‎.

Click here to return to Volume 56 contents list