SenseDefs: a multilingual corpus of semantically annotated textual definitions: exploiting multiple languages and resources jointly for high-quality Word Sense Disambiguation and Entity Linking
Definitional knowledge has proved to be essential in various Natural Language Processing tasks and applications, especially when information at the level of word senses is exploited. However, the few sense-annotated corpora of textual definitions available to date are of limited size: this is mainly due to the expensive and time-consuming process of annotating a wide variety of word senses and entity mentions at a reasonably high scale.