Seeing is Correcting: curating lexical resources using social interfaces


This note describes OpenWordnet-PT, an automatically created, manually curated wordnet for Portuguese, and introduces the newly developed web interface we are using to speed up its manual curation. OpenWordnet-PT is part of a collection of wordnets for various languages, jointly described and distributed through the Open MultiLingual WordNet and the Global WordNet Association. From the beginning, OpenWordnet-PT has been distributed primarily as RDF files along with its model description in OWL, and it is freely available for download. We contend that the creation of such large, distributed, and linkable lexical resources is on the cusp of taking multilingual language processing to the next, truly semantic level. To get there, however, we need user interfaces that allow ordinary users and linguists (not only computational ones) to help check and clean up the resource. We present our proposal for one such web interface and describe the features that support collaborative curation of the data. This showcases the use and importance of its linked data features for keeping track of information provenance during the whole life cycle of the RDF resource.



  author = {Real, Livy and Chalub, Fabricio and de Paiva, Valeria and Freitas, Claudia and Rademaker, Alexandre},
  title = {Seeing is Correcting: curating lexical resources using
                    social interfaces},
  booktitle = {Proceedings of the 53rd Annual Meeting of the
                    Association for Computational Linguistics and the
                    7th International Joint Conference on Natural
                    Language Processing of the Asian Federation of Natural
                    Language Processing - Fourth Workshop on Linked Data
                    in Linguistics: Resources and Applications (LDL
  year = {2015},
  pdflink1 = {/files/acl-ldl-2015.pdf},
  pdflink2 = {},
  month = jul,
  address = {Beijing, China}