About

BabelNet is both a multilingual encyclopedic dictionary, with lexicographic and encyclopedic coverage of terms, and a semantic network which connects concepts and named entities in a very large network of semantic relations, made up of about 15 million entries, called Babel synsets. Each Babel synset represents a given meaning and contains all the synonyms which express that meaning in a range of different languages. Its evolution, BabelNet live, is a new, continuously growing resource, thanks to being fed with everyday updates from all the sources it is composed of, including Wikipedia, Wiktionary, users' input, etc.

BabelNet infographic


BabelNet live covers 284 languages and is obtained from the automatic integration of:
  • WordNet, a popular computational lexicon of English (version 3.0).
  • Open Multilingual WordNet, a collection of wordnets available in different languages (downloaded in January 2017).
  • Wikipedia, the largest collaborative multilingual Web encyclopedia (January 2017).
  • OmegaWiki, a large collaborative multilingual dictionary (January 2017 dump).
  • Wiktionary, a collaborative project to produce a free-content multilingual dictionary (February 2017).
  • Wikidata, a free knowledge base that can be read and edited by humans and machines alike. (January 2017)
  • Wikiquote, a free online compendium of sourced quotations from notable people and creative works in every language (March 2015 dump).
  • VerbNet, a Class-Based Verb Lexicon (version 3.2).
  • Microsoft Terminology, a collection of terminologies that can be used to develop localized versions of applications (July 2015 dumps).
  • GeoNames, a free geographical database covering all countries and containing over eight million placenames (April 2015 dump).
  • WoNeF, an improved, expanded and evaluated automatic French translation of WordNet (high precision version, downloaded in February 2017).
  • ItalWordNet, a lexical-semantic database developed in the framework of two different research projects: EuroWordNet and SI-TAL (downloaded in February 2017).
  • ImageNet, an image database organized according to the WordNet hierarchy (2011 release).
  • FrameNet, a lexical database of English that is both human- and machine-readable (version 1.6).
  • WN-Map, automatically generated mappings among WordNet versions (2007 release).
  • Korean WordNet, a resource that has been constructed with the use of CoreNet and dictionaries of a variety of domains (downloaded in January 2017).
  • GAWN WordNet, a database consisting of Irish words and the semantic relationships among them (downloaded in January 2017).
Additionally, it contains translations obtained from sense-annotated sentences. The correctness of the WordNet-Wikipedia mapping in BabelNet 3.6 has been estimated at around 91% on open-text words.

BabelNet is fully integrated with:
  • Babelfy: a state-of-the-art multilingual disambiguation and entity linking system.
  • Wikipedia Bitaxonomy: a state-of-the-art taxonomy of Wikipedia pages aligned to a taxonomy of Wikipedia categories.
BabelNet is provided as a stand-alone resource with its Java API, a SPARQL endpoint and a Linked Data interface as part of the Linguistic Linked Open Data Cloud (LLOD Cloud) cloud.

This website provides:
 


Photo of the META-NET 2015 prize ceremony in Riga, Latvia.


Video of the META prize 2015 awarded to Roberto Navigli and his team for BabelNet.


eLex invited talk by prof. Roberto Navigli on BabelNet, Babelfy and Beyond.

Credits

Concept: Roberto Navigli
Web application design: OPPOSITE™, Luca Matteis
Programming: Francesco Cecconi, Francesco Elia, Tiziano Flati, Roberto Navigli, Simone Ponzetto, Daniele Vannella
BabelNet domains: José Camacho Collados
BabelNet infobox relations: Fabio Prisca
Logo: Lorenzo Ceccotti