Engineering school and research center in digital sciences

What's up LOD cloud? Observing the state of linked open data cloud metadata

Assaf, Ahmad; Troncy, Raphaël; Senart, Aline

LDQ 2015, 2nd workshop on Linked Data Quality, Main conference ESWC 2015, June 1st 2015, Portoroz, Slovenia / Also published in LNCS, Volume 9341/2015

Linked Open Data (LOD) has emerged as one of the largest collections of interlinked datasets on the web. To benefit from this mine of data, one needs access to descriptive information about each dataset (its metadata). However, the heterogeneous nature of the data sources directly affects data quality, as these sources often contain inconsistent, misinterpreted, and incomplete metadata. Given the significant variation in size, in the languages used, and in the freshness of the data, finding useful datasets without prior knowledge is increasingly complicated. We have developed Roomba, a tool that validates, corrects, and generates dataset metadata. In this paper, we present the results of running this tool on the parts of the LOD cloud accessible via the datahub.io API. The results demonstrate that the general state of the datasets needs more attention, as most of them suffer from poor-quality metadata and lack some informative metrics that are needed to facilitate dataset search. We also show that the automatic corrections made by Roomba increase the overall quality of the datasets' metadata, and we highlight the need for manual effort to correct some important missing information.


Title: What's up LOD cloud? Observing the state of linked open data cloud metadata
Keywords: Dataset Profile, Metadata, Data Quality, Data Portal
Type: Conference
Language: English
City: Portoroz
Country: Slovenia
Date:
Department: Data Science
Eurecom ref: 4597
Copyright: © Springer. Personal use of this material is permitted. The definitive version of this paper was published in LDQ 2015, 2nd workshop on Linked Data Quality, Main conference ESWC 2015, June 1st 2015, Portoroz, Slovenia / Also published in LNCS, Volume 9341/2015 and is available at: http://dx.doi.org/10.1007/978-3-319-25639-9_40
Bibtex: @inproceedings{EURECOM+4597, doi = {http://dx.doi.org/10.1007/978-3-319-25639-9_40}, year = {2015}, title = {{W}hat's up {LOD} cloud? {O}bserving the state of linked open data cloud metadata}, author = {{A}ssaf, {A}hmad and {T}roncy, {R}apha{\"e}l and {S}enart, {A}line }, booktitle = {{LDQ} 2015, 2nd workshop on {L}inked {D}ata {Q}uality, {M}ain conference {ESWC} 2015, {J}une 1st 2015, {P}ortoroz, {S}lovenia / {A}lso published in {LNCS}, {V}olume 9341/2015 }, address = {{P}ortoroz, {SLOV}{\'{E}}{NIE}}, month = {06}, url = {http://www.eurecom.fr/publication/4597} }