2020
|
Kapidakis, S. Consistency and Interoperability on Dublin Core Element Values in Collections Harvested using the Open Archive Initiative Protocol for Metadata Harvesting Conference Proceedings of the 12th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2020), vol. 2, 2020, ISBN: 978-989-758-474-9. @conference{Kapidakis2020,
title = {Consistency and Interoperability on Dublin Core Element Values in Collections Harvested using the Open Archive Initiative Protocol for Metadata Harvesting},
author = {Kapidakis, S.},
url = {https://www.scitepress.org/Papers/2020/101120/101120.pdf},
isbn = {978-989-758-474-9},
year = {2020},
date = {2020-11-04},
booktitle = {Proceedings of the 12th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2020)},
journal = {Proceedings of the 12th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2020},
volume = {2},
pages = {181-188},
abstract = {When resource descriptions use the exact same value for an entity, this value is easier parsed, identified and utilized by automatic procedures. The use of controlled values, even when it is common and very useful, it is usually not enforced during the data entry. In this paper we study the use of the controlled values in many harvested collections and we study all Dublin Core elements and also their similarity. We mainly focus in the element language, as there is a lot of standardization on how to denote language values, followed by other elements that normally use controlled values. We discovered values that are repeated many times and in many collections and many more values that are used only once! The lack of coordination among collections during their creation results to many variations for each value, even when the value is used consistently and many times inside a collection. The study uses dendrogram to reveal the current usage of the Dublin Core elements inside and among active collections by clustering the collections with similar values and helps adopting better guidelines, designing better tools and improving the effectiveness of the collections.},
keywords = {Controlled Terms, controlled vocabularies, Dendrogram, Dublin core, harvesting, Language, Linked Open Data, Metadata, OAI-PMH, Repeated Values},
pubstate = {published},
tppubtype = {conference}
}
When resource descriptions use the exact same value for an entity, this value is easier parsed, identified and utilized by automatic procedures. The use of controlled values, even when it is common and very useful, it is usually not enforced during the data entry. In this paper we study the use of the controlled values in many harvested collections and we study all Dublin Core elements and also their similarity. We mainly focus in the element language, as there is a lot of standardization on how to denote language values, followed by other elements that normally use controlled values. We discovered values that are repeated many times and in many collections and many more values that are used only once! The lack of coordination among collections during their creation results to many variations for each value, even when the value is used consistently and many times inside a collection. The study uses dendrogram to reveal the current usage of the Dublin Core elements inside and among active collections by clustering the collections with similar values and helps adopting better guidelines, designing better tools and improving the effectiveness of the collections. |
2014
|
Konstantinou, N; Kouis, D; Mitrou, N Incremental export of relational database contents into RDF graphs Proceedings Article In: ACM International Conference Proceeding Series, 2014, ISBN: 9781450325387. @inproceedings{Konstantinou2014,
title = {Incremental export of relational database contents into RDF graphs},
author = {N Konstantinou and D Kouis and N Mitrou},
doi = {10.1145/2611040.2611082},
isbn = {9781450325387},
year = {2014},
date = {2014-01-01},
booktitle = {ACM International Conference Proceeding Series},
abstract = {In addition to tools offering RDF views over databases, a variety of tools exist that allow exporting database contents into RDF graphs; tools proven that in many cases demonstrate better performance than the former. However, in cases when database contents are exported into RDF, it is not always optimal or even necessary to dump the whole database contents every time. In this paper, the problem of incremental generation and storage of the resulting RDF graph is investigated. An implementation of the R2RML standard is used in order to express mappings that associate tuples from the source database to triples in the resulting RDF graph. Next, a methodology is proposed that enables incremental generation and storage of an RDF graph based on a source relational database, and it is evaluated through a set of performance measurements. Finally, a discussion is presented regarding the authors' most important findings and conclusions. textcopyright 2014 ACM.},
keywords = {Incremental, Linked Open Data, Mapping, R2RML, RDF, Relational Databases},
pubstate = {published},
tppubtype = {inproceedings}
}
In addition to tools offering RDF views over databases, a variety of tools exist that allow exporting database contents into RDF graphs; tools proven that in many cases demonstrate better performance than the former. However, in cases when database contents are exported into RDF, it is not always optimal or even necessary to dump the whole database contents every time. In this paper, the problem of incremental generation and storage of the resulting RDF graph is investigated. An implementation of the R2RML standard is used in order to express mappings that associate tuples from the source database to triples in the resulting RDF graph. Next, a methodology is proposed that enables incremental generation and storage of an RDF graph based on a source relational database, and it is evaluated through a set of performance measurements. Finally, a discussion is presented regarding the authors' most important findings and conclusions. textcopyright 2014 ACM. |