ABSTRACT
Tools that map non-RDF data into RDF typically rely on the technology native to the data source when data must be manipulated during the mapping. Depending on the data format, this manipulation can be performed using the underlying technology, such as an RDBMS for relational databases or XPath for XML. For CSV/tabular data there is no such underlying technology; instead, the source data is transformed into another format, or pre- and post-processing techniques are used. In this paper, we present a comparison framework for the state of the art in converting CSV/tabular data into RDF, in which transformation functions are a key feature evaluated. We argue that existing approaches to transformation functions in such tools are complex, both in the number of steps and in the tools involved, and are therefore not as traceable and transparent as one would like. We tackle these problems by defining a more generic, usable and amenable method for incorporating functions into uplift mapping languages, called FunUL. As a proof of concept, we present an implementation of our method. Moreover, using a real-world Digital Humanities case study, we compare our approach with the other approaches we have identified for including transformation functions in the mapping of CSV/tabular data.
Index Terms
- FunUL: a method to incorporate functions into uplift mapping languages