Fifth International Workshop on Consuming Linked Data (COLD2014)
Monday October 20, 2014
Riva del Garda, Italy
The quantity of published Linked Data continues to increase. However, applications that consume Linked Data are not yet widespread. Reasons may include a lack of suitable methods for a number of open problems, including the seamless integration of Linked Data from multiple sources, dynamic discovery of available data and data sources, provenance and information quality assessment, application development environments, and appropriate end user interfaces. Addressing these issues requires well- founded research, including the development and investigation of concepts that can be applied in systems which consume Linked Data from the Web.
Our main objective is to provide a venue for scientific discourse (including systematic analysis and rigorous evaluation) of concepts, algorithms and approaches for consuming Linked Data.
The workshop will be co-located with the 13th International Semantic Web Conference (ISWC) in Riva del Garda, Italy. Networked communication will be encouraged during the workshop using IRC, microblogging and other services, provided with the official hashtag
#cold2014 to follow the live-stream of the event.
- 2014-08-29: The ISWC2014 program has been published. COLD2014 will take place on Monday October 20.
- 2014-07-05: We have extended the submission deadline by 1 week. Abstracts are due July 7 and papers are due July 14
- 2014-04-22: The workshop has been accepted for ISWC 2014.
- Abstract submission deadline:
July 1, 2014 July 7, 2014, 23.59 Hawaii time
- Paper submission deadline:
July 7, 2014 July 14, 2014, 23.59 Hawaii time
- Acceptance notification: July 30, 2014
- Camera-ready versions of accepted papers: August 20, 2014
- Workshop date: October 20, 2014
- A Drag-and-block Approach for Linked Open Data Exploration (Tuan-Dat Trinh, Ba-Lam Do, Peter Wetz, Amin Anjomshoaa, Elmar Kiesling and Amin Tjoa)
- A Relational Learning Approach for Collective Entity Resolution in the Web of Data (Gustavo De Assis Costa and José Maria Parente de Oliveira)
- Capturing the Currency of DBpedia Descriptions and Get Insight into their Validity (Anisa Rula, Luca Panziera, Matteo Palmonari and Andrea Maurino)
- Checking Licenses Compatibility between Vocabularies and Data (Guido Governatori, Ho-Pun Lam, Antonino Rotolo, Serena Villata, Ghislain Auguste Atemezing and Fabien Gandon)
- How redundant is it? - an empirical analysis on the linked data (Honghan Wu, Boris Villazón-Terrazas, Jeff Z. Pan and Jose Manuel Gomez-Perez)
- Resource Planning for SPARQL Query Execution on Data Sharing Platforms (Stefan Hagedorn, Katja Hose, Kai-Uwe Sattler and Jürgen Umbrich)
- Towards a Linked-Data based Visualization Wizard (Ghislain Auguste Atemezing and Raphaël Troncy)
- Using Linked Data and Web APIs for Automating the Pre-processing of Medical Images (Philipp Gemmeke, Maria Maleshkova, Patrick Philipp, Michael Goetz, Christian Weber, Benedikt Kaempgen, Marco Nolden and Klaus Maier-Hein)
- Walking Linked Data: A graph traversal approach to explain clusters (Ilaria Tiddi, Mathieu D'Aquin and Enrico Motta)
The term Linked Data refers to a set of foundational principles for publishing and interlinking structured data on the Web. After Linked Data was first proposed in 2006, a grass-roots movement, led by the Linking Open Data project, started to publish and to interlink multiple open databases on the Web following the proposed principles. Due to conference workshops, tutorials and general evangelism, an increasing number of data publishers – such as the BBC, Thomson Reuters, The New York Times, the Library of Congress, BestBuy, Getty, the US and UK government – have since adopted this practice. This ongoing effort resulted in bootstrapping the “Web of Linked Data” which, today, comprises of billions of RDF triples and millions of RDF links between datasets. The published datasets now include data about books, movies, music, radio and television programs, reviews, scientific publications, genes, proteins, diseases, medicine and clinical trials, geographic locations, people, statistical and census data, companies, and many more topics besides.
All of these published datasets are openly available on the Web in standardised interoperable formats, which presents novel opportunities for the next generation of Web-based applications: data from different providers can be aggregated, allowing fragmentary information from multiple sources to be integrated so as to achieve a complementary and more complete view. While a few applications, such as the BBC music guide have used Linked Data to significant benefit, the deployment methodology has been to harvest the data of interest from the Web to create a private, disconnected repository for each specific application. Such “harvesting approaches” are typically only feasible for vertical applications tied to specific datasets, incur a high up-front cost, and are insensitive to updates in the original data-sources. New concepts for consuming Linked Data – that do not require up-front harvesting of all sources – are required to lead the Web of Linked Data to its fullest and most general potential. The concepts, patterns, and tools necessary are very different from situations where relevant resource identifiers are known a priori, where queries can be run over complete local repositories, where access to the repository is reliable and cheap, and where relevant data sources are known to be trustworthy.
While recent tutorials provided technical insight and skills on how to consume Linked Data in applications, these tutorials also drew attention to several open issues that make the development of Linked Data based applications a challenging or still impossible task. These open issues include a lack of approaches for seamless integration of Linked Data from multiple sources, for dynamic, on-the-fly discovery of available data, for information quality assessment, for querying and caching dynamic remote sources, and for implementing appropriate end-user interfaces.
These open issues can only be addressed appropriately when they are conceived as research problems that require the development and systematic investigation of novel approaches. The 5th International Workshop on Consuming Linked Data (COLD 2014) aims to provide a platform for the presentation and discussion of such approaches. Our main objective is to attract submissions that present scientific discussion (including systematic evaluation) of broadly-applicable concepts and approaches, as opposed to exposition of features implemented in current Linked Data applications. For practical systems without formalization or evaluation we refer interested participants to other offerings at ISWC, such as the Semantic Web Challenge or the Demo Track. As such, we see our workshop as orthogonal to these events.
- 9:00 - 9:15: Workshop Introduction
- 9:15 - 9:30: How redundant is it? - an empirical analysis on the linked data (Honghan Wu, Boris Villazón-Terrazas, Jeff Z. Pan and Jose Manuel Gomez-Perez)
- 9:30 - 9:45: Capturing the Currency of DBpedia Descriptions and Get Insight into their Validity (Anisa Rula, Luca Panziera, Matteo Palmonari and Andrea Maurino)
- 9:45 - 10:00: Checking Licenses Compatibility between Vocabularies and Data (Guido Governatori, Ho-Pun Lam, Antonino Rotolo, Serena Villata, Ghislain Auguste Atemezing and Fabien Gandon)
- 10:00 - 10:15: Meta Discussion: Provenance, Descriptions, Trust
- 10:15 - 10:30: A Drag-and-block Approach for Linked Open Data Exploration (Tuan-Dat Trinh, Ba-Lam Do, Peter Wetz, Amin Anjomshoaa, Elmar Kiesling and Amin Tjoa)
- 10:30 - 11:00: BREAK
- 11:00 - 11:15: Using Linked Data and Web APIs for Automating the Pre-processing of Medical Images (Philipp Gemmeke, Maria Maleshkova, Patrick Philipp, Michael Goetz, Christian Weber, Benedikt Kaempgen, Marco Nolden and Klaus Maier-Hein)
- 11:15 - 11:30: Towards a Linked-Data based Visualization Wizard (Ghislain Auguste Atemezing and Raphaël Troncy)
- 11:30 - 11:45: Visual Discussion: Interfaces, Interaction, Multimedia
- 14:00 - 15:00: Keynote: Cracks in the foundations of the Web of Data? (Claudio Gutierrez - DCC, Universidad de Chile, Chile)
The Web of Data is today a reality.
More and more data is populating the Web and diverse applications and people are publishing and consuming it.
To build it, tools and techniques from different fields have been used to address its challenges.
As the field is growing, it seems to be a good time to reflect on the foundations of this area, in order to better understand our place and its relationship with neighboring fields.
Claudio Gutierrez is an Associate Professor at the Computer Science Department, Universidad de Chile.
His research experiences lies in the intersection of Databases and the Semantic Web, focusing in data models and query languages, particularly RDF and SPARQL.
He is currently interested in Open Data, Linked Data and data management at Web scale.
He has published extensively in the area and received several best paper awards; this year he received the ACM PODS Test of Time Award for the paper "Foundations of Semantic Web Databases".
He has been involved in both, Database and SW communities, has organized sevaral workshops, and has been on PC committees of ICDT, PODS, CIKM, WWW, KR, ISWC, ESWC, RR; and currently is on the editorial board of the JWS.
- 15:00 - 15:15: Walking Linked Data: A graph traversal approach to explain clusters (Ilaria Tiddi, Mathieu D'Aquin and Enrico Motta)
- 15:15 - 15:30: Web Discussion: Discovery, Nagivation, Protocols, Linked Data
- 15:30 - 16:00: BREAK
- 16:00 - 17:00:
Panel Title: Linked Data Access
Abstract: Hosting, accessing and querying structured data on the Web gives rise to new, previously unseen challenges. This panel will feature a selection of experts to lead group discussion on the sorts of query languages and protocols needed to realise a mature Web of Linked Data.
Panelists: Claudio Gutierrez, Andreas Harth, Sandro Hawke, Axel Polleres, Ruben Verborgh
- 17:00 - 17:30: Closing and Town hall
- 20:00 - ... : Linked Data Gathering at TBD
Topics of Interest
While previous editions of the workshop have attracted a number of submissions that addressed topics related to (RDF and) Linked Data management in general,
with COLD2014 we aim to steer the workshop back towards the aforementioned core goals. To this end, we explicitly seek submissions that address research problems related to at least one of the following two aspects of Linked Data consumption:
In the context of these two aspects of Linked Data consumption, relevant topics for COLD 2014 include but are not limited to:
- Makes use of Linked Data principles, including dereferencing
- Involves direct use of multiple, real-world Linked Datasets
- Live Linked Data (i.e., algorithms and applications that make use of Linked Data at runtime)
- Architectures for consuming Linked Data (e.g., Dataspaces, Cloud, NoSQL)
- Integration of Linked Data sources (e.g., entity resolution, sameas, vocabulary mapping, etc.)
- Handling additional Web data (e.g., Deep Web, APIs, Microdata, JSON, Atom, tables, etc.)
- Web-scale data management (e.g., crawling, indexing, parallel processing, etc.)
- Novel languages for navigating and consuming Linked Data (e.g., nSPARQL, NautiLOD, etc)
- Linked Data summarization, guides and schema learning
- Query processing over multiple Linked Datasets
- Search over the Web of Linked Data
- Auto-discovery of URIs and data
- Caching and replication
- Dataset dynamics
- Reasoning on Linked Data from multiple sources
- Information quality and trustworthiness of Linked Data
- User-interface research for interacting with the Web of Linked Data
We seek novel technical research papers in the context of consuming Linked Data with a length of up to 12 pages.
Paper submissions must be formatted in the style of the
Springer Publications format for Lecture Notes in Computer Science (LNCS).
Please submit your paper via EasyChair at http://www.easychair.org/conferences/?conf=cold2014
Submissions that do not comply with the formatting of LNCS or that exceed the page limit will be rejected without review.
We note that the author list does not need to be anonymized, as we do not have a double-blind review process in place.
Submissions will be peer reviewed by three independent reviewers. Accepted papers have to be presented at the workshop to be published in the proceedings. Proceedings will be published online at CEUR-WS. We also encourage supplementary Web-based material (e.g., XHTML+RDFa versions of papers) to be submitted alongside PDFs. This material will be published on the workshop website.
The workshop proceedings are online as CEUR-WS Vol-1264.
- Cosmin Basca, University of Zurich, Switzerland
- Christian Bizer, University of Mannheim, Germany
- Carlos Buil Aranda, Pontificia Universidad Católica de Chile, Chile
- Gong Cheng, Nanjing University, China
- Oscar Corcho, Universidad Politecnica de Madrid, Spain
- Mathieu d'Aquin, Open University, UK
- Aba-Sah Dadzie, University of Birmingham, UK
- Christina Feilmayr, Johannes Kepler University of Linz, Austria
- Paul Groth, Vrije Universiteit Amsterdam, Netherlands
- Tudor Groza, The University of Queensland, Australia
- Claudio Gutierrez, Universidad de Chile, Chile
- Christophe Guéret, Data Archiving and Networked Services (DANS), Netherlands
- Andreas Harth, Karlsruhe Institute of Technology, Germany
- Sandro Hawke, W3C MIT, US
- Katja Hose, Aalborg University, Denmark
- Marcel Karnstedt, Bell Labs, Ireland
- Markus Luczak-Roesch, University of Southampton, UK
- Giuseppe Pirro, Free University of Bolzano, Italy
- Harald Sack, Hasso-Plattner-Institute for IT Systems Engineering, University of Potsdam, Germany
- Kai-Uwe Sattler, TU Illmenau, Germany
- Bernhard Schandl, Gnowsis.com, Austria
- Monika Solanki, Aston Business School, Aston University, UK
- Thomas Steiner, Université Lyon 1, France
- Raphael Troncy, EURECOM, France
- Ruben Verborgh, Ghent University - iMinds - Multimedia Lab, Belgium
- Boris Villazon-Terrazas, iSOCO, Intelligent Software Components, Spain
- Jun Zhao, University of Oxford, UK
- Antoine Zimmermann, École des Mines de Saint-Étienne, France
For further information about the workshop, please contact the workshops chairs at email@example.com
COLD 2014 is the fifth edition of the Consuming Linked Data workshop series. Previous editions are COLD 2013, COLD 2012, COLD 2011, and COLD 2010.