If you’re interested in Linked Data, no doubt you’re planning to listen in on next week’s Semantic Web Blog webinar, Getting Started With The Linked Data Platform (register here), featuring Arnaud Le Hors, Linked Data Standards Lead at IBM and chair of the W3C Linked Data Platform WG and the OASIS OSLC Core TC. It also may be on your agenda to attend this month’s Semantic Web Technology & Business Conference, where speakers including Le Hors, Manu Sporny, Sandro Hawke, and others will be presenting Linked Data-focused sessions.
In the meantime, though, you might enjoy reviewing the results of the LOD2 Project, the European Commission co-funded effort whose four-year run, begun in 2010, aimed at advancing RDF data management; extracting, creating and enriching structured RDF data; interlinking data from different sources; and authoring, exploring and visualizing Linked Data. To that end, why not take a stroll through the recently released Linked Open Data – Creating Knowledge Out of Interlinked Data, edited by LOD2 Project participants Soren Auer of the Institut für Informatik III Rheinische Friedrich-Wilhelms-Universität; Volha Bryl of the University of Mannheim, and Sebastian Tramp of the University of Leipzig?
Nearly a dozen Linked Open Data research groups, companies and service providers across Europe and South Korea were involved in the project. The downloadable ebook provides readers with an overview of how the project supported its goals to develop enterprise-ready tools and methodologies for exposing and managing very large amounts of structured information on the Web; algorithms based on machine learning for automatically interlinking and fusing data from the Web; adaptive tools for searching, browsing, and authoring of Linked Data; and a testbed and bootstrap network of multi-domain, multi-lingual ontologies from sources such as Wikipedia and OpenStreetMap.
The LOD2 Stack is the main output of the effort, comprised of tools – many of them open source – for managing the Linked Data lifecycle. “The major components of the LOD2 Stack are open-source in order to facilitate wide deployment and scale to knowledge bases with billions of triples and large numbers of concurrent users,” according to the authors. “We aim at ensuring that the stack fulfills a broad set of user requirements and thus facilitates the transition to a Web of Data.”
The Stack counts among its many components solutions for storage and querying such as OpenLink Software’s Virtuoso as the basic RDF knowledge store and SparQLed assisted SPARQL editor; PoolParty thesaurus management and SKOS editor for authoring and manual revisions; OntoWiki for supporting the visual presentation of a knowledge base as an information map; as well as the LIMES link discovery framework for the web of data and Silk Linking Framework to set RDF links from their data sources to other data sources on the web.
The ebook notes that the current version of the stack has a high number of components geared towards the extraction, storage and querying parts: “This can be explained by the large number of data formats that need to be transformed to RDF. Every component tackles a specific subset of these formats. Most of the other components contain a small selection of specialized tools for a specific task.”Future plans include a stack configurator, so that users can create their own personalized versions of the stack.
The project’s end will also see the stack rechristened as the Linked Data Stack and maintained by other projects, such as GeoKnow, a European Commission effort to deal with linking geospatial data from diverse information systems and exploting such data via the web, and DIACHRON, which looks to support archiving, provenance, annotation, citation and data quality in the context of Linked Open Data and modern databases.