Last week news came from SindiceTech about the availability of its SindiceTech Freebase Distribution for the cloud (see our story here). SindiceTech has finalized its separation from the university setting in which it incubated, the former DERI institute, now a part of the Insight Center for Data Analytics, and now is re-launching its activities, with more new solutions and capabilities on the way.
“The first thing was to launch the Knowledge Graph distribution in the cloud,” says CEO Giovanni Tummarello. “The Freebase distribution showcases how it is possible to quickly have a really large Knowledge Graph in one’s own private cloud space.” The distribution comes instrumented with some of the tools SindiceTech has developed to help users both understand and make use of the data, he says, noting that “the idea of the Knowledge Graph is to have a data integration space that makes it very simple to add new information, but all that power is at risk of being lost without the tools to understand what is in the Knowledge Graph.”
Included in the first round of the distribution’s tools for composing queries and understanding the data as a whole are the Data Types Explorer (in both tabular and graph versions), and the Assisted SPARQL Query Editor. The next releases will increase the number of tools and provide updated data. “Among the tools expected is an advanced Knowledge Graph entity search system based on our newly released SIREn search system,” he says.
SIREn’s Next Steps
In fact, the release of the SIREn semistructured search engine into a full and separate website and community is also part of SindiceTech’s recent relaunch. “SIREn version 1.0 download is now available for all,” says Tummarello, having wide applicability to any scenario involving nested or relational data. “It’s quite a mature system we are launching.” While it is not just for the semantic community or RDF users, it is, he says, “the most advanced system to search RDF and knowledge graphs in general, so it’s at the core of our Knowledge Graph solution components.”
The SIREn database, he explains, now has full JSON support; it has become a JSON search system.” Effectively, he says, the roadmap laid down for it is that it has become a NoSQL search engine built atop the Apache Solr enterprise search platform, so that those with investments in the Apache Lucene project “can look at SIREn as a way to get much better results and more features leveraging structure in their documents without changing systems,” he says.
MongoDB users, for example, can add SIREn to the architecture for those apps to gain state-of-the-art free text search and ranking, fast ad-hoc structured querying without the need to set up indexes for specific paths, and data exploration with faceting.
It doesn’t make sense, Tummarello explains, to restrict the market for SIREn only to the semantic web community given that it functions as a generic JSON search system. That said, “those who do understand the Knowledge Graph will find SIREn a core component for them as it’s simply the most advanced solution for searching and recommending on them,” he says. “Our vision is to help enterprises leverage the value of complex relational information. SIREn and Knowledge Graphs are complementary tools.”
The fascinating thing in our mission, he says, is that up to now only thebig web giants could afford to take the time to realize the potential of the Knowledge Graph. “The very fascinating thing is that we now have technologies to deliver that within the reach of [any] smart enterprise,” he says.
PivotBrowser On Tap
Also among the tools SindiceTech is including as part of its relaunch to help companies find value in their data is its relational faceted browser, PivotBrowser. “An example of what SIREn enables is the PivotBrowser, which adds a totally new dimension to realtime data browsing: relations,” says Tummarello.
The video below explains how PivotBrowser enables the extension of faceted browsing to leverage for the first time the knowledge about the interconnections between records for searching.
So, not only can you restrict any entity type by property, as in a normal faceted browser, but you can use the relations among entities to impose restrictions as well. “That is quite a novel paradigm that is applicable anytime one has relational data, be that in RDF or not,” he says. PivotBrowser is available for early adopters now who want to embed it in solutions.
Says Tummarello, “The advantage is leveraging what is sometime the biggest wealth in the information: relations between entities. In a regular faceted browsing interface, this is simply ignored.”