Loading...
You are here:  Home  >  'microformat'
Latest

Common Crawl To Add New Data In Amazon Web Services Bucket

By   /  March 13, 2012  /  Big Data News, Articles, & Education, Data Blogs | Information From Enterprise Leaders  /  No Comments

The Common Crawl Foundation is on the verge of adding to its Amazon Web Services (AWS) Public Data Set of openly and freely accessible web crawl data. It was back in January that Common Crawl announced the debut of its corpus on AWS (see our story here). Now, a billion new web sites are in […]

Read More →
Latest

Is the Publishing Industry Ready to Embrace Change? Find Out At The Semantic Web Media Summit

By   /  September 6, 2011  /  Data Blogs | Information From Enterprise Leaders  /  No Comments

The publishing industry is an interesting beast: Its front-end moves rapidly to get content out to readers, but its back-end processes to deliver that information are so tightly packed that there’s not a lot of drive to make sweeping changes in those technologies or processes. “They have to be on this schedule, so traditionally they […]

Read More →
Latest

Infochimps Adds Geo APIs and Takes A Shine to Schema.Org, Too

By   /  August 30, 2011  /  Data Education, Smart Data News, Articles, & Education  /  No Comments

On the way from Infochimps: Its Geo APIs that bring to developers data from open sources such as GeoNames and The National Climate Data Center, as well as licensed sources such as Locationary and Foursquare.  Now for the twist: The data marketplace is semantifying the geo data with a schema.org approach. This is just the first […]

Read More →
Latest

RDFa Fading Away in EPUB3 Standard

By   /  August 22, 2011  /  Data Articles | Data Science, Business Intelligence, & More  /  No Comments

To follow up our story last week about the upcoming EPUB3 standard, for which International Digital Publishing Forum membership comments were due in by today, it appears that the updates “are moving the spec even further from any apparent support/use of RDFa.” That’s according to Eric Freese, solutions architect at digital publishing solutions vendor Aptara and […]

Read More →
Latest

Sindice Puts The Web of Data At Your Disposal

By   /  June 16, 2011  /  Data Education, Smart Data News, Articles, & Education  /  No Comments

Sindice Ltd. launched as a startup company this week, complete with a publicly available beta SPARQL endpoint to its indexed and live-updated dataset of some 12 billion triples. Next week will see Sindice –which began as a joint academic research project among DERI, the Fondazione Bruno Kessler and OpenLink Software to collect, search, query and build […]

Read More →
Latest

Guided Tour of The Semantic Web At SemTech 2011

By   /  June 7, 2011  /  Big Data Conferences, Big Data News, Articles, & Education, Data Education, Smart Data Conferences, Smart Data News, Articles, & Education  /  No Comments

 An informal raise-your-hand survey of attendees at the SemTech conference in San Francisco this week revealed that a good number of attendees were here for the first time. And one of the early morning tutorials Monday provided a perfect opportunity for many of them to explore the Semantic Web in greater depth, with the W3C’s […]

Read More →
Latest

Extra, Extra: rNews Seeks To Be Semantic Standard For Online News Publishers

By   /  April 15, 2011  /  Uncategorized  /  No Comments

RAID Pic2

News publishing outlets stand to benefit from adopting Semantic Web technologies, and now there’s a lightweight way for them to begin moving in that direction, too. The International Press Telecommunications Council (IPTC) recently introduced rNews 0.1, a set of specifications and best practices for using RDFa to embed news-specific metadata (headlines, bylines, publication dates and […]

Read More →
Latest

RDFa is Sweeping the Web

By   /  February 23, 2011  /  Uncategorized  /  No Comments

A new article looks at the growing reach of RDFa and microformats on the web. The article includes a chart with information on the deployment of RDFa and other microformats across the web “based on an analysis of 12 billion web pages indexed by Yahoo! Search.” The analysis, which spans the last three years, is quite enlightening.

Read More →