by Angela Guess
A new article reports that HBase “has the edge in data management for next generation Internet and cloud computing users, claim researchers at the University of Texas – Pan American (UTPA) in Edinburg. In tests designed to find the best storage technologies for Web 3.0 — also known as the Semantic Web — Apache’s Hadoop database, HBase, out-performed MySQL Cluster, the UTPA team discovered in a classic confrontation between relational and non-relational databases. With their own algorithms to adapt the two database systems, the team found that HBase works faster with larger datasets, a major issue since the Semantic Web comprises vast amounts of tags and descriptions known as metadata.”
It continues, “Unlike the current Web 2.0, ‘the Semantic Web is interconnected metadata that facilitates better information search, discovery and integration,’ says study co-author and UTPA assistant computer science professor Artem Chebotko, Ph.D. ‘Our work, to our best knowledge, is the first to empirically compare HBase and MySQL Cluster for metadata management.’ The results make sense to Retrevo chief scientist Aditya Vailaya, Ph.D., whose Sunnyvale, CA-based firm uses Semantic Web technology to compare thousands of retail electronic devices on price, performance and myriad other metadata factors. Query performance is key with the Semantic Web’s large data stores, Vailaya says. But, most databases aren’t equipped with query-improving algorithms, and HBase lends itself to writing them.”
photo credit: HBase

















