Essential Open Source Big Data Tools

Click to learn more about author Paul Bates. The analysis of Big Data is a phenomenon that has gained considerable momentum in the past decade. The transition into the information age has made the analysis and visualization of Big Data vital to the success of any business. Data visualization tools enable researchers to gain insight into […]

Changing the Dynamics of Big Data Analytics

The tail end of 2017 saw the official delivery of Hadoop 3.0 from the Apache Software Foundation. With it came HDFS erasure coding, which enables a significant reduction in storage overhead and its costs. Opting to use erasure coding instead of three-way replication – in which three copies of each block of data must be […]

Five Consistent Principles for the Changing World of Data Integration

Click to learn more about author Kevin Petrie. While few technologies sit still, Data Architecture is especially dynamic. Open source innovators and vendors continue to create new Data Lake, streaming and Cloud options for today’s enterprise architects and CIOs. With business requirements also evolving, placing solid strategic bets has rarely been more difficult. It’s therefore no […]

Taking the Pain Out of HDFS Upgrades

by Angela Guess BlueData’s Chief Architect Tom Phelan recently wrote an article entitled “HDFS Upgrades Are Painful. But they Don’t Have to Be.” Phelan begins, “It’s hard enough to gather all the data that an enterprise needs for a Hadoop deployment; it shouldn’t be hard to manage it as well. But if you follow the […]

Hadoop Overview: A Big Data Toolkit

Big Data isn’t new. Forbes traces the origins back to the “information explosion” concept first identified in 1941. The challenge has been to develop practical methods for dealing with the 3Vs: Volume, Variety, and Velocity. Without tools to support and simplify the manipulation and analysis of large data sets, the ability to use that data […]

Semantic Web Job: Big Data Architect

New York’s Tektree Systems is in need of a Big Data Architect. The job description states, “Hadoop Data Architect with both hands-on Big Data and relational experience and deep knowledge of physical data modeling, data organization and storage technology, experienced with high volumes and able to architect and implement multi-tier solutions using the right technology […]

An Introduction to Apache HBase

by Angela Guess Hovhannes Avoyan is continuing his series of articles on the best NoSQL databases. He recently took a look at Apache HBase, “originally created for use with Apache’s Hadoop, a software framework that supports data-intensive distributed applications under a free license.” Avoyan writes, “HBase is really a clone (or a very close relative) […]

We use technologies such as cookies to understand how you use our site and to provide a better user experience. This includes personalizing content, using analytics and improving site operations. We may share your information about your use of our site with third parties in accordance with our Privacy Policy. You can change your cookie settings as described here at any time, but parts of our site may not function correctly without them. By continuing to use our site, you agree that we can save cookies on your device, unless you have disabled cookies.
I Accept