Tim Gasper of TechCrunch has created a list of five open source Big Data technologies that are making waves. He writes, "Did you know that there are over 250K viable open source technologies on the market today? Innovation is all around us… We have a lot of…choices, to say the least. What’s on our own radar, and what’s coming down the pipe for Fortune 2000 companies? What new projects are the most viable candidates for production-grade usage? Which deserve your undivided attention? We did all the research and testing so you don’t have to. Let’s look at five new technologies that are shaking things up in Big Data. Here is the newest class of tools that you can’t afford to overlook, coming soon to an enterprise near you."
He continues, "Storm and Kafka are the future of stream processing, and they are already in use at a number of high-profile companies including Groupon, Alibaba, and The Weather Channel. Born inside of Twitter, Storm is a “distributed real-time computation system”. Storm does for real-time processing what Hadoop did for batch processing. Kafka for its part is a messaging system developed at LinkedIn to serve as the foundation for their activity stream and the data processing pipeline behind it. When paired together, you get the stream, you get it in-real time, and you get it at linear scale."
Image: Courtesy Flickr/ Wonderlane