A Brief History of the Hadoop Ecosystem

In 2002, internet researchers wanted a better search engine, preferably an open-source one. Doug Cutting and Mike Cafarella set out to build it, and they called their project “Nutch.” Hadoop was originally designed as part of the Nutch infrastructure and was presented in 2005. The […]

StreamSets Launches StreamSets Transformer

A recent press release states, “StreamSets, Inc., provider of the industry’s first DataOps platform for modern data integration, released today StreamSets® Transformer, a simple-to-use, drag-and-drop UI tool to create native Apache Spark applications. Designed for a wide range of users — even those without specialized skills — StreamSets Transformer enables the creation of pipelines for […]

Ten Myths About Data Science

By Daniel Jebaraj. Data Science is now being used as a competitive weapon. As with other technologies and processes that can transform the way companies operate, there’s a lot of contradictory information about it that’s causing considerable confusion. Most of today’s business leaders have heard that Data Science can […]

Datawatch Angoss Simplifies Data Science and Analytic Tasks on the Apache Spark Platform

A recent press release reports, “Datawatch Corporation today announced the general availability of Datawatch Angoss KnowledgeSTUDIO for Apache Spark, enabling organizations to act more confidently with their data and rely on consistent, trustful results in making better business decisions. In combination with its market-leading data visualization approach for building, exploring and segmenting data using patented […]

Databricks Introduces Global Partner Program

A recent press release reports, “Databricks, the leader in unified analytics and founded by the original creators of Apache Spark™, today launched the Accelerate Impact Partner Program. Through the program, Consulting and Systems Integrator partners can leverage Databricks’ Unified Analytics expertise, comprehensive training programs, and global team to empower customers. Over the last 12 months, […]

Paxata Announces Apache Spark-Powered Data Preparation Runtime Fabric

According to a new press release, “Paxata, the pioneer in self-service data preparation for analytics, today announced the general availability of its Fall ’18 release, the next major update to the company’s award-winning Adaptive Information Platform. The latest release includes a new Adaptive Workload Management capability, which delivers an elastic resource allocation service on a […]