by Angela Guess
Darryl Taft reports that “The Apache Software Foundation (ASF) announced that its Apache Sqoop big data tool has graduated from the Apache Incubator to become a top-level project (TLP). Sqoop is designed to efficiently transfer bulk data between Apache Hadoop and structured data stores such as relational databases. Apache Sqoop allows the import of data from external data stores and enterprise data warehouses into a Hadoop Distributed File System or related systems like Apache Hive and HBase.”
Taft continues, “ASF officials said Sqoop builds on the Hadoop infrastructure and parallelizes data transfer for fast performance and best use of system and network resources. In addition, Sqoop allows fast copying of data from external systems to Hadoop to make data analysis more efficient, and mitigates the risk of excessive load to external systems. ‘Connectivity to other databases and warehouses is a critical component for the evolution of Hadoop as an enterprise solution, and that’s where Sqoop plays a very important role’ said Deepak Reddy, Hadoop Manager at Coupons.com, in a statement.”
photo credit: Apache

















