Data pipelines are a set of processes that move data from one place to another, typically from the source of data to a storage system. These processes involve data extraction from various sources, transformation to fit business or technical needs, and loading into a final destination for analysis or reporting. The goal is to automate […]
Case Study: Cox Automotive Solves Data Drift and ETL Challenges
According to Pat Patterson, Community Champion at StreamSets, “data drift” is such a problem now that “only about one fifth of a data analyst’s time is actually spent analyzing the data.” The remainder is spent “wrangling it into shape and getting it from where it is to the actual analysis platform.” Speaking at the Enterprise […]
Data Lakes: What They are and How to Use Them
Click to learn more about author Jaya Shankar Byrraju. For most companies, having data means having access to wealth. And the key to fully leveraging the wealth that data represents lies in how effectively companies harness, manage, parse, and interpret it. But first, the data must exist somewhere. Enter data lakes. These are central repositories […]
Collibra Introduces Enhanced Machine Learning Capabilities in New Release
A new press release reports, “Collibra, the Data Intelligence company, today announced the release of a platform-wide upgrade to improve access to critical data and expedite time to insight. The new release features machine learning enhancements to Collibra Catalog and marks the availability of Collibra Privacy & Risk, a sustainable approach to compliance with modules […]
Intuit Announces Acquisition of Origami Logic
A recent press release states, “Intuit Inc. makers of TurboTax, QuickBooks, Mint and Turbo, announced today it has entered into an agreement to acquire Origami Logic, the makers of an advanced data integration, ingestion, and analytics platform. Based in Silicon Valley, Origami Logic developed technology to analyze and gain insights from multiple data sets. Intuit […]
Make the Most of Graph Databases Through Interactive Analytics
New Big Data systems and advanced technologies are revolutionizing how businesses analyze their data assets and discover new value and insights across their business practices. Barry Zane, Senior Vice President and John Rueter, Vice President of Marketing at Cambridge Semantics, both recently sat down with DATAVERSITY® to discuss how companies are implementing new technologies such […]
Weaving Your Own Big Data Fabric
Click to learn more about author Ravi Shankar. With Big Data, anyone with a modest budget can store, manage, and process vast amounts of data. The problem is, many companies are storing data from different systems in different formats, creating Big Data silos that results in large datasets that need to be integrated manually. Aside from […]
Reduce Data Lake Ingestion Time by 75% with Ingestion Factory
by Angela Guess According to a new press release, “It is estimated that data preparation eats up as much as 80% of data analysts’ time, leaving less bandwidth for actual analytics and significantly reducing a data lake’s return on investment. To dramatically speed up data ingestion and data preparation in the data lake, Zaloni offers […]