What is a Data Lake?

A data lake is an environment where a vast amount of data, of various types and structures, can be ingested, stored, assessed, and analyzed. Data lakes serve many purposes, including:

  • An environment for data scientists to mine and analyze data.
  • A central storage area for raw data, with minimal, if any, transformation.
  • Alternate storage for detailed historical data from a data warehouse.
  • An online archive for records.
  • An environment to ingest streaming data with automated pattern identification.

Other Definitions of a Data Lake Include:

Businesses Use Data Lakes to:

  • Find and act on business opportunities.
  • Stimulate innovation.
  • Deal with complex and diversified data.
  • Meet business demands of more insights, agility, and flexibility.
  • Store different types of data in their original formats until they need to be structured and analyzed.
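
The last point, keeping data in its original format and structuring it only when it is needed, can be sketched in a few lines of Python. This is a minimal illustration, not a reference design: the `raw/<source>/` folder layout, the function names `ingest_raw` and `read_structured`, and the sample click and order records are all assumptions made up for the example, and a local temporary directory stands in for the lake's storage.

```python
import csv
import io
import json
import tempfile
from pathlib import Path


def ingest_raw(lake_root: Path, source: str, name: str, payload: bytes) -> Path:
    """Write the payload unchanged under raw/<source>/ (hypothetical layout)."""
    target_dir = lake_root / "raw" / source
    target_dir.mkdir(parents=True, exist_ok=True)
    target = target_dir / name
    target.write_bytes(payload)  # no parsing and no schema at ingest time
    return target


def read_structured(path: Path) -> list[dict]:
    """Apply structure only at read time, chosen by the file extension."""
    text = path.read_text(encoding="utf-8")
    if path.suffix == ".jsonl":
        # One JSON object per line
        return [json.loads(line) for line in text.splitlines() if line.strip()]
    if path.suffix == ".csv":
        # The header row supplies the field names; values stay strings
        return list(csv.DictReader(io.StringIO(text)))
    raise ValueError(f"no reader registered for {path.suffix}")


def demo() -> list[dict]:
    """Ingest two differently shaped files, then structure them on read."""
    with tempfile.TemporaryDirectory() as tmp:
        lake = Path(tmp)
        clicks = ingest_raw(
            lake, "web", "clicks.jsonl",
            b'{"user": "a", "page": "/home"}\n{"user": "b", "page": "/pricing"}\n',
        )
        orders = ingest_raw(
            lake, "shop", "orders.csv",
            b"order_id,amount\n1,19.99\n2,5.00\n",
        )
        return read_structured(clicks) + read_structured(orders)


if __name__ == "__main__":
    for row in demo():
        print(row)
```

Ingestion never inspects the bytes, so new sources can land in the lake before anyone has decided how they will be analyzed. The cost of that choice is that every reader has to know, or discover, how to interpret each format.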

Image used under license from Shutterstock.com

 
