A modern data architecture is required to support the data-driven organization that every enterprise wants to be. Without a solid data architecture – composed of the models, policies, rules, and standards you set for how data is collected, stored, managed, and used – your ability to attain a holistic view of your business, make informed […]
Taking the Chill Out of Selecting the Appropriate Iceberg Data Catalog
Over the past few years, the industry has increasingly recognized the need to adopt a data lakehouse architecture because of its inherent benefits. This approach lowers data infrastructure costs and reduces time-to-insight by consolidating more data workloads into a single source of truth on the organization’s data lake. This is made possible by data lakehouse table […]
Tonic.ai Launches Secure Unstructured Data Lakehouse for Large Language Models
According to a new press release, Tonic.ai has launched Tonic Textual, a secure data lakehouse for large language models (LLMs), designed to address integration and privacy challenges in leveraging unstructured data for generative AI. This platform aims to streamline the preparation of unstructured data for retrieval-augmented generation (RAG) systems and LLM fine-tuning, tackling significant obstacles in […]
Data Lakehouse Architecture 101
A data lakehouse, in the simplest terms, combines the best functionalities of a data lake and a data warehouse. It offers a unified platform for seamlessly integrating both structured and unstructured data, providing businesses agility, scalability, and flexibility in their data analytics processes. Unlike traditional data warehouses that rely on rigid schemas for organizing and […]
Cloudera Introduces Next Phase of Open Data Lakehouse on Private Cloud
According to a new press release, Cloudera, a data company specializing in trusted enterprise AI, has unveiled the next phase of its open data lakehouse on private cloud, aiming to revolutionize on-premises data experiences for scalable analytics and AI. With recent enhancements, Cloudera has become the sole provider offering an open data lakehouse with Apache Iceberg […]
Databricks Announces Data Intelligence Platform for Communications
According to a new press release, Databricks has announced the launch of the Data Intelligence Platform for Communications, a comprehensive data and AI platform designed specifically for telecommunications carriers and network service providers. This platform offers Communication Service Providers (CSPs) a unified foundation for data and AI, allowing them to gain insights into networks, operations, […]
Dremio Partners with Carahsoft to Help Public Sector Organizations Harness Data Analytics
According to a new press release, Dremio and Carahsoft Technology Corp. have officially partnered, with Carahsoft serving as Dremio’s Master Government Aggregator®. This collaboration aims to offer Dremio’s comprehensive cloud and software portfolio to various public sector entities, including government, defense, intelligence, and education, through Carahsoft’s reseller partners and multiple government contracts such as NASA […]
Distributed Data Architecture Patterns Explained
Distributed data architectures, which use multiple platforms and processes to pursue data-driven goals, continue to generate increased interest. As William McKnight, president of McKnight Consulting Group (MCG) and a well-known data architecture advisor, says, “Seldom a database vendor does not interact with concepts around distributed data architectures: the data lakehouse, data mesh, data fabric, and data cloud, and I am […]
Data Warehouse vs. Data Lakehouse
The phrase “data warehouse vs. data lakehouse” offers an exciting topic for ongoing debate in the global Data Management world. While businesses have relied on traditional data warehouses for storing structured and semi-structured data for years, the more recent technological solution of the data lakehouse is growing in importance because of its unique ability to provide structure to raw data. […]
The Semantic Lakehouse Explained
Data lakes and semantic layers have been around for a long time – each living in its own walled garden, tightly coupled to fairly narrow use cases. As data and analytics infrastructure migrates to the cloud, many are questioning how these foundational technology components fit in the modern data and analytics stack. In this article, […]