Click to learn more about author Mathias Golombek. The Challenge Michael Stonebraker, winner of the Turing Award 2014, has been quoted as saying: “The change will come when business analysts who work with SQL on large amounts of data give way to data scientists, which will involve more sophisticated analysis, predictive modeling, regressions and Bayesian […]
Demystifying Data Architecture
Ludwig Mies van der Rohe said, “Architecture starts when you carefully put two bricks together”—and Data Architecture begins upon creating, storing, and putting two or more characters together, be they sets of records, emails, pictures, audio, video. This resonated well with initial thoughts about Data Architecture, as it is comprised of things, the functionality of […]
What Is ACID?
ACID properties characterize RDBMS (Relational Database Management System) database processing or a data warehouse. Originally coined in the early 1980s, according to DAMA DMBoK, the ACID philosophy consists of requirements for generating and maintaining reliable database transactions. ACID provides consistency before, during and after transactions through five properties: Atomic: Each task in a transaction succeeds […]
Dos and Don’ts of Database Management for Online Businesses in 2021
Click to learn more about author Hiral Rana. Data has for years acted as the building blocks upon which all urban civilizations now stand. This proved even truer during the COVID-19 pandemic, when unimagined chaos ensued around the globe, sending multiple businesses into closure. On the flip side, the epidemic also presented a silver lining […]
Messy Data Shouldn’t Stop Machine Learning in Its Tracks
Click to learn more about author Jon Reilly. Businesses are creating data at an incredible pace that will only accelerate. In fact, data storage company Seagate predicts it will pass a yearly rate of “163 zettabytes (ZB) by 2025. That’s ten times the amount of data produced in 2017.” Moore’s Law – the principle that […]
Why 2021 Will Be a Big Year for Apache Cassandra (and Its Users)
Click to learn more about author Ben Bromhead. The upcoming GA release of Apache Cassandra 4.0 is set to be the most stable “.0” release of the project (or any distributed database) ever. The effort across the entire community has been monumental and everyone involved with this release will deserve not only a well-earned lap […]
What Is BASE?
BASE describes database processing germane to a NoSQL database, such as a data lake. An increasing number of data volumes and variability, according to DAMA DMBoK, spurred the BASE philosophy. Its popularity rose in 2008. BASE provides less assurance than ACID, but it scales very well and reacts well to rapid data changes. BASE construction […]
Which Data Security Pitfalls Lurk in the Hybrid Cloud?
Click to learn more about author Bernard Brode. After a period of unrivalled innovation when it comes to data storage and sharing, it seems that – for most companies, at least – hybrid models are about to reign supreme. In layman’s terms, a hybrid model is an efficient combination of existing IT systems and public […]
Will They Blend? Theobald Meets HANA
Click to learn more about author Maarit Widmann. In the “Will They Blend?” blog series, we experiment with the most interesting blends of data and tools. Whether it’s mixing traditional sources with modern data lakes, open-source devops on the cloud with protected internal legacy tools, SQL with noSQL, web-wisdom-of-the-crowd with in-house handwritten notes, or IoT […]
Becoming a Prized Data Warehouse and Data Integration Tester
Click to learn more about author Wayne Yaddow. Data warehouse (DW) testers with data integration QA skills are in demand. Data warehouse disciplines and architectures are well established and often discussed in the press, books, and conferences. They have become a standard necessity for most modern organizations. Each business often uses one or more data […]