Click to learn more about author Ashok Sharma. It is ironic to witness the rate of cyberattacks, data breaches, and unauthorized use of personal data growing directly proportional to laws being established to regulate the collection, use, retention, disclosure, and disposal of personal information worldwide. With the growing use of big data, AI, and machine learning, […]
Why Open Source Matters in Cloud DevOps
Click to learn more about author Nati Shalom. From its origin in the free software movement, open-source software has grown in popularity and adoption across industries worldwide. Open-source Linux now runs the majority of the world’s server workloads, Kubernetes (and Docker) adoption is growing exponentially and pushing the container and cloud-native revolutions, and on the […]
Case Study: Cox Automotive Solves Data Drift and ETL Challenges
According to Pat Patterson, Community Champion at StreamSets, “data drift” is such a problem now that “only about one fifth of a data analyst’s time is actually spent analyzing the data.” The remainder is spent “wrangling it into shape and getting it from where it is to the actual analysis platform.” Speaking at the Enterprise […]
Data Cleansing: Why It’s Important
Click to learn more about author Avee Mittal. Data cleansing is an important step to prepare data for analysis. It is a process of preparing data to meet the quality criteria such as validity, uniformity, accuracy, consistency, and completeness. Data cleansing removes unwanted, duplicate, and incorrect data from datasets, thus helping the analyst to develop […]
What Is ACID?
ACID properties characterize RDBMS (Relational Database Management System) database processing or a data warehouse. Originally coined in the early 1980s, according to DAMA DMBoK, the ACID philosophy consists of requirements for generating and maintaining reliable database transactions. ACID provides consistency before, during and after transactions through five properties: Atomic: Each task in a transaction succeeds […]
Lessons from a Real Disaster Recovery: Preparing Your Backup and DR Systems (Part III)
Click to learn more about author W. Curtis Preston. A system administrator who has actually been through a real disaster told me in a recent podcast interview that most of the challenges he had were more about basic infrastructure like internet, lodging, and food. In this final article in the series, I want to focus […]
Dealing with Database End-of-Life Issues: What Approach Should You Take?
Click to learn more about author Matt Yonkovit. Databases play a critical role in our applications – after all, no one ever talks about their services using less data today. However, like applications, databases require ongoing management and regular updates. At some point, the database version you use will reach its End of Life (EOL). […]
Rethinking Extract Transform Load (ETL) Designs
Click to learn more about author Aditi Raiter. Are you in a work environment where streaming architecture is not yet implemented across all IT systems? Have you ever been in a situation when you had to represent the ETL team by being up late for L3 support only to find out that one of your […]
Dos and Don’ts of Database Management for Online Businesses in 2021
Click to learn more about author Hiral Rana. Data has for years acted as the building blocks upon which all urban civilizations now stand. This proved even truer during the COVID-19 pandemic, when unimagined chaos ensued around the globe, sending multiple businesses into closure. On the flip side, the epidemic also presented a silver lining […]
What Is Canonical Data Modeling?
Canonical Data Modeling documents, using Data Modeling techniques, how messages or packets pass between different systems internally in the organization and across different company systems, to do e-business. Data sometimes vary, across systems, in their definitions. For example, a company may have defined “customer” for a data warehouse constructed 10 years ago and then characterized […]