Kubernetes (sometimes abbreviated to “kube”) is open-sourced, was originally developed by Google, and organizes containers into logical units for transport and use in the cloud. Containers support the construction of self-contained environments capable of transporting data, and the software supporting it. Containers are, ultimately, a way to package software and other application components. It is […]
Case Study: Enterprise Metadata Repository Facilitates Change at the University of Washington
This past summer The University of Washington went live with Workday, a Software-as-a-Service human capital management solution. It replaced a decades-old HR/payroll (HR/P) system and represented the largest administrative transformation in the school’s history. Human resources personnel are affected by the change, of course, but so too is practically everyone else on campus. That’s to […]
Data Governance vs. Blockchain
Data Governance is the hot topic in today’s security- and privacy-concerned digital ecosystem. Blockchain technology offers data-centered security to sensitive, transactional systems that require tamper-resistant, fully auditable tracking mechanisms. Thus, blockchain and Data Governance (DG) complement each other in many ways, while maintaining distinct operational philosophies and application turfs. Blockchain is today’s preferred technology for […]
Data Containers Demystified: A Reliable Data Movement Solution
The Data Management industry has seen a significant rise in the recent interest of data containers. As Cloud Computing has gained popularity, methods for transporting data and its processing instructions, have been investigated, with data containers coming in as a viable a solution. Data containers solve the problem of getting software to run reliably, while […]
A Brief History of the Hadoop Ecosystem
In 2002, internet researchers just wanted a better search engine, and preferably one that was open-sourced. That was when Doug Cutting and Mike Cafarella decided to give them what they wanted, and they called their project “Nutch.” Hadoop was originally designed as part of the Nutch infrastructure, and was presented in the year 2005. The […]
IoT vs. Serverless Computing
The Internet of Things (IoT) describes an interconnected network of physical and digital devices, sensors, mechanical components, and communication protocols with the ability to transfer and exchange data machine-to-machine or machine-to-interactions. The serverless computing model is cloud-based and all its resources are managed by the service provider. The client is charged based on the consumption […]
So You Want to be a Data Manager?
A data manager develops and governs data-oriented systems designed to meet the needs of an organization or research team. Data Management includes accessing, validating, and storing data that is needed for research and day-to-day business operations. Currently, a wide array of organizations are using big data to gain insights into customer behavior and to provide […]
Blockchain Offers Internet of Things Data Quality and Data Security
The rapid advances of blockchain technology and the Internet of Things are changing how business on the internet gets done. Blockchain provides superior data security and Data Quality and, as a consequence, is changing the way people approach big data. This can be quite useful, as security remains a primary concern for Internet of Things […]
A Brief History of Microservices
The history and origins of microservices are a continuing effort to provide better communication between different platforms, greater simplicity, and more user-friendly systems. Microservices are typically thought of as a software development technique which organizes an application as a group of loosely coupled services. It is, however, any kind of small service which interacts with […]
Case Study: Cox Automotive Solves Data Drift and ETL Challenges
According to Pat Patterson, Community Champion at StreamSets, “data drift” is such a problem now that “only about one fifth of a data analyst’s time is actually spent analyzing the data.” The remainder is spent “wrangling it into shape and getting it from where it is to the actual analysis platform.” Speaking at the Enterprise […]