Data Architecture and Artificial Intelligence: How Do They Work Together?

By on

data architectureArtificial Intelligence (AI) is rapidly gaining ground as core business competency. Today’s Machine Learning (ML) or Deep Learning (DL) algorithms promise to revolutionize business models and processes, restructure workforces, and transform data infrastructures to enhance process efficiency and improve decision-making throughout the enterprise. Gone are the days of data silos and manual algorithms.

However, Why AI Would Be Nothing Without Big Data reconfirms this widespread belief by stating that AI’s growth was stunted in the past mainly due to the unavailability of large data sets. Big Data changed all that – enabling businesses to take advantage of high-volume and high-velocity data to train AI algorithms for business-process improvements and enhanced decision making.

The Road to AI Leads through Information Architecture describes how hybrid Data Management, Data Governance, and Business Analytics can together transform enterprise-wide decision making. According to this author, these three core business practices can enable organizations of all sizes “to unleash the power of AI in the enterprise.”

The Role of Data Architecture in Unleashing the Power of Artificial Intelligence

William McKnight, the president McKnight Consulting Group, has said that that “Information Architecture” plays a key role in establishing order in the continuous evolution of emerging data technologies. McKnight discusses specific measures that organizations should take to embrace AI and streaming data technologies, and the long-range impact of General Data Protection Regulation (GDPR) on enterprise Data Management practices. He recognizes that while streaming data is the only way to deal with the high velocity of Big Data, strong Data Governance measures will ensure GDPR compliance.

Recently, the umbrella field of AI has gained traction because of the innovative IT solutions enabled by Machine Learning or Deep Learning technologies. The terms “intelligent” or “smart” associated with any IT system specifically point toward the ML or Dl capabilities of such systems.

Adapting Your Data Strategy for the Rise of Artificial Intelligence in Business suggests that well-managed Data Architectures and AI technologies are poised to drive future innovations in IT, which will bring in better opportunities for businesses through technological disruptions. However, these trends also indicate that the businesses will need highly capable Data Science field experts, groomed in AI, predictive modeling, ML, and DL, among other skills, to drive this transformative tech leadership.

A DATAVERSITY® webinar points out that all core Data Management technologies like Artificial Intelligence, Machine Learning, or Big Data Require a sound Data Architecture with Data Storage and Data Governance best practices in place. This webinar discusses how the latest Data Architecture Trends support organizational goals. Tomorrow’s data technology expert will be responsible for implementing and sustaining a Data Strategy and will be expected to handle the risks and the newer profit opportunities with equal finesse.

But what kind of data infrastructure will allow that to happen? A well-defined and structured Data Architecture that accommodates Big Data, IoT, and AI while complying with all the applicable GDPR regulations. Three Impressions From Data Architecture Summit 2017 lists Data Governance as the key theme in a global environment of expanding data sources and all-engulfing AI technologies.

Cloud: The Present and Future Savior of Enterprise Analytics

As businesses increasingly begin to rely on data and Analytics for competing, Data Architectures are beginning to assume larger roles in the enterprise. In the era of Digital Businesses, the new norm for Data Architectures is a dynamic and scalable model that is, to some extent, met by Public Cloud. The latest Analytics requirement is to process data at the source, thus allowing AI-based Analytics across the Data Center network to the edge of the enterprise, as discussed in How to Create Cloud-Based Data Architectures.

Gartner provides the direct benefits of Cloud infrastructure in the management and delivery of data-driven, actionable intelligence. The Analytics Everywhere trend, which is gaining momentum, will drive the change from on-premise or hosted Analytics to the Edge Computing era, where Business Analytics will happen in real time, and much closer to the source of data.

In the IoT Age, businesses cannot afford to lose valuable time and money in collecting and depositing the incoming data to a far-away location. Analytics will happen at the edge of businesses, which signals the next phase of Cloud Computing. The Cloud-First strategy is already here with more and more organizations adopting the Cloud. So, what’s next for Analytics? Edge Computing? Serverless Computing?

If Data Architectures are robust enough, Analytics will have the potential to go “viral,” both within and outside the organization. In that scenario, even Citizen Data Scientists will be able to conduct self-service Analytics at the point of data ingestion.

Human-Centric AI System Designs: A Panacea?

Andrew Ng recommends AI be adopted as an enterprise-wide decision-making strategy. As Artificial Intelligence technologies enable accurate forecasting techniques, enhanced process management through automation, and higher performance metrics for the whole organization, businesses that choose to ignore AI will be left behind. Machine Learning, Deep Learning, human-machine interactions, and autonomous systems can jointly deliver results unmatched by any other business system.

Sanjit Dang of Intel Capital said at a recent Global AI Conference that: “AI is eating the world and will eventually eat every vertical.”

Artificial Intelligence for Data-Driven Disruption discusses the power of an “AI-powered engine” to deliver real-time insights for managerial decision-making. With the ever-rising volume, variety, and velocity of business data, every business user from the Citizen Data Scientist to the seasoned Data Stewards will need quick and timely access to data.

Living in the smart-systems era, the humans cannot overlook the fact that even AI algorithms can fail to deliver results if not implemented or adapted properly in the human work environments. The AI algorithms used today are similar to the ones used many years ago, but the computers or processors have become faster and more powerful.

While it is widely acknowledged that advanced Artificial Intelligence can automate many rote human tasks and can even “think” in limited cases, AI systems have not really passed “disaster situations” as in the case of self-driving cars or natural-calamity predictions. Thus, while AI algorithms can be extensively trained with the use of data to emulate human thinking to an extent, AI researchers have still not been able to establish the human-cognitive abilities of a robot or a smart machine.

The most fundamental difference is that the human brain can respond to original situations while the machine brain can only adopt second-hand situations transmitted through human-experience data, as explained in Smarter Together: Why Artificial Intelligence Needs Human-Centered Design.

Future algorithms can be trained to emulate human-cognitive capabilities. But while humans can err due to overconfidence, machine intelligence strictly relies on a study and application of data-driven facts. Even a bad algorithm can improve human thinking, thus according to “Kasparov’s law,” the process has to be improved to enable the best possible human-machine collaboration.

The Artificial Intelligence algorithms of the future should be designed from a human point of view, to reflect the actual business environment and information goals of the decision-maker. The AI Software Engineer is the person in a Data Science team who plays the critical role of bridging the gap between Data Scientists and Data Architects.

Architectural Requirements of Machine Learning-Driven Artificial Intelligence Systems

In Machine Learning, data is both the teacher and the trainer that shapes the algorithm in a specific way without any programming. Thus, data preparation for ML pipelines can be challenging if the Data Architectures have not been refined enough to interoperate with the underlying analytic platforms.

Machine Learning is best-suited for high-volume and high-velocity data, as explained in Preparing and Architecting for Machine Learning. According the article, the Data Architecture layer in an end-to-end Analytics sub system must support the Data Preparation requirements for Machine Learning algorithms to work. A dedicated development life cycle supporting ML learning models has to be available, and the ML platform must support several ML frameworks for custom solutions from commercial vendors.  The Public Cloud is a great storage and compute environment for ML systems simply because of its architectural elasticity.

An organization can only take advantage of this huge mass of data from many different sources if a sound Data Architecture (data as an enterprise layer) is in place across the organization and if end-to-end AI-powered Analytics systems have been deployed to empower all types of business users to engage in just-in-time analytics and BI activities. Also, take a look at the Predictive Data Pipelines and Architectures Track on Q.Con’s AI Conference held in April 2018. This conference showcased real-world AI systems used for predictions, recommenders, fraud prevention or ranking systems.

The Future

In the coming years, as information derived from “data” becomes a corporate asset with high revenue potentials, organizations will become more disciplined about monetizing and measuring the impact of data like the other KPIs.

Gartner states that by 2021, Data Centers will have to integrate AI capabilities in their architectures. Make Room for AI Applications in the Data Center Architecture predicts that AI applications will penetrate every vertical in the near future, so it makes sense to adopt Artificial Intelligence, Machine Learning, and Deep Learning practices in the Data Centers. As these technologies will challenge existing data storage technologies, newer and better platforms like the Edge or the Serverless may be the answer.

 

Photo Credit: TechnoVectors/Shutterstock.com

We use technologies such as cookies to understand how you use our site and to provide a better user experience. This includes personalizing content, using analytics and improving site operations. We may share your information about your use of our site with third parties in accordance with our Privacy Policy. You can change your cookie settings as described here at any time, but parts of our site may not function correctly without them. By continuing to use our site, you agree that we can save cookies on your device, unless you have disabled cookies.
I Accept