Ideally, a machine learning engineer would have both the skills of a software engineer and the experience of a data scientist and data engineer. However, data scientists and software engineers usually come from very different backgrounds, and data scientists should not be expected to be great programmers, nor should software engineers be expected to provide […]
Four Predictions for Natural Language Processing in 2021
By David Talby. 2020 has been a year of massive growth for applied natural language processing (NLP). Even in the wake of COVID-19 and stunted IT budgets, a recent study showed that NLP spending increased 10-30 percent across industries, company sizes, and geographies (Gradient Flow). NLP tools can be […]
What Are GPUs and Why Do Data Scientists Love Them?
By Eva Murray. Move over, CPUs. GPUs have arrived in modern enterprises, and data scientists are eager to use them for their modeling and deep learning applications. Why is this happening, and what advantages do GPUs bring to Data Science applications? Read on and find out. What Are GPUs? GPUs, or graphics […]
Big Data Ecosystem Updates: Machine Learning, Deep Learning, and the Edge
One of the recent stories within the Big Data ecosystem is that Cisco is joining the AI hardware fray with a new deep learning server powered by eight GPUs. Cisco is promising support within its AI push for Kubeflow, “which is an open source tool that makes TensorFlow compatible with the Kubernetes container orchestration engine,” […]
Fraud Detection Using a Neural Autoencoder
By Rosaria Silipo. The co-authors of this column were Kathrin Melcher and Maarit Widmann. The Fraud Detection Problem: Fraud detection belongs to a more general class of problems, anomaly detection. An anomaly is a generic, not domain-specific, concept. It refers to any exceptional or unexpected event in the data, […]
Neural Machine Translation with Sequence to Sequence RNN
By Rosaria Silipo. The co-authors of this column were Kathrin Melcher and Simon Schmid. Automatic machine translation has been a popular subject for machine learning algorithms. After all, if machines can detect topics and understand texts, translation should be just the next step. Machine translation can be seen as a […]
GridGain Professional Edition 2.7 Introduces TensorFlow Integration, Enhanced Usability
A recent press release states, “GridGain Systems, provider of enterprise-grade in-memory computing solutions based on Apache® Ignite™, today announced the immediate availability of GridGain Professional Edition 2.7, a fully supported version of Apache Ignite 2.7. GridGain Professional Edition 2.7 introduces TensorFlow™ integration for enhanced training of deep learning (DL) models. GridGain Professional Edition 2.7 also […]
Out in the Open: Where Big Data and Open Source Coincide
By Gilad David Maayan. Big Data is a term used to describe large volumes of data in disparate formats that stream into various organizational systems at high speed. Analyzing this data and deriving insights that can give businesses a competitive edge requires special tools. […]
Bonsai Expands TensorFlow Support with Gears, Extending Functionality of AI Platform
By Angela Guess. A new press release reports, “Today at the O’Reilly AI Conference, Bonsai, provider of an AI platform that empowers enterprises to build and deploy intelligent systems, released Gears, a top feature requested by customers in the Bonsai Early Access Program. Gears further extends the value of Bonsai to data scientists, providing them […]
CData Software Releases the ODBC Reader for TensorFlow
By Angela Guess. A new press release reports, “CData Software (www.cdata.com), a leading provider of standards-based drivers and data access solutions for real-time data integration, today released the CData ODBC Reader for TensorFlow, a new open-source project that facilitates the integration of Google’s TensorFlow Machine Learning with real-time data access through ODBC. Extending TensorFlow with […]