Spark SQL Archives - DATAVERSITY

Case Study: Deriving Spark Encoders and Schemas Using Implicits

Dávid SzakallasFebruary 13, 2020February 13, 2020

Click to learn more about author Dávid Szakallas. In recent years, the size and complexity of our Identity Graph, a data lake containing identity information about people and businesses around the world, begged the addition of Big Data technologies in the ingestion process. We used Apache Pig initially, and then migrated to Apache Spark a […]

How to Dive into Data Lakes and Not Drown

Jennifer ZainoDecember 7, 2016December 7, 2016

Kyvos Insights has a question for Enterprise Analytics executives: How are your Data Lakes working out for you? For many that embarked on journeys to stock up Hadoop based Data Lakes, with everything from structured transaction records to non-relational data such as log files, Internet clickstream records, sensor data, social streams, and so on, the answer has been […]