Apache

Big Data

Dealing With Big Data – Computerphile

Big Data sounds may be a buzz word, and is hard to quantify, but the problems with large data sets are very real. Dr Isaac Triguero explains some of the challenges.

Read More
Big Data

Kafka + Spark Streaming + Hive Example

Davis Busteed walks us through building a proof of concept for Spark Streaming from a Kafka Source to Hive. Check out the README and resource files at https://github.com/dbusteed/kafka-spark-streaming-example 

Read More
LinkedIn Open Sources a Tool that Formats Big Data for TensorFlow
AI Big Data TensorFlow

LinkedIn Open Sources a Tool that Formats Big Data for TensorFlow

LinkedIn has just open sourced a tool it created to convert Apache Spark-based big data into a format that can be readily consumed by TensorFlow. The tool, Avro2TF “bridges the gap and presents an elegant solution for ML engineers, freeing them up to focus on different deep learning algorithms,” the team stated. The team noted […]

Read More