Hive

Big Data

Kafka + Spark Streaming + Hive Example

Davis Busteed walks us through building a proof of concept for Spark Streaming from a Kafka Source to Hive. Check out the README and resource files at https://github.com/dbusteed/kafka-spark-streaming-example 

Read More
Spark vs. Tez: What's the Difference?
Big Data Spark

Spark vs. Tez: What’s the Difference?

At work recently, a question came up about whether Spark or Tez is better. Here’s an interesting article with some interesting perspectives. On paper, Spark and Tez have a lot in common: both possess in-memory capabilities, can run on top of Hadoop YARN and support all data types from any data sources. So, what’s the […]

Read More
Big Data

What’s in Hive 3.0?

What is new in Apache Hive 3.0? from DataWorks Summit

Read More
Azure Big Data

Fast Interactive Queries with Hive on LLAP

In this video, Murali Krishnaprasad discusses Interactive Query (also called Hive LLAP, or Low Latency Analytical Processing, or Live Long and Process), which is an Azure HDInsight cluster type. Interactive Query supports in-memory caching, which makes Hive queries super-fast and interactive. See how to use HDInsight Interactive Query to analyze extremely large datasets (~100TB) in common […]

Read More
Big Data

What is Hive?

Rafael Coss, manager Big Data Enablement for IBM, explains Hive defined in 3 minutes.

Read More