Hive

http://hive.apache.org/

[]

Hive Architecture

Hive Components: In Hive we have shell/ MetaShell/ Hive execution engine/ hive JDBC/ Compliers

Limitation of hive: Interactive data latency in data (can be overcomes by SPARK) Hive 14 has update, but the operation is very slow.

Abilities of Hive
 * 1) Performs filter operations using where clause
 * 2) equi joins are supported
 * 3) store results on local or HDFS drives
 * 4) Store results on another table
 * 5) alter tables, add colunes
 * 6) Partitions
 * 7) Expose data for BI

Schema: Schema on Write Schema on read

HQL- Hive Query Language

used to express MapR in Hive, HQL data Types

Partition -> Buckets -> Data Buckets -> Data

There are two type of Partitioning Static Dynamic

when we know the fixed no of partition are available, we go for Static partition Dynamic: when more no of partition are there we go for Dynamic partitions