
One doc tagged with "big data"


Apache Hive Overview

Apache Hive is a data warehousing system built on top of Apache Hadoop for querying and analyzing data. It provides an SQL-like interface, HiveQL, for data stored in the Hadoop Distributed File System (HDFS) or other compatible storage systems. Hive compiles these queries into jobs that run on an execution engine such as MapReduce, Apache Tez, or Apache Spark, so users can work with massive datasets using familiar SQL syntax.
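
To make the idea of turning a query into a MapReduce job more concrete, here is a minimal Python sketch of how an aggregation such as `SELECT country, COUNT(*) FROM page_views GROUP BY country` could be broken into map, shuffle, and reduce phases. It is a toy illustration only, not Hive's actual query planner: the table name `page_views`, its columns, the sample rows, and all function names are hypothetical.

```python
from collections import defaultdict

# Hypothetical sample rows standing in for a table page_views(user_id, country).
ROWS = [
    {"user_id": 1, "country": "US"},
    {"user_id": 2, "country": "DE"},
    {"user_id": 3, "country": "US"},
    {"user_id": 4, "country": "IN"},
    {"user_id": 5, "country": "DE"},
    {"user_id": 6, "country": "US"},
]


def map_phase(rows):
    """Emit a (country, 1) pair for each input row, as a mapper would."""
    for row in rows:
        yield row["country"], 1


def shuffle_phase(pairs):
    """Group emitted values by key, as happens between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Sum the values for each key, giving COUNT(*) per group."""
    return {key: sum(values) for key, values in groups.items()}


def count_by_country(rows):
    """Run the three phases in sequence on an in-memory list of rows."""
    return reduce_phase(shuffle_phase(map_phase(rows)))


if __name__ == "__main__":
    print(count_by_country(ROWS))
```

In a real cluster the map and reduce phases would run in parallel across many machines over data stored in HDFS. The sketch runs everything in one process to show only the shape of the computation that the SQL statement stands for.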