Hadoop Examples

Hadoop Examples is a set of simple example scripts to illustrate Hadoop ecosystem tools like Hive and Pig.

Installation

EXAMPLES_DIR is an environment variable you can set to point to the directory where the hadoop-examples.jar is installed.

There is also a script, utils/setup_env.sh, that other shell scripts can source to try to locate the hadoop-examples.jar. It is ugly, but sometimes convenient :-/
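The lookup described above can be sketched in Python. This is only an illustration of the idea, not a port of utils/setup_env.sh: the fallback directories and the function name `find_examples_jar` are hypothetical, and the real script may search differently.

```python
import os
from pathlib import Path

JAR_NAME = "hadoop-examples.jar"

# Hypothetical fallback locations (assumption); adjust for your own install.
FALLBACK_DIRS = ["/usr/lib/hadoop-mapreduce", "/opt/hadoop/share/hadoop/mapreduce"]


def find_examples_jar(environ=os.environ, fallback_dirs=FALLBACK_DIRS):
    """Return the path to hadoop-examples.jar, or None if it is not found.

    EXAMPLES_DIR wins when it is set; the fallback directories are only
    consulted when it is unset or empty.
    """
    examples_dir = environ.get("EXAMPLES_DIR")
    search_dirs = [examples_dir] if examples_dir else list(fallback_dirs)
    for directory in search_dirs:
        candidate = Path(directory) / JAR_NAME
        if candidate.is_file():
            return candidate
    return None


if __name__ == "__main__":
    jar = find_examples_jar()
    print(jar if jar else f"{JAR_NAME} not found; try setting EXAMPLES_DIR")
```

Setting EXAMPLES_DIR explicitly is the predictable path; the fallback search is a convenience and can pick up the wrong jar on machines with several Hadoop installs.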

Release Notes

HBase Block Size

November 2016

The HBase Block Size utility, =hbase/hbase_blocks/hbase_blocks.rb=, creates a table with a specified HBase block size. It writes data, flushes the table, then uses the admin object to get the region name. Finally, it displays the exact =hbase hfile= command to use to view the store file's index. Some okay/kinda cool JRuby stuff there.
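To get a feel for why the block size matters when you look at the store file's index, here is a rough back-of-the-envelope sketch. It assumes each data block holds about one block size worth of data, which is only approximately true: HBase closes a block at a cell boundary once the size is exceeded, and compression or encoding changes the on-disk numbers. The function name `estimate_data_blocks` is made up for this illustration.

```python
import math

DEFAULT_BLOCK_SIZE = 64 * 1024  # HBase's default BLOCKSIZE, in bytes


def estimate_data_blocks(store_file_bytes, block_size=DEFAULT_BLOCK_SIZE):
    """Roughly estimate how many data blocks (index entries) a store file has."""
    if block_size <= 0:
        raise ValueError("block_size must be positive")
    return math.ceil(store_file_bytes / block_size)


# Compare a few block sizes for a hypothetical 10 MiB of flushed data.
for size in (4 * 1024, 64 * 1024, 256 * 1024):
    print(size, estimate_data_blocks(10 * 1024 * 1024, size))
```

Smaller blocks mean more index entries for the same data, which is the effect the utility lets you observe on a real store file.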

Streaming Config Dumper

MapReduce streaming scripts that print their environment variables, which also include the Hadoop configuration passed to streaming jobs.

See =mr/streaming_config_dumper/=
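A minimal sketch of what such a dumper can look like as a Python streaming mapper. It is not the code from the directory above; the names `dump_env` and `main` are made up here. Hadoop Streaming exposes job configuration to tasks as environment variables with the dots in property names replaced by underscores (so mapreduce.job.id shows up as mapreduce_job_id), which is why printing the environment also prints the configuration.

```python
#!/usr/bin/env python3
import os
import sys


def dump_env(environ, out):
    """Write every environment variable as a tab-separated key/value line."""
    for key in sorted(environ):
        out.write(f"{key}\t{environ[key]}\n")


def main():
    # Drain stdin first: a streaming mapper is expected to consume its input,
    # even when it ignores the records.
    for _ in sys.stdin:
        pass
    dump_env(os.environ, sys.stdout)
```

To use this as an actual mapper, call `main()` at the bottom of the file. With several map tasks, each one prints its own copy of the environment, so a small input with a single split keeps the output readable.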

Hive and Pig

December 20, 2013

- Incremental insert example in Hive: inserts non-duplicate data into a join table from incrementally updated source tables. See hive/incremental_insert/
- Added an example of Pig's EXPLAIN command to show a diagram of the execution plan for SPLIT versus FILTER. See pig/explain-split-vs-filter/
- Added an example of Hive's PARTITION feature. See hive/partition-example/
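The incremental insert idea from the first item above can be modeled in a few lines of Python. This is a sketch of the semantics only: the example in hive/incremental_insert/ is written for Hive and may express it differently, and the row shape, the `id` key, and the function name `incremental_insert` are assumptions made for illustration.

```python
def incremental_insert(target_rows, source_rows, key):
    """Append source rows whose key is not already present in the target.

    Returns the list of rows that were actually inserted.
    """
    seen = {key(row) for row in target_rows}
    inserted = []
    for row in source_rows:
        k = key(row)
        if k not in seen:
            seen.add(k)  # also guards against duplicates within the batch
            target_rows.append(row)
            inserted.append(row)
    return inserted


# Hypothetical data: one row already loaded, then a batch with a repeat.
target = [{"id": 1, "name": "a"}]
batch = [{"id": 1, "name": "a"}, {"id": 2, "name": "b"}, {"id": 2, "name": "b"}]
print(incremental_insert(target, batch, key=lambda r: r["id"]))
```

Running the same batch twice inserts nothing the second time, which is the property that makes repeated incremental loads safe.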

NathanNeff/hadoop-examples
