| Class | Description |
|---|---|
| GenSort | A single process data generator for the terasort data. |
| TeraChecksum | |
| TeraGen | Generate the official GraySort input data set. |
| TeraGen.SortGenMapper | The Mapper class that, given a row number, generates the appropriate output line. |
| TeraInputFormat | An input format that reads the first 10 characters of each line as the key and the rest of the line as the value. |
| TeraOutputFormat | An output format that writes the key and value appended together. |
| TeraSort | Generates the sampled split points, launches the job, and waits for it to finish. |
| TeraSort.SimplePartitioner | A total order partitioner that assigns keys based on their first PREFIX_LENGTH bytes, assuming a flat distribution. |
| TeraValidate | Generates one mapper per file; each mapper checks that the keys are sorted within its file. |
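
The TeraInputFormat and TeraSort.SimplePartitioner descriptions above can be illustrated with a small standalone sketch. It is not the Hadoop source: the class name `FlatPartitionSketch`, its helper methods, and the value `PREFIX_LENGTH = 3` are assumptions made for illustration. It splits a row into a 10-byte key and a value, then maps the key to a partition by reading its first `PREFIX_LENGTH` bytes as an unsigned integer and dividing the prefix space into equal-width ranges, which is what assuming a flat distribution amounts to.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

/** Illustrative sketch only; not the Hadoop TeraSort source. */
class FlatPartitionSketch {
    // Key width taken from the TeraInputFormat description.
    static final int KEY_LENGTH = 10;
    // Assumed value for illustration; the table does not state PREFIX_LENGTH.
    static final int PREFIX_LENGTH = 3;

    /** Returns the first KEY_LENGTH bytes of a row as its key. */
    static byte[] keyOf(byte[] row) {
        return Arrays.copyOfRange(row, 0, KEY_LENGTH);
    }

    /** Returns everything after the key as the value. */
    static byte[] valueOf(byte[] row) {
        return Arrays.copyOfRange(row, KEY_LENGTH, row.length);
    }

    /**
     * Maps a key to a partition in [0, numPartitions) by treating its first
     * PREFIX_LENGTH bytes as an unsigned big-endian integer and splitting the
     * prefix space into equal-width ranges (the flat-distribution assumption).
     */
    static int partition(byte[] key, int numPartitions) {
        int prefix = 0;
        for (int i = 0; i < PREFIX_LENGTH; i++) {
            prefix = (prefix << 8) | (key[i] & 0xff);
        }
        int prefixSpace = 1 << (8 * PREFIX_LENGTH);
        // Ceiling division so the largest prefix still maps below numPartitions.
        int prefixesPerPartition = (prefixSpace + numPartitions - 1) / numPartitions;
        return prefix / prefixesPerPartition;
    }

    public static void main(String[] args) {
        byte[] row = "AAAAAAAAAApayload".getBytes(StandardCharsets.US_ASCII);
        byte[] key = keyOf(row);
        System.out.println(new String(valueOf(row), StandardCharsets.US_ASCII));
        System.out.println(partition(key, 4));
    }
}
```

Because the partition index never decreases as the key prefix increases, every key in a lower-numbered partition sorts at or before every key in a higher-numbered one, so concatenating the sorted partitions in order yields a totally ordered output.
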

| Enum | Description |
|---|---|
| TeraGen.Counters | |

Copyright © 2014 Apache Software Foundation. All rights reserved.