| Class | Description |
|---|---|
| BroadcastAndOneToOneExample | |
| BroadcastAndOneToOneExample.InputProcessor | |
| BroadcastAndOneToOneExample.OneToOneProcessor | |
| ExampleDriver |
A description of an example program based on its class and a
human-readable description.
|
| FilterLinesByWord | |
| FilterLinesByWord.TextLongPair | |
| FilterLinesByWordOneToOne | |
| GroupByOrderByMRRTest |
Simple example that does a GROUP BY ORDER BY in an MRR job
Consider a query such as
Select DeptName, COUNT(*) as cnt FROM EmployeeTable
GROUP BY DeptName ORDER BY cnt;
i.e.
|
| GroupByOrderByMRRTest.MyGroupByReducer |
Intermediate reducer aggregates the total count per department.
|
| GroupByOrderByMRRTest.MyMapper |
Mapper takes in a single line as input containing
employee name and department name and then
emits department name with count of 1
|
| GroupByOrderByMRRTest.MyOrderByNoOpReducer |
Shuffle ensures ordering based on count of employees per department
hence the final reducer is a no-op and just emits the department name
with the employee count per department.
|
| Join |
Given a set of sorted datasets keyed with the same class and yielding
equal partitions, it is possible to effect a join of those datasets
prior to the map.
|
| MapredWordCount |
This is an example Hadoop Map/Reduce application using the mapred apis.
|
| MapredWordCount.MapClass |
Counts the words in each line.
|
| MapredWordCount.Reduce |
A reducer class that just emits the sum of the input values.
|
| MRRSleepJob |
Dummy class for testing MR framefork.
|
| MRRSleepJob.EmptySplit | |
| MRRSleepJob.ISleepReducer | |
| MRRSleepJob.MRRSleepJobPartitioner | |
| MRRSleepJob.SleepInputFormat | |
| MRRSleepJob.SleepMapper | |
| MRRSleepJob.SleepReducer | |
| RandomTextWriter |
This program uses map/reduce to just run a distributed job where there is
no interaction between the tasks and each task writes a large unsorted
random sequence of words.
|
| RandomWriter |
This program uses map/reduce to just run a distributed job where there is
no interaction between the tasks and each task write a large unsorted
random binary sequence file of BytesWritable.
|
| RPCLoadGen | |
| RPCLoadGen.RPCSleepProcessor | |
| SecondarySort |
This is an example Hadoop Map/Reduce application.
|
| SecondarySort.FirstGroupingComparator |
Compare only the first part of the pair, so that reduce is called once
for each value of the first part.
|
| SecondarySort.FirstPartitioner |
Partition based on the first part of the pair.
|
| SecondarySort.IntPair |
Define a pair of integers that are writable.
|
| SecondarySort.IntPair.Comparator |
A Comparator that compares serialized IntPair.
|
| SecondarySort.MapClass |
Read two integers from each line and generate a key, value pair
as ((left, right), right).
|
| SecondarySort.Reduce |
A reducer class that just emits the sum of the input values.
|
| Sort<K,V> |
This is the trivial map/reduce program that does absolutely nothing
other than use the framework to fragment and sort the input values.
|
| TestOrderedWordCount |
An MRR job built on top of word count to return words sorted by
their frequency of occurrence.
|
| TestOrderedWordCount.IntSumReducer | |
| TestOrderedWordCount.MyOrderByNoOpReducer |
Shuffle ensures ordering based on count of employees per department
hence the final reducer is a no-op and just emits the department name
with the employee count per department.
|
| TestOrderedWordCount.TokenizerMapper | |
| TezExampleBase | |
| UnionExample | |
| UnionExample.TokenProcessor | |
| UnionExample.UnionProcessor |
Copyright © 2014 Apache Software Foundation. All rights reserved.