org.apache.hadoop.examples.terasort
Class TeraGen
java.lang.Object
org.apache.hadoop.conf.Configured
org.apache.hadoop.examples.terasort.TeraGen
- All Implemented Interfaces:
- org.apache.hadoop.conf.Configurable, org.apache.hadoop.util.Tool
public class TeraGen
- extends org.apache.hadoop.conf.Configured
- implements org.apache.hadoop.util.Tool
Generate the official GraySort input data set.
The user specifies the number of rows and the output directory and this
class runs a map/reduce program to generate the data.
The format of the data is:
- (10 bytes key) (constant 2 bytes) (32 bytes rowid)
(constant 4 bytes) (48 bytes filler) (constant 4 bytes)
- The rowid is the right justified row id as a hex number.
To run the program:
bin/hadoop jar hadoop-*-examples.jar teragen 10000000000 in-dir
Methods inherited from class org.apache.hadoop.conf.Configured |
getConf, setConf |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Methods inherited from interface org.apache.hadoop.conf.Configurable |
getConf, setConf |
NUM_ROWS
public static String NUM_ROWS
TeraGen
public TeraGen()
run
public int run(String[] args)
throws IOException,
InterruptedException,
ClassNotFoundException
- Specified by:
run
in interface org.apache.hadoop.util.Tool
- Parameters:
args
- the cli arguments
- Throws:
IOException
InterruptedException
ClassNotFoundException
main
public static void main(String[] args)
throws Exception
- Throws:
Exception
Copyright © 2009 The Apache Software Foundation