org.apache.hadoop.examples.terasort
Class TeraGen

java.lang.Object
  extended by org.apache.hadoop.conf.Configured
      extended by org.apache.hadoop.examples.terasort.TeraGen
All Implemented Interfaces:
org.apache.hadoop.conf.Configurable, org.apache.hadoop.util.Tool

public class TeraGen
extends org.apache.hadoop.conf.Configured
implements org.apache.hadoop.util.Tool

Generate the official GraySort input data set. The user specifies the number of rows and the output directory and this class runs a map/reduce program to generate the data. The format of the data is:

To run the program: bin/hadoop jar hadoop-*-examples.jar teragen 10000000000 in-dir


Nested Class Summary
static class TeraGen.Counters
           
static class TeraGen.SortGenMapper
          The Mapper class that given a row number, will generate the appropriate output line.
 
Field Summary
static String NUM_ROWS
           
 
Constructor Summary
TeraGen()
           
 
Method Summary
static void main(String[] args)
           
 int run(String[] args)
           
 
Methods inherited from class org.apache.hadoop.conf.Configured
getConf, setConf
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 
Methods inherited from interface org.apache.hadoop.conf.Configurable
getConf, setConf
 

Field Detail

NUM_ROWS

public static String NUM_ROWS
Constructor Detail

TeraGen

public TeraGen()
Method Detail

run

public int run(String[] args)
        throws IOException,
               InterruptedException,
               ClassNotFoundException
Specified by:
run in interface org.apache.hadoop.util.Tool
Parameters:
args - the cli arguments
Throws:
IOException
InterruptedException
ClassNotFoundException

main

public static void main(String[] args)
                 throws Exception
Throws:
Exception


Copyright © 2009 The Apache Software Foundation