|
||||||||||
PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES |
Interface Summary | |
---|---|
SplitInput.SplitCallback | Used to pass information back to a caller once a file has been split without the need for a data object |
Class Summary | |
---|---|
Bump125 | Helps with making nice intervals at arbitrary scale. |
ConcatenateVectorsJob | |
ConcatenateVectorsReducer | |
MatrixDumper | Export a Matrix in various text formats: * CSV file Input format: Hadoop SequenceFile with Text key and MatrixWritable value, 1 pair TODO: Needs class for key value- should not hard-code to Text. |
SequenceFileDumper | |
SplitInput | A utility for splitting files in the input format used by the Bayes classifiers or anything else that has one item per line or SequenceFiles (key/value) into training and test sets in order to perform cross-validation. |
SplitInputJob | |
SplitInputJob.SplitInputComparator | Randomly permute key value pairs |
SplitInputJob.SplitInputMapper | Mapper which downsamples the input by downsamplingFactor |
SplitInputJob.SplitInputReducer | Reducer which uses MultipleOutputs to randomly allocate key value pairs between test and training outputs |
|
||||||||||
PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES |