org.apache.mahout.classifier.df.mapreduce.inmem
Class InMemBuilder
java.lang.Object
org.apache.mahout.classifier.df.mapreduce.Builder
org.apache.mahout.classifier.df.mapreduce.inmem.InMemBuilder
public class InMemBuilder
extends Builder
MapReduce implementation in which each mapper loads a full copy of the data in memory. The trees of the
forest are split across all the mappers.
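To make the idea of splitting the forest across mappers concrete, here is a minimal, self-contained sketch of one way a fixed number of trees could be divided among mappers. It does not use or reproduce Mahout's code: the class `TreeSplitSketch`, its methods, and the even-split policy (earlier mappers absorb the remainder) are assumptions made for illustration only, and the actual partitioning used by InMemBuilder may differ.

```java
import java.util.Arrays;

/**
 * Hypothetical illustration only; not part of Mahout.
 * Assumes an even split in which the first (nbTrees % numMaps) mappers
 * each grow one extra tree.
 */
class TreeSplitSketch {

  /** Returns how many trees each mapper would grow under the assumed even split. */
  static int[] treesPerMapper(int nbTrees, int numMaps) {
    if (nbTrees < 0 || numMaps <= 0) {
      throw new IllegalArgumentException("nbTrees must be >= 0 and numMaps must be > 0");
    }
    int[] counts = new int[numMaps];
    int base = nbTrees / numMaps;       // trees every mapper gets
    int remainder = nbTrees % numMaps;  // leftover trees, handed out one per mapper
    for (int i = 0; i < numMaps; i++) {
      counts[i] = base + (i < remainder ? 1 : 0);
    }
    return counts;
  }

  /** Returns the id of the first tree assigned to each mapper, numbering trees from 0. */
  static int[] firstTreeIds(int[] counts) {
    int[] firstIds = new int[counts.length];
    int next = 0;
    for (int i = 0; i < counts.length; i++) {
      firstIds[i] = next;
      next += counts[i];
    }
    return firstIds;
  }

  public static void main(String[] args) {
    int[] counts = treesPerMapper(10, 3);
    System.out.println("trees per mapper: " + Arrays.toString(counts));
    System.out.println("first tree ids:   " + Arrays.toString(firstTreeIds(counts)));
  }
}
```

Because every mapper holds the whole dataset in memory, a scheme like this only has to decide which tree ids each mapper is responsible for; no partitioning of the data itself is involved.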
Constructor Summary
InMemBuilder(TreeBuilder treeBuilder,
org.apache.hadoop.fs.Path dataPath,
org.apache.hadoop.fs.Path datasetPath)
InMemBuilder(TreeBuilder treeBuilder,
org.apache.hadoop.fs.Path dataPath,
org.apache.hadoop.fs.Path datasetPath,
Long seed,
org.apache.hadoop.conf.Configuration conf)
Method Summary
protected void configureJob(org.apache.hadoop.mapreduce.Job job)
Used by the inheriting classes to configure the job
protected DecisionForest parseOutput(org.apache.hadoop.mapreduce.Job job)
Parse the output files to extract the trees and pass the predictions to the callback
Methods inherited from class org.apache.mahout.classifier.df.mapreduce.Builder
build, getDataPath, getDistributedCacheFile, getNbTrees, getNumMaps, getOutputPath, getRandomSeed, getTreeBuilder, isOutput, loadDataset, runJob, setNbTrees, setOutputDirName, sortSplits
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
InMemBuilder
public InMemBuilder(TreeBuilder treeBuilder,
org.apache.hadoop.fs.Path dataPath,
org.apache.hadoop.fs.Path datasetPath,
Long seed,
org.apache.hadoop.conf.Configuration conf)
InMemBuilder
public InMemBuilder(TreeBuilder treeBuilder,
org.apache.hadoop.fs.Path dataPath,
org.apache.hadoop.fs.Path datasetPath)
configureJob
protected void configureJob(org.apache.hadoop.mapreduce.Job job)
throws IOException
Description copied from class: Builder
Used by the inheriting classes to configure the job
Specified by: configureJob in class Builder
Parameters: job - Hadoop's Job
Throws: IOException - if anything goes wrong while configuring the job
parseOutput
protected DecisionForest parseOutput(org.apache.hadoop.mapreduce.Job job)
throws IOException
Description copied from class: Builder
Parse the output files to extract the trees and pass the predictions to the callback
Specified by: parseOutput in class Builder
Parameters: job - Hadoop's Job
Returns: the built DecisionForest
Throws: IOException - if anything goes wrong while parsing the output
Copyright © 2008–2014 The Apache Software Foundation. All rights reserved.