Next: Solutions to frequent problems Up: METAL The METAL Machine Previous: Running Experiments   Contents



Structure and Organization of Output Data

The log file

For each experiment, a log file named filename_seed.log is created. It records what run_exp has been doing. If run_exp is invoked several times for the same filestem and seed in the same output directory, each new log is appended to the existing one, unless the -o (overwrite) option has been given to the run_exp command. The log contains more information from run_exp if the -d (debug) option was given, and it also includes debugging information from the interface scripts that were called if the -lad option was given.
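The append-versus-overwrite behaviour described above can be illustrated with a minimal Python sketch. It is not run_exp's actual code: the function names open_log and write_run are hypothetical, and the sketch assumes that "filename" in the log name stands for the filestem.

```python
from pathlib import Path


def open_log(out_dir, filestem, seed, overwrite=False):
    """Open <filestem>_<seed>.log in out_dir (hypothetical helper).

    Assumption: the "filename" part of the log name is the filestem.
    """
    path = Path(out_dir) / f"{filestem}_{seed}.log"
    # "a" keeps the logs of earlier runs; "w" mimics the -o (overwrite) option.
    mode = "w" if overwrite else "a"
    return path.open(mode, encoding="utf-8")


def write_run(out_dir, filestem, seed, lines, overwrite=False):
    """Write the log lines of one run, appending by default."""
    with open_log(out_dir, filestem, seed, overwrite) as log:
        for line in lines:
            log.write(line + "\n")
```

Calling write_run twice with the same filestem and seed leaves both runs in the file; passing overwrite=True keeps only the latest run.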

The results file

The .results file contains one group of variables that describe the experiment and the database, and another group of variables that hold information for each combination of algorithm, fold, and repetition.

The .stats file

The .stats file contains all the measures that the run_stats program calculates from the .pred and .targets files. The run_stats program is called automatically at the end of run_exp unless explicitly suppressed.

The variables in the .stats file for classification-type experiments:

The variables in the .stats file for regression-type experiments:

The .dct file

The DCT program and its output are documented in [DCT doc].

The targets files

For each fold of the cross-validation, a file containing only the targets of the test file for that fold is stored in the results directory. The file name has the form <filestem>_<seed>_<fold>.targets.

These files are necessary for the run_stats program to calculate error estimates and similar measures.
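Because run_stats depends on these files, it can be useful to check that none are missing before computing statistics by hand. The following Python sketch builds the expected names from the pattern above and reports those not found. The helper names are hypothetical, and the fold identifiers are supplied by the caller because their numbering is not specified here.

```python
from pathlib import Path


def targets_name(filestem, seed, fold):
    """Name of the targets file for one fold, following the pattern above."""
    return f"{filestem}_{seed}_{fold}.targets"


def missing_targets(results_dir, filestem, seed, folds):
    """Return the expected targets file names that are absent from results_dir.

    `folds` is whatever fold identifiers the experiment used (an assumption:
    the caller knows them; this sketch does not guess the numbering).
    """
    results_dir = Path(results_dir)
    return [
        targets_name(filestem, seed, fold)
        for fold in folds
        if not (results_dir / targets_name(filestem, seed, fold)).is_file()
    ]
```

An empty list means every expected targets file is present.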

The prediction files

For each combination of learning algorithm and fold of the cross-validation, a file containing only the predictions of that learning algorithm for the test file is stored in the results directory. The file name has the form <filestem>_<seed>_<fold>_<alg>.pred.

These files are necessary for the run_stats program to calculate error estimates and similar measures.
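As a rough illustration of how an error estimate could be derived from one prediction file and its matching targets file, consider the Python sketch below. It is not the run_stats implementation. It assumes, purely for illustration, that both files hold one value per line in the same order (the file layout is not specified here) and that the experiment is of the classification type. All function names are hypothetical.

```python
from pathlib import Path


def pred_name(filestem, seed, fold, alg):
    """Name of the prediction file for one fold and algorithm."""
    return f"{filestem}_{seed}_{fold}_{alg}.pred"


def read_values(path):
    """Read non-empty lines as values (assumed layout: one value per line)."""
    text = Path(path).read_text(encoding="utf-8")
    return [line.strip() for line in text.splitlines() if line.strip()]


def error_rate(targets, predictions):
    """Fraction of test cases whose prediction differs from the target."""
    if len(targets) != len(predictions):
        raise ValueError("targets and predictions differ in length")
    if not targets:
        raise ValueError("no test cases")
    wrong = sum(t != p for t, p in zip(targets, predictions))
    return wrong / len(targets)
```

With targets read from a .targets file and predictions read from the .pred file of the same fold, error_rate returns a value between 0 and 1.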


2002-10-17