| Trees | Indices | Help |
|
|---|
|
|
A classifier that assumes that P is linear in position. This is known as a (linear) logistic discriminant analysis:
This is a script that can be run from the Linux command line. Usage: l_classifier [flags] < input_data >log_file This script also produces two files: classes.chunk and classified.fiat.
Flags:
-test Run some tests.
-D Print extra debug information. Repeated -D flags
increase verbosity.
-quiet Print less.
-c 0.NN Ignore the specified fraction (0 <=
0.NN < 1) of the worst classifications. See q_classifier_r.evaluate_Bayes for details. When
building the classifiers, if 0.NN > 0, this
essentially says that "nothing is extremely improbable, because
there's a 0.NN chance that it is just a mistake."
This makes the classifier boundaries less sensitive to points on the
outskirts of regions.
-ftest 0.NN Use a fraction 0 <
0.NN < 1 of the data for the test set; the remainder is
used for training the classifiers.
-coverage N.N This script generates a group of
classifiers for a particular test-set/training-set pair, but then it
samples a new test set and repeats until an average datum appears in
a test set N.N times.
-nperdim N.N This controls how many classifiers are
generated per test-set/training-set pair. The number is
N.N times the number of dimensions in the feature
vector.
-i filename Take input from the specified file instead
of the standard input.
The input data is a multicolumn ASCII file with one line per measurement to be classified. Columns are separated by whitespace and are:
The standard output contains miscellaneous progress information and
lines (that are prefixed with "WRONG") that list incorrect
classifications. However, comprehensive classification information can
be found in classes.chunk. This provides a list of all the
classifiers that were generated, and contains enough information to
reconstruct the classifiers so that they could be applied to another set
of data. (classes.chunk is in a format readable by
chunkio.py.) It is recommended that it be read in by
read_classified.read_classes_header (if you
just want the top few lines of header information), or read_classified.read_classes for the full
description.
The header contains information on classifier performance. It contains attribute/value pairs as follows:
Chance.
Perfection across all classifiers (normally 0).
| Variables | |
__package__ = None
|
|
Imports: die, Q, QC
| Trees | Indices | Help |
|
|---|
| Generated by Epydoc 3.0.1 on Wed Dec 8 11:00:41 2010 | http://epydoc.sourceforge.net |