Trees | Indices | Help |
|
---|
|
A classifier that assumes that P is linear in position. This is known as a (linear) logistic discriminant analysis:
This is a script that can be run from the Linux command line. Usage: l_classifier [flags] < input_data >log_file This script also produces two files: classes.chunk and classified.fiat.
Flags:
-test
Run some tests.
-D
Print extra debug information. Repeated -D flags
increase verbosity.
-quiet
Print less.
-c 0.NN
Ignore the specified fraction (0 <=
0.NN < 1
) of the worst classifications. See q_classifier_r.evaluate_Bayes
for details. When
building the classifiers, if 0.NN > 0
, this
essentially says that "nothing is extremely improbable, because
there's a 0.NN
chance that it is just a mistake."
This makes the classifier boundaries less sensitive to points on the
outskirts of regions.
-ftest 0.NN
Use a fraction 0 <
0.NN < 1
of the data for the test set; the remainder is
used for training the classifiers.
-coverage N.N
This script generates a group of
classifiers for a particular test-set/training-set pair, but then it
samples a new test set and repeats until an average datum appears in
a test set N.N
times.
-nperdim N.N
This controls how many classifiers are
generated per test-set/training-set pair. The number is
N.N
times the number of dimensions in the feature
vector.
-i filename
Take input from the specified file instead
of the standard input.
The input data is a multicolumn ASCII file with one line per measurement to be classified. Columns are separated by whitespace and are:
The standard output contains miscellaneous progress information and
lines (that are prefixed with "WRONG") that list incorrect
classifications. However, comprehensive classification information can
be found in classes.chunk
. This provides a list of all the
classifiers that were generated, and contains enough information to
reconstruct the classifiers so that they could be applied to another set
of data. (classes.chunk
is in a format readable by
chunkio.py
.) It is recommended that it be read in by
read_classified.read_classes_header
(if you
just want the top few lines of header information), or read_classified.read_classes
for the full
description.
The header contains information on classifier performance. It contains attribute/value pairs as follows:
Chance
.
Perfection
across all classifiers (normally 0).
Variables | |
__package__ = None
|
Imports: die, Q, QC
Trees | Indices | Help |
|
---|
Generated by Epydoc 3.0.1 on Wed Dec 8 11:00:41 2010 | http://epydoc.sourceforge.net |