support vector machine based classifier for deviation from standard written English
You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
gabe 4239109446 Readme actually has content sorta 12 years ago
bin initial commit 12 years ago
data initial commit 12 years ago
thirdparty initial commit 12 years ago
CC initial commit 12 years ago
GPL initial commit 12 years ago
INSTALL initial commit 12 years ago
LICENSE initial commit 12 years ago
Makefile initial commit 12 years ago
README.md Readme actually has content sorta 12 years ago
SVMUtil.cpp initial commit 12 years ago
SVMUtil.h initial commit 12 years ago
classify.sh initial commit 12 years ago
parameterresult.h initial commit 12 years ago
parametersearch.cpp initial commit 12 years ago
parametersearch.h initial commit 12 years ago
stupidfilter.cpp initial commit 12 years ago

README.md

stupidfilter

To run the StupidFilter directly just type bin/stupidfilter data/c_rbf

It will take data from standard in followed by a EOF and return a 0.000000 classification for stupid text and a 1.000000 for nonstupid text. Once we have regression working more accurately, this number will be actually be a floating point that describes how sure we are of the classification, so it's worth keeping it as a float in your implementations.

We have provided an example bash implementation in classify.sh. Note that we're normalizing whitespace with a call to sed. It's a good idea to strip HTML and normalize whitespace to avoid false positives.