4239109446 | 12 years ago | |
---|---|---|
bin | 12 years ago | |
data | 12 years ago | |
thirdparty | 12 years ago | |
CC | 12 years ago | |
GPL | 12 years ago | |
INSTALL | 12 years ago | |
LICENSE | 12 years ago | |
Makefile | 12 years ago | |
README.md | 12 years ago | |
SVMUtil.cpp | 12 years ago | |
SVMUtil.h | 12 years ago | |
classify.sh | 12 years ago | |
parameterresult.h | 12 years ago | |
parametersearch.cpp | 12 years ago | |
parametersearch.h | 12 years ago | |
stupidfilter.cpp | 12 years ago |
README.md
stupidfilter
To run the StupidFilter directly, type `bin/stupidfilter data/c_rbf`.
It reads text from standard input until EOF, then prints a classification: 0.000000 for stupid text and 1.000000 for nonstupid text. Once regression is working more accurately, this number will become a floating-point value that describes how confident we are in the classification, so it's worth keeping it as a float in your implementations.
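As a rough illustration of keeping the result as a float, here is a minimal Python sketch of a wrapper around the binary. It is not part of this repository. The names `classify`, `parse_score`, and `is_stupid` are hypothetical, the 0.5 threshold is an arbitrary choice, and the sketch assumes the program prints only the number on standard output.

```python
import subprocess


def parse_score(output: str) -> float:
    """Convert the classifier's printed result to a float.

    Assumption: stdout contains only the number, e.g. "0.000000".
    """
    return float(output.strip())


def classify(text: str, cmd=("bin/stupidfilter", "data/c_rbf")) -> float:
    """Send text on standard input (closed afterwards, i.e. EOF) and return the score."""
    result = subprocess.run(
        list(cmd),
        input=text,
        capture_output=True,
        text=True,
        check=True,  # raise if the process exits with a non-zero status
    )
    return parse_score(result.stdout)


def is_stupid(score: float, threshold: float = 0.5) -> bool:
    """0.0 means stupid and 1.0 means nonstupid; the threshold is an assumption."""
    return score < threshold
```

Comparing against a threshold rather than testing for exactly 0.0 or 1.0 means calling code should keep working if the output later becomes a graded confidence value.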
We have provided an example bash implementation in classify.sh. Note that it normalizes whitespace with a call to sed. It's a good idea to strip HTML and normalize whitespace before classifying, to avoid false positives.
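If you are calling the filter from something other than bash, the same cleanup can be approximated as in the Python sketch below. It is not a port of classify.sh. The function name `normalize` is hypothetical, and the regular-expression tag stripping is a crude assumption that will not handle every piece of HTML (a real HTML parser is more robust).

```python
import html
import re

_TAG = re.compile(r"<[^>]+>")  # crude match for anything that looks like a tag
_WHITESPACE = re.compile(r"\s+")  # any run of spaces, tabs, or newlines


def normalize(text: str) -> str:
    """Strip HTML tags, decode entities, and collapse whitespace to single spaces."""
    without_tags = _TAG.sub(" ", text)
    decoded = html.unescape(without_tags)
    return _WHITESPACE.sub(" ", decoded).strip()
```

Tags are replaced with a space rather than deleted so that words separated only by markup, such as `one<br>two`, do not run together.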