Step | Parameter | Values |
---|---|---|
sklearn.feature_extraction.text.HashingVectorizer | n_features | 1024, 8192, 65,536 |
ngram_range | [4,4], [8, 8], [16, 16], [4,16] | |
sklearn.feature_extraction.text.TfidfTransformer | norm | l1, l2, None |
usd_idf | True, False | |
sklearn.naive_bayes.MultinomialNB | alpha | 0.001, 0.01, 0.1 |
class_prior | None, array of class weights | |
post processing | confidence | 0, 0.2, 0.4, 0.6, 0.8 |