Commit Graph

5 Commits

Author SHA1 Message Date
KenorFR
cefb4fc7a7 Ngram word (#370)
* Add NGramWordTokenizer

* Update doc
Add test
Check coding standards
2019-04-05 21:23:09 +02:00
Pol Dellaiera
02dab41830 Provide a new NGramTokenizer with minGram and maxGram support (#350)
* Issue #349: Provide a new NGramTokenizer.

* Issue #349: Add tests.

* Fixes from code review.

* Implement NGramTokenizer with min and max gram support

* Add missing tests for ngram

* Add info about NGramTokenizer to docs and readme

* Add performance test for tokenization
2019-02-15 17:31:10 +01:00
David Monllaó
f7537c049a documentation add tokenizer->fit required to build the dictionary (#155) 2017-11-16 21:40:11 +01:00
Arkadiusz Kondas
7c0767c15a create docs for tf-idf transformer 2016-07-12 00:21:34 +02:00
Arkadiusz Kondas
365a9baeca update docs 2016-05-07 23:53:42 +02:00