Marcin Michalski
db82afa263
Update to phpunit 8 and bump min php to 7.2 ( #367 )
...
* Update to phpunit 8
* Require at least PHP 7.2
2019-04-10 20:42:59 +02:00
KenorFR
cefb4fc7a7
Ngram word ( #370 )
...
* Add NGramWordTokenizer
* Update doc
Add test
Check coding standards
2019-04-05 21:23:09 +02:00
Arkadiusz Kondas
dbbce0e066
Implement LabelEncoder ( #369 )
2019-04-02 11:07:00 +02:00
Marcin Michalski
d3888efa7a
Update phpstan & easy coding standard ( #366 )
2019-03-25 14:55:14 +01:00
Arkadiusz Kondas
5e02b893e9
Fix FilesDataset arrays and TokenCountVectorizer numeric token ( #363 )
2019-03-20 23:22:45 +01:00
Pol Dellaiera
02dab41830
Provide a new NGramTokenizer with minGram and maxGram support ( #350 )
...
* Issue #349 : Provide a new NGramTokenizer.
* Issue #349 : Add tests.
* Fixes from code review.
* Implement NGramTokenizer with min and max gram support
* Add missing tests for ngram
* Add info about NGramTokenizer to docs and readme
* Add performance test for tokenization
2019-02-15 17:31:10 +01:00
Arkadiusz Kondas
d30c212f3b
Check if feature exist when predict target in NaiveBayes ( #327 )
...
* Check if feature exist when predict target in NaiveBayes
* Fix typo
2018-11-07 09:39:51 +01:00
Arkadiusz Kondas
18c36b971f
Mnist Dataset ( #326 )
...
* Implement MnistDataset
* Add MNIST dataset documentation
2018-11-07 08:02:56 +01:00
Marcin Michalski
53c5a6b9e5
Update phpstan to 0.10.5 ( #320 )
2018-10-28 07:44:52 +01:00
Marcin Michalski
0beb407b16
Update easy coding standard to ^5.1 ( #317 )
2018-10-16 21:42:06 +02:00
Marcin Michalski
d29c5906df
Return labels in MultilayerPerceptron output ( #315 )
2018-10-15 19:47:42 +02:00
Arkadiusz Kondas
e255369636
Fix Imputer docs and check if train data was set ( #314 )
...
* Update docs for Imputer class
* Throw exception when trying to transform imputer without train data
* Update changelog
2018-10-10 21:36:18 +02:00
Arkadiusz Kondas
8fdb3d11fc
Make SVM non-locale aware ( #288 )
2018-07-04 23:42:22 +02:00
Tomáš Votruba
46fa2c2cca
Update to EasyCodingStandard 4 ( #273 )
...
* update ECS config to v4
* composer: require Symplify 4
* apply coding-standard: use constants over functions, protected setUp() in tests, array indentation
* ecs: add false positive case
* composer: update lock
* bump to ECS 4.4
* update composer.lock
* shorten ECS config name
* ecs: ignore assignments in while()
* fix cs
2018-06-15 07:57:45 +02:00
Arkadiusz Kondas
a36fe086d3
Add performance test for LeastSquares ( #263 )
...
* Install phpbench 🚀
* Add first benchmark for LeastSquares
* Update README and CONTRIBUTING guide
* Fix typo
2018-03-10 21:48:16 +01:00
Ivana Momcilovic
af2d732194
KMeans associative clustering ( #262 )
...
* KMeans associative clustering added
* fix travis error
* KMeans will return provided keys as point label if they are provided
* fix travis
* fix travis
2018-03-08 22:27:16 +01:00
Marcin Michalski
e156076539
Add DecisionTreeLeaf.getNodeImpurityDecrease test ( #261 )
2018-03-07 23:16:25 +01:00
Yuji Uchiyama
66ca874062
Throw proper exception ( #259 )
...
* Throw proper exception
* Fix coding style
2018-03-06 23:26:36 +01:00
Marcin Michalski
a40c50b48b
Add Optimizer tests and remove initialTheta ( #252 )
...
* Add Optimizer tests
* Remove Optimizer.initialTheta and rename Optimizer.setInitialTheta to setTheta
2018-03-04 22:44:22 +01:00
Marcin Michalski
55749c7c92
Add Cluster tests ( #254 )
2018-03-04 17:06:46 +01:00
Marcin Michalski
33efab20a5
Add LUDecomposition triangular factor tests ( #253 )
2018-03-04 17:05:25 +01:00
Marcin Michalski
941d240ab6
Add RandomForest exception tests ( #251 )
2018-03-04 17:02:36 +01:00
Arkadiusz Kondas
8976047cbc
Add removeColumns function to ArrayDataset ( #249 )
...
* Add removeColumns function to ArrayDataset
* Add removeColumns to docs
* Fix cs
2018-03-03 16:04:21 +01:00
Yuji Uchiyama
af9ccfe722
Add tests for LogisticRegression ( #248 )
2018-03-03 11:19:58 +01:00
Arkadiusz Kondas
d188790276
Add MLP partial train test after restore from file ( #243 )
2018-02-26 00:02:04 +01:00
Marcin Michalski
9e375ca544
Ensure DataTransformer::testSet samples array is not empty ( #204 )
2018-02-25 22:56:36 +01:00
Yuji Uchiyama
4562f1dfc9
Add a SvmDataset class for SVM-Light (or LibSVM) format files ( #237 )
...
* Add data loader for svm format
* Add tests for error cases
* Set proper exception messages
* Add documents
* Add error checking code for invalid column format
* Add missing documents
2018-02-24 11:17:35 +01:00
Arkadiusz Kondas
a96f03e8dd
Fix Optimizer initial theta randomization ( #239 )
...
* Fix Optimizer initial theta randomization
* Add more tests for LUDecomposition and FuzzyCMeans
2018-02-23 23:05:46 +01:00
Yuji Uchiyama
add00c6108
Fix apriori keys ( #238 )
...
* Add test to check keys of rules
* Reindex after array_filter/array_unique in Apriori
2018-02-22 17:02:55 +01:00
Arkadiusz Kondas
0a15561352
Fix KMeans and EigenvalueDecomposition ( #235 )
...
* Fix kmeans cluster and eigenvalue decomposition
* Fix kmeans space
* Fix code style
2018-02-18 00:09:24 +01:00
Arkadiusz Kondas
16dc16b0d9
Add phpstan strict rules ( #233 )
...
* Add phpstan strict rules
* Fix travis coveralls
* Add phpstan-phpunit strict rules
* Fix eigen decomposition test name and phpstan ingored error
2018-02-16 07:25:24 +01:00
Marcin Michalski
6ac61a860c
Fix 'toSmall' typo ( #234 )
2018-02-15 18:14:06 +01:00
Arkadiusz Kondas
451f84c2e6
Add SelectKBest docs
2018-02-14 20:34:53 +01:00
Arkadiusz Kondas
b4b190de7f
Fix pipeline transformers
2018-02-14 20:34:53 +01:00
Arkadiusz Kondas
998879b6fc
Switch SelectKBest constructor parameters
2018-02-14 20:34:53 +01:00
Arkadiusz Kondas
9e5b3a0c69
Implement first regression scoring function UnivariateLinearRegression
2018-02-14 20:34:53 +01:00
Arkadiusz Kondas
fbf84ca95f
Implement SelectKBest algo for feature selection
2018-02-14 20:34:53 +01:00
Arkadiusz Kondas
52c9ba8291
Fix: phpunit include tests path ( #230 )
...
* Fix phpunit include path
* Add tests for Covariance
2018-02-11 18:17:50 +01:00
Yuji Uchiyama
53f8a89556
Fix support of a rule in Apriori ( #229 )
...
* Clean up test code
* Add test to check support and confidence (failed due to a bug)
* Fix support value of rules
2018-02-11 12:42:46 +01:00
Arkadiusz Kondas
3ba35918a3
Implement VarianceThreshold - simple baseline approach to feature selection. ( #228 )
...
* Add sum of squares deviations
* Calculate population variance
* Add VarianceThreshold - feature selection transformer
* Add docs about VarianceThreshold
* Add missing code for pipeline usage
2018-02-10 18:07:09 +01:00
Andreas Möller
4b5d57fd6f
Enhancement: Flatten directory structure ( #220 )
2018-02-10 12:08:58 +01:00
Yuji Uchiyama
71cc633c8e
Fix apriori generates an empty array as a part of the frequent item sets ( #224 )
2018-02-07 10:02:38 +01:00
Yuji Uchiyama
ec091b5ea3
Support probability estimation in SVC ( #218 )
...
* Add test for svm model with probability estimation
* Extract buildPredictCommand method
* Fix test to use PHP_EOL
* Add predictProbability method (not completed)
* Add test for DataTransformer::predictions
* Fix SVM to use PHP_EOL
* Support probability estimation in SVM
* Add documentation
* Add InvalidOperationException class
* Throw InvalidOperationException before executing libsvm if probability estimation is not supported
2018-02-06 20:39:25 +01:00
Jonathan Baldie
c32bf3fe2b
Configure an Activation Function per hidden layer ( #208 )
...
* ability to specify per-layer activation function
* some tests for new addition to layer
* appease style CI whitespace issue
* more flexible addition of layers, and developer can pass Layer object in manually
* new test for layer object in mlp constructor
* documentation for added MLP functionality
2018-02-01 23:15:36 +01:00
Jonathan Baldie
e318921076
Fix string representation of integer labels issue in NaiveBayes ( #206 )
...
* Update NaiveBayes.php
This fixes an issue using string labels that are string representations of integers, e.g. "1998" getting cast to (int)1998.
* Update NaiveBayes.php
fixes superfluous whitespace error
* added tests for naive bayes with numeric labels
* added array_unique
* nested array_flips for speed
* nested the array flips inside the array map
* to appear style CI test
2018-01-31 21:44:44 +01:00
Yuji Uchiyama
554c86af68
Choose averaging method in classification report ( #205 )
...
* Fix testcases of ClassificationReport
* Fix averaging method in ClassificationReport
* Fix divided by zero if labels are empty
* Fix calculation of f1score
* Add averaging methods (not completed)
* Implement weighted average method
* Extract counts to properties
* Fix default to macro average
* Implement micro average method
* Fix style
* Update docs
* Fix styles
2018-01-29 18:06:21 +01:00
Marcin Michalski
ba7114a3f7
Add libsvm exception tests ( #202 )
2018-01-26 22:07:22 +01:00
Arkadiusz Kondas
7435bece34
Add test for Pipeline save and restore with ModelManager ( #191 )
2018-01-12 10:54:20 +01:00
Yuji Uchiyama
d953ef6bfc
Fix the implementation of conjugate gradient method ( #184 )
...
* Add unit tests for optimizers
* Fix ConjugateGradient
* Fix coding style
* Fix namespace
2018-01-12 10:53:43 +01:00
David Monllaó
e83f7b95d5
Fix activation functions support ( #163 )
...
- Backpropagation using the neuron activation functions derivative
- instead of hardcoded sigmoid derivative
- Added missing activation functions derivatives
- Sigmoid forced for the output layer
- Updated ThresholdedReLU default threshold to 0 (acts as a ReLU)
- Unit tests for derivatives
- Unit tests for classifiers using different activation functions
- Added missing docs
2018-01-09 11:09:59 +01:00