Commit Graph

280 Commits

Author SHA1 Message Date
Meirza c91a5b730e Fixed ${var} string interpolation deprecations in PHP 8.1 2023-01-26 06:06:46 +07:00
Mirko Haaser 71bed5008c Use stopword list from https://github.com/stopwords-iso/stopwords-de 2022-10-19 09:37:13 +02:00
Mirko Haaser c4d79f7fbd Add German Stopwords 2022-10-19 09:34:20 +02:00
Tima df72b129d2
Added Russian stopwords (#425)
* Added Russian stopwords

* Added Russian stopwords
2020-03-04 09:01:54 +01:00
Marcin Michalski 2ee0d373eb
Fix static analysis errors from phpstan upgrade to 0.12 (#426) 2020-03-03 18:52:29 +01:00
Marcin Michalski 0d59441329 Update phpunit, phpbench, easy coding standard (#415) 2019-11-08 15:28:42 +01:00
Andrew Feeney dcf9206327 Fix grammatical error, and make wording consistent (#410)
- "File can't be open" should be "File can't be opened"
- Use cannot instead of "can not" or "can't" for consistency
2019-10-28 07:49:47 +01:00
Arkadiusz Kondas 1a856c9099
Fix division by zero in ANOVA for small size dataset (#391) 2019-06-22 22:54:47 +02:00
Arkadiusz Kondas 4590d5cc32
Implement OneHotEncoder (#384) 2019-05-15 08:00:46 +02:00
Arkadiusz Kondas 3baf1520e3
Update dependencies and phpstan (#383) 2019-05-14 22:43:08 +02:00
Arkadiusz Kondas b500f0b648
Implement FeatureUnion 🚀 (#382) 2019-05-14 21:26:25 +02:00
Arkadiusz Kondas ff118eb2ba
Implement LambdaTransformer (#381) 2019-05-13 22:10:34 +02:00
Arkadiusz Kondas 417174d143
Implement ColumnFilter preprocessor (#378) 2019-05-12 22:41:31 +02:00
Arkadiusz Kondas 717f236ca9
Implement NumberConverter (#377) 2019-05-12 22:25:17 +02:00
Arkadiusz Kondas 1e1d794655
Fix DecisionTreeRegressor for big dataset (#376) 2019-05-12 21:27:21 +02:00
Arkadiusz Kondas 91812f4c4a
Implement DecisionTreeRegressor (#375) 2019-05-12 20:04:39 +02:00
Arkadiusz Kondas 8544cf7083
Implement regression metrics (#373) 2019-05-10 23:10:05 +02:00
Arkadiusz Kondas f6aa1a59b0
Remove phpunit readAttributes deprecated methods (#372) 2019-04-12 07:49:30 +02:00
KenorFR cefb4fc7a7 Ngram word (#370)
* Add NGramWordTokenizer

* Update doc
Add test
Check coding standards
2019-04-05 21:23:09 +02:00
Arkadiusz Kondas dbbce0e066
Implement LabelEncoder (#369) 2019-04-02 11:07:00 +02:00
Marcin Michalski d3888efa7a Update phpstan & easy coding standard (#366) 2019-03-25 14:55:14 +01:00
Arkadiusz Kondas 5e02b893e9
Fix FilesDataset arrays and TokenCountVectorizer numeric token (#363) 2019-03-20 23:22:45 +01:00
Pol Dellaiera 02dab41830 Provide a new NGramTokenizer with minGram and maxGram support (#350)
* Issue #349: Provide a new NGramTokenizer.

* Issue #349: Add tests.

* Fixes from code review.

* Implement NGramTokenizer with min and max gram support

* Add missing tests for ngram

* Add info about NGramTokenizer to docs and readme

* Add performance test for tokenization
2019-02-15 17:31:10 +01:00
Pol Dellaiera 40f1ca06aa Issue #351: Replace pow() and sqrt() with double stars notation. (#352) 2019-02-08 22:24:02 +01:00
Pol Dellaiera 4b837fae8e Improve distance performance and reduce duplication in distance classes. (#348)
* Issue #347: Reduce duplicated code.

* Issue #347: Replace array_* with regular loops for better perfomance.
2019-02-06 08:00:17 +01:00
Arkadiusz Kondas 1934d8af81
Update dependencies and fix code styles (#334) 2018-12-12 21:56:44 +01:00
Arkadiusz Kondas d30c212f3b
Check if feature exist when predict target in NaiveBayes (#327)
* Check if feature exist when predict target in NaiveBayes

* Fix typo
2018-11-07 09:39:51 +01:00
Arkadiusz Kondas 18c36b971f
Mnist Dataset (#326)
* Implement MnistDataset

* Add MNIST dataset documentation
2018-11-07 08:02:56 +01:00
Marcin Michalski f2dd40cb6f Properly check cluster points label (#323) 2018-10-29 20:04:06 +01:00
Marcin Michalski 53c5a6b9e5 Update phpstan to 0.10.5 (#320) 2018-10-28 07:44:52 +01:00
Marcin Michalski 0beb407b16 Update easy coding standard to ^5.1 (#317) 2018-10-16 21:42:06 +02:00
Marcin Michalski d29c5906df Return labels in MultilayerPerceptron output (#315) 2018-10-15 19:47:42 +02:00
Arkadiusz Kondas e255369636
Fix Imputer docs and check if train data was set (#314)
* Update docs for Imputer class

* Throw exception when trying to transform imputer without train data

* Update changelog
2018-10-10 21:36:18 +02:00
Arkadiusz Kondas 8fdb3d11fc
Make SVM non-locale aware (#288) 2018-07-04 23:42:22 +02:00
Yuji Uchiyama ab22cc5b68 Change the default kernel type in SVC to Kernel::RBF (#267)
* Change the default kernel type in SVC to Kernel::RBF

* Update CHANGELOG.md
2018-06-20 23:28:11 +02:00
Tomáš Votruba 46fa2c2cca Update to EasyCodingStandard 4 (#273)
* update ECS config to v4

* composer: require Symplify 4

* apply coding-standard: use constants over functions, protected setUp() in tests, array indentation

* ecs: add false positive case

* composer: update lock

* bump to ECS 4.4

* update composer.lock

* shorten ECS config name

* ecs: ignore assignments in while()

* fix cs
2018-06-15 07:57:45 +02:00
Ivana Momcilovic af2d732194 KMeans associative clustering (#262)
* KMeans associative clustering added

* fix travis error

* KMeans will return provided keys as point label if they are provided

* fix travis

* fix travis
2018-03-08 22:27:16 +01:00
Mustafa Karabulut 0d80c78c57 Micro optimization for matrix multiplication (#255)
* Micro optimization for matrix multiplication

* code cs fix

* added a comment block for the change
2018-03-08 19:19:09 +01:00
Yuji Uchiyama 66ca874062 Throw proper exception (#259)
* Throw proper exception

* Fix coding style
2018-03-06 23:26:36 +01:00
Marcin Michalski a40c50b48b Add Optimizer tests and remove initialTheta (#252)
* Add  Optimizer tests

* Remove Optimizer.initialTheta and rename Optimizer.setInitialTheta to setTheta
2018-03-04 22:44:22 +01:00
Marcin Michalski 55749c7c92 Add Cluster tests (#254) 2018-03-04 17:06:46 +01:00
Marcin Michalski 941d240ab6 Add RandomForest exception tests (#251) 2018-03-04 17:02:36 +01:00
Arkadiusz Kondas 8976047cbc
Add removeColumns function to ArrayDataset (#249)
* Add removeColumns function to ArrayDataset

* Add removeColumns to docs

* Fix cs
2018-03-03 16:04:21 +01:00
Yuji Uchiyama cbd9f5fde1 Inline static constructors of exceptions (#250) 2018-03-03 16:03:53 +01:00
Marcin Michalski 9e375ca544 Ensure DataTransformer::testSet samples array is not empty (#204) 2018-02-25 22:56:36 +01:00
Yuji Uchiyama 4562f1dfc9 Add a SvmDataset class for SVM-Light (or LibSVM) format files (#237)
* Add data loader for svm format

* Add tests for error cases

* Set proper exception messages

* Add documents

* Add error checking code for invalid column format

* Add missing documents
2018-02-24 11:17:35 +01:00
Arkadiusz Kondas a96f03e8dd
Fix Optimizer initial theta randomization (#239)
* Fix Optimizer initial theta randomization

* Add more tests for LUDecomposition and FuzzyCMeans
2018-02-23 23:05:46 +01:00
Yuji Uchiyama add00c6108 Fix apriori keys (#238)
* Add test to check keys of rules

* Reindex after array_filter/array_unique in Apriori
2018-02-22 17:02:55 +01:00
Arkadiusz Kondas 0a15561352
Fix KMeans and EigenvalueDecomposition (#235)
* Fix kmeans cluster and eigenvalue decomposition

* Fix kmeans space

* Fix code style
2018-02-18 00:09:24 +01:00
Arkadiusz Kondas 16dc16b0d9
Add phpstan strict rules (#233)
* Add phpstan strict rules

* Fix travis coveralls

* Add phpstan-phpunit strict rules

* Fix eigen decomposition test name and phpstan ingored error
2018-02-16 07:25:24 +01:00