Commit Graph

111 Commits

Author SHA1 Message Date
Arkadiusz Kondas
bca2196b57 Prevent Division by zero error in classification report 2016-11-20 22:49:26 +01:00
Arkadiusz Kondas
349ea16f01 Rename demo datasets and add Dataset suffix 2016-09-30 14:02:08 +02:00
Arkadiusz Kondas
84af842f04 Fix division by zero in ClassificationReport #21 2016-09-27 20:07:21 +02:00
Arkadiusz Kondas
1ce6bb544b Run php-cs-fixer 2016-09-21 21:51:19 +02:00
Arkadiusz Kondas
8072ddb2bf Update phpunit to 5.5 2016-09-21 21:46:16 +02:00
Patrick Florek
fa87eca375 Add new class Set for simple Set-theoretical operations
### Features

* Works only with primitive types int, float, string
* Implements set-theoretic operations union, intersection, complement
* Modifies set by adding, removing elements
* Implements \IteratorAggregate for use in loops

### Implementation details

Based on array functions:
* array_diff,
* array_merge,
* array_intersect,
* array_unique,
* array_values,
* sort.

### Drawbacks

* **Does not work with objects.**
* Power set and Cartesian product return an array of Set
2016-09-10 13:24:43 +02:00
Patrick Florek
90038befa9 Apply comments / coding styles
* Remove user-specific gitignore
* Add return type hints
* Avoid global namespace in docs
* Rename rules -> getRules
* Split up rule generation

Todo:
* Move set theory out to math
* Extract rule generation
2016-09-02 00:26:01 +02:00
Patrick Florek
c8bd8db601 # Association rule learning - Apriori algorithm
* Generating frequent k-length item sets
* Generating rules based on frequent item sets
* Algorithm has exponential complexity, be aware of it
* Apriori algorithm is split into apriori and candidates method
* Second step rule generation is implemented by rules method
* Internal methods are invoked for fine-grained unit tests
* Wikipedia's train samples and an alternative are provided for test cases
* Small documentation for public interface is also shipped
2016-08-23 15:44:53 +02:00
Arkadiusz Kondas
6421a2ba41 Develop to master (#18)
* Fix Backpropagation test with explicit random generator seed

* remove custom seed - not working :(

* Updated links in readme
2016-08-21 14:03:20 +02:00
Arkadiusz Kondas
c506a84164 refactor Backpropagation methods and simplify things 2016-08-10 23:03:02 +02:00
Arkadiusz Kondas
66d029e94f implement and test Backpropagation training 2016-08-10 22:43:47 +02:00
Arkadiusz Kondas
e5d39ee18a implement and test multilayer perceptron methods 2016-08-09 13:27:48 +02:00
Arkadiusz Kondas
64859f263f test abstraction from LayeredNetwork 2016-08-07 23:41:08 +02:00
Arkadiusz Kondas
95b29d40b1 add Layer, Input and Bias for neural network 2016-08-05 10:20:31 +02:00
Arkadiusz Kondas
7062ee29e1 add Neuron and Synapse classes 2016-08-02 20:30:20 +02:00
Arkadiusz Kondas
637fd613b8 implement activation function for neural network 2016-08-02 13:07:47 +02:00
Pablo Joán Iglesias
38deaaeb2e testScalarProduct check for non numeric values (#13)
* testScalarProduct check for non numeric values

test for non numeric values.

* updating pr #13

using global namespace for stdClass
2016-07-26 08:13:52 +02:00
Arkadiusz Kondas
403824d23b test exception on kmeans 2016-07-24 14:01:17 +02:00
Arkadiusz Kondas
448eaafd78 remove unused exception 2016-07-24 13:52:52 +02:00
Arkadiusz Kondas
2a76cbb402 add .coverage to git ignore 2016-07-24 13:42:50 +02:00
Arkadiusz Kondas
093e8fc89c add more tests for CReport 2016-07-19 22:01:39 +02:00
Arkadiusz Kondas
074dcf7470 php-cs-fixer 2016-07-19 21:59:23 +02:00
Arkadiusz Kondas
9665457159 implement ClassificationReport class 2016-07-19 21:58:59 +02:00
Arkadiusz Kondas
7abee3061a docs for files dataset and php-cs-fixer 2016-07-16 23:56:52 +02:00
Arkadiusz Kondas
e0b560f31d create FilesDataset class 2016-07-16 23:29:40 +02:00
Arkadiusz Kondas
9f140d5b6f fix problem with token count vectorizer array order 2016-07-14 13:25:11 +02:00
Arkadiusz Kondas
7c0767c15a create docs for tf-idf transformer 2016-07-12 00:21:34 +02:00
Arkadiusz Kondas
f04cc04da5 create StratifiedRandomSplit for cross validation 2016-07-10 14:13:35 +02:00
Arkadiusz Kondas
6c7416a9c4 implement ConfusionMatrix metric 2016-07-07 00:29:58 +02:00
Arkadiusz Kondas
cce68997a1 implement StopWords in TokenCountVectorizer 2016-07-06 23:22:29 +02:00
Arkadiusz Kondas
601ff884e8 php-cs-fixer 2016-06-17 00:34:15 +02:00
Arkadiusz Kondas
424519cd83 implement fit for TokenCountVectorizer 2016-06-17 00:33:48 +02:00
Arkadiusz Kondas
be7423350f add more tests for fit method in preprocessors 2016-06-17 00:23:27 +02:00
Arkadiusz Kondas
3e9e70810d implement fit on Imputer 2016-06-17 00:16:49 +02:00
Arkadiusz Kondas
557f344018 add fit method for Transformer interface 2016-06-17 00:08:10 +02:00
Arkadiusz Kondas
4554011899 rename labels to targets for Dataset 2016-06-16 23:56:15 +02:00
Arkadiusz Kondas
7f4a0b243f transform samples for prediction in pipeline 2016-06-16 16:10:46 +02:00
Arkadiusz Kondas
26f2cbabc4 fix Pipeline transformation 2016-06-16 10:26:29 +02:00
Arkadiusz Kondas
d21a401365 implement Transformer interface on preprocessing classes 2016-06-16 10:03:57 +02:00
Arkadiusz Kondas
7c5e79d2c6 change transformer behavior to reference 2016-06-16 10:01:40 +02:00
Arkadiusz Kondas
15519ba122 simple pipeline test 2016-06-16 09:58:17 +02:00
Arkadiusz Kondas
cc50d2c9b1 implement TfIdf transformation 2016-06-15 16:04:09 +02:00
Arkadiusz Kondas
da6d94cc46 create stop words class 2016-06-14 11:54:04 +02:00
Arkadiusz Kondas
2f51716388 change token count vectorizer to return full token counts 2016-06-14 09:58:11 +02:00
Arkadiusz Kondas
23eff0044a add test with dataset example 2016-05-31 20:01:54 +02:00
Arkadiusz Kondas
fb04b57853 implement data Normalizer with L1 and L2 norm 2016-05-08 20:35:01 +02:00
Arkadiusz Kondas
65cdfe64b2 implement Median and MostFrequent strategy for imputer 2016-05-08 19:33:39 +02:00
Arkadiusz Kondas
a761d0e8f2 mode (dominant) from numbers 2016-05-08 19:23:54 +02:00
Arkadiusz Kondas
ed1e07e803 median function in statistic 2016-05-08 19:12:39 +02:00
Arkadiusz Kondas
b0ab236ab9 create imputer tool for completing missing values 2016-05-08 14:47:17 +02:00