BELMKN : Bayesian Extreme Learning Machine Kohonen Network

Unsupervised Extreme Learning Machine (ELM) is a non-iterative algorithm used for feature extraction. Here it is applied to the IRIS dataset for non-linear feature extraction and cluster prediction, and clustering is then carried out using k-means, Self-Organizing Maps (Kohonen Network) and the EM algorithm.

Objective

To perform non-linear feature learning using the Unsupervised Extreme Learning Machine, predict the number of clusters in the dataset using the Bayesian Information Criterion (BIC), and finally cluster the data using k-means, Self-Organizing Maps (Kohonen Network) and the EM algorithm.

Modules

  1. Unsupervised Extreme Learning Machine : In this module, feature extraction is performed using the Unsupervised Extreme Learning Machine. It is a non-iterative algorithm with a single hidden layer: the weights between the input layer and the hidden layer are randomly initialized, and the weights between the hidden layer and the output layer are computed in closed form from the objective function, so the solution is a global optimum of that objective rather than a local minimum reached by iterative training (see the sketch after this list).

  2. Bayesian Information Criterion : The Bayesian Information Criterion (BIC) is a statistical criterion used to estimate the number of clusters in the dataset. It uses the Expectation-Maximization (EM) algorithm to fit candidate models and scores each candidate number of clusters. This module automates cluster prediction, since k-means requires the number of clusters to be specified in advance (see the sketch after this list).
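A minimal sketch of these two modules is given below. It is an illustration, not the repository's code: the embedding follows the general unsupervised ELM recipe of Huang et al. [2] (random input weights, a graph-Laplacian-regularised generalised eigenproblem for the output weights), and names such as `us_elm_embed`, `n_hidden` and `lam` are chosen here for the sketch.

```python
import numpy as np
from scipy.linalg import eigh
from scipy.spatial.distance import cdist

def us_elm_embed(X, n_hidden=120, n_components=3, lam=0.1, n_neighbors=10, seed=0):
    """Sketch of an unsupervised ELM embedding (after Huang et al. [2])."""
    rng = np.random.default_rng(seed)
    n, d = X.shape

    # 1) Random, untrained input-to-hidden weights (the non-iterative part).
    W = rng.uniform(-1.0, 1.0, size=(d, n_hidden))
    b = rng.uniform(-1.0, 1.0, size=n_hidden)
    H = 1.0 / (1.0 + np.exp(-(X @ W + b)))        # hidden-layer activations

    # 2) k-NN graph Laplacian L = D - S built on the input data.
    D2 = cdist(X, X, "sqeuclidean")
    S = np.exp(-D2 / np.median(D2))               # Gaussian similarities
    idx = np.argsort(-S, axis=1)[:, 1:n_neighbors + 1]
    mask = np.zeros_like(S, dtype=bool)
    mask[np.repeat(np.arange(n), n_neighbors), idx.ravel()] = True
    S = np.where(mask | mask.T, S, 0.0)           # keep only k-NN links
    L = np.diag(S.sum(axis=1)) - S

    # 3) Hidden-to-output weights: the smallest generalised eigenvectors of
    #    (I + lam * H^T L H) v = gamma * (H^T H) v -- a closed-form solution,
    #    hence a global optimum of the objective rather than a local one.
    A = np.eye(n_hidden) + lam * H.T @ L @ H
    B = H.T @ H + 1e-6 * np.eye(n_hidden)
    _, V = eigh(A, B)                             # eigenvalues in ascending order
    beta = V[:, 1:n_components + 1]               # drop the trivial first vector
    E = H @ beta                                  # low-dimensional embedding
    return E / np.linalg.norm(E, axis=0)          # column-normalise
```

The BIC module can then be sketched with scikit-learn's EM-based `GaussianMixture`, fitting one mixture per candidate number of clusters on the embedding:

```python
from sklearn.datasets import load_iris
from sklearn.mixture import GaussianMixture

X = load_iris().data
E = us_elm_embed(X)

# One EM fit per candidate k; BIC scores the fitted mixtures.
bic = {k: GaussianMixture(n_components=k, random_state=0).fit(E).bic(E)
       for k in range(1, 11)}
k_best = min(bic, key=bic.get)   # or read the elbow off the BIC-vs-k curve
print("predicted number of clusters:", k_best)
```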

Techniques used for clustering the features learned by the Unsupervised ELM

  1. K-means Clustering : A hard clustering method with linear cluster boundaries, whose inputs are the features learned by the Unsupervised ELM and the number of clusters predicted by BIC. The confusion matrix and clustering accuracy are then displayed (see the combined sketch after this list).

  2. Self Organizing Maps / Kohonen Network : A clustering technique developed by Kohonen, visualized as a neural network with only two layers: the input layer and the output layer. The number of input neurons equals the number of features in the dataset, and the number of output neurons equals the desired number of clusters. The weights between the layers are updated using a neighbourhood function and a minimum-distance (best matching unit) criterion; this implementation uses a Gaussian neighbourhood.

  3. Clustering using Expectation Maximization (EM Algorithm) : EM clustering is a soft clustering technique, whereas the two methods above are hard clustering methods. Instead of assigning each data point to exactly one cluster, a probability (likelihood) of the point belonging to each cluster is computed, and the point is assigned to the cluster for which this likelihood is highest.
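A combined sketch of the three back-ends, applied to the US-ELM embedding `E` from the sketch in the Modules section, is shown below. K-means and EM use scikit-learn; the SOM is a deliberately small NumPy implementation with a Gaussian neighbourhood, and its details (one output neuron per cluster, exponentially decaying learning rate) are assumptions of this sketch rather than the repository's exact configuration.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.mixture import GaussianMixture

def som_cluster(E, n_clusters=3, n_iter=2000, lr0=0.5, sigma0=1.0, seed=0):
    """Tiny 1-D Kohonen map: one output neuron per cluster,
    Gaussian neighbourhood that shrinks over time."""
    rng = np.random.default_rng(seed)
    W = E[rng.choice(len(E), n_clusters, replace=False)].copy()   # init from data
    for t in range(n_iter):
        x = E[rng.integers(len(E))]                     # random training sample
        lr = lr0 * np.exp(-t / n_iter)                  # decaying learning rate
        sigma = sigma0 * np.exp(-t / n_iter)            # decaying neighbourhood width
        bmu = np.argmin(np.linalg.norm(W - x, axis=1))  # best matching unit
        grid_dist = np.abs(np.arange(n_clusters) - bmu)
        h = np.exp(-(grid_dist ** 2) / (2 * sigma ** 2))  # Gaussian neighbourhood
        W += lr * h[:, None] * (x - W)
    return np.argmin(np.linalg.norm(E[:, None, :] - W[None], axis=2), axis=1)

# E: the US-ELM embedding from the sketch in the Modules section.
# Hard clustering: each point gets exactly one label.
labels_km = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(E)
labels_som = som_cluster(E, n_clusters=3)

# Soft clustering: EM yields a per-cluster probability; take the argmax.
gmm = GaussianMixture(n_components=3, random_state=0).fit(E)
labels_em = gmm.predict_proba(E).argmax(axis=1)
```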

Datasets used for Analysis

Synthetic Datasets :

  1. Four Class Linearly Separable Dataset
  2. Flame Shaped Dataset
  3. Face Shaped Dataset

NOTE : Synthetic.py contains the Python code for generating the synthetic datasets (a rough illustration follows below).
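The exact generators live in Synthetic.py; purely as an illustration, a four-class linearly separable dataset of the kind listed above could be produced with scikit-learn (the parameters here are illustrative, not the repository's):

```python
from sklearn.datasets import make_blobs

# Four well-separated Gaussian blobs give a linearly separable four-class dataset.
X, y = make_blobs(n_samples=400, centers=4, cluster_std=0.6, random_state=0)
```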

UCI Machine Learning Repository Datasets

1. Cancer
2. Dermatology
3. E.Coli
4. Glass
5. Heart
6. Horse
7. Iris
8. Thyroid
9. Vehicle
10. Wine

NOTE : The CSV files for the 10 datasets have been uploaded after some preprocessing.

Results Screenshots

  1. Clustering Results for Synthetic Datasets :

i. For the four-class linearly separable dataset, the results for all the methods remain the same.

[screenshot: syn1]

ii. Flame shaped Dataset Clustering Results

[screenshot: syn2]

iii. Face Shaped Dataset Result

[screenshot: syn3]

  2. Clustering Accuracy for IRIS Dataset : The clustering accuracy achieved for the IRIS dataset is 96.67%, the highest obtained. The number of hidden neurons was set to 120.

[screenshot: res1]

  3. Cluster Prediction using BIC : The number of clusters predicted by BIC is 3, which matches the actual number of classes in the Iris dataset. The number of clusters is determined by applying the elbow criterion to the graph of BIC value versus number of clusters (see the sketch below).

[screenshot: res2]
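The repository does not spell out how the elbow is read; one common reading, sketched here with the `bic` dictionary from the earlier BIC sketch, is to plot BIC against the number of clusters and pick the point of maximum curvature (largest second difference):

```python
import numpy as np
import matplotlib.pyplot as plt

ks = np.array(sorted(bic))                 # bic: {k: BIC} from the earlier sketch
vals = np.array([bic[k] for k in ks])

plt.plot(ks, vals, marker="o")
plt.xlabel("number of clusters")
plt.ylabel("BIC value")
plt.show()

# Elbow heuristic: the k with the largest second difference of the BIC curve.
elbow = ks[1:-1][np.argmax(np.diff(vals, 2))]
print("elbow at k =", elbow)
```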

References

[1] Senthilnath, J.; Simha C, S.; G, N.; Thapa, M.; M, I. BELMKN: Bayesian Extreme Learning Machines Kohonen Network. Algorithms 2018, 11, 56.
Link : http://www.mdpi.com/1999-4893/11/5/56
[2] Huang, G.; Song, S.; Gupta, J.N.; Wu, C. Semi-supervised and unsupervised extreme learning machines. IEEE Trans. Cybern. 2014, 44, 2405–2417.
