The basic objective of machine learning is to find patterns in data, and one of the most important forms these patterns take is classification. Classification Algorithms in Machine Learning help us determine the class of an object, which allows the computer to make decisions about it based on information about that class. Classification Algorithms in Machine Learning cover several different techniques, such as clustering, decision trees, random forests, support vector machines, Naive Bayes, and logistic regression.
There are some more Classification Algorithms in Machine Learning, but these are the most commonly used ones.
Clustering techniques are among the most common Classification Algorithms in Machine Learning (strictly speaking, clustering is an unsupervised technique, but it is widely used to assign observations to groups). The most basic clustering technique is hierarchical clustering.
In hierarchical clustering, the distance between each pair of observations is measured using Euclidean distance. The closest observations are put into the same cluster, and the average distance between the members of clusters is used for further merging. This continues until only one cluster is left. The number of clusters is chosen at the point where the jump in merge distance becomes too large; it can also be read off the dendrogram that tools such as SPSS produce.
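The merging logic above can be sketched in plain Python. This is a minimal illustration with made-up data, not a production implementation: every observation starts in its own cluster, the closest pair (by average linkage over Euclidean distances) is merged, and the distance of each merge is recorded so the "jump" can be seen.

```python
# Minimal agglomerative (hierarchical) clustering sketch with average
# linkage, standard library only. Data and structure are illustrative.
import math

def average_linkage(c1, c2):
    # Average pairwise Euclidean distance between two clusters' members.
    return sum(math.dist(p, q) for p in c1 for q in c2) / (len(c1) * len(c2))

def hierarchical(points):
    # Start with every observation in its own cluster.
    clusters = [[p] for p in points]
    merges = []  # distance at which each merge happened
    while len(clusters) > 1:
        # Find the closest pair of clusters.
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = average_linkage(clusters[i], clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        merges.append(d)
        # Merge the pair and continue until one cluster remains.
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return merges

# Two well-separated groups: the final merge distance jumps sharply,
# which is the signal that two clusters is the right number.
data = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
merge_distances = hierarchical(data)
print(merge_distances)
```

The sharp jump in the last recorded distance plays the same role as the jump one would look for on a dendrogram.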
Once the number of clusters is identified, K-Means Clustering can be used. K-Means Clustering is a better method of grouping like observations together, but it requires the number of clusters as an input. Thus the number of clusters is identified through Hierarchical Clustering, and the membership of the clusters is identified through K-Means Clustering.
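A bare-bones k-means loop looks like this. It is a sketch with hand-picked toy data and initial centroids (a real implementation would choose starting centroids more carefully); the point is that k, the number of clusters, must be supplied up front.

```python
# Minimal k-means sketch in pure Python: alternate between assigning
# points to the nearest centroid and moving centroids to the mean of
# their members. Data and starting centroids are illustrative.
import math

def kmeans(points, centroids, iterations=10):
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            idx = min(range(len(centroids)),
                      key=lambda i: math.dist(p, centroids[i]))
            clusters[idx].append(p)
        # Update step: move each centroid to the mean of its members.
        centroids = [
            tuple(sum(coord) / len(c) for coord in zip(*c)) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
    return centroids, clusters

data = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
centroids, clusters = kmeans(data, centroids=[(0, 0), (10, 10)])
print(centroids)
```

With k = 2 (the number suggested by the hierarchical step above), the two centroids settle on the means of the two natural groups.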
Decision Trees are techniques in which no input is needed apart from the initial dimensions. While the clustering techniques are a bottom-up approach, decision trees are a top-down approach. All the observations are divided according to one dimension, and further dimensions are steadily added until the tree is formed.
One difficulty of choosing decision trees as the Classification Algorithm in Machine Learning is that a decision tree, when allowed to run its course, will end with every observation classified separately. This is called overfitting: the model adheres to the training data so closely that its ability to generalise to new observations is greatly reduced. To counter this problem, the decision tree is "pruned", which means the division stops once all the groups have a reasonable number of members.
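The effect of pruning can be seen in a toy sketch. This is an illustrative single-feature tree with made-up data, where pruning is approximated by a minimum-leaf-size rule: the fully grown tree isolates an outlier, while the pruned tree stops early and keeps a smoother boundary.

```python
# Minimal decision-tree sketch on one numeric feature. A minimum leaf
# size plays the role of pruning: it stops the splitting before every
# observation ends up in its own leaf. Data are made up.

def majority(labels):
    return max(set(labels), key=labels.count)

def build_tree(data, min_leaf=1):
    # data: list of (x, label). Stop when a node is pure or no split
    # would leave at least min_leaf observations on each side.
    labels = [y for _, y in data]
    if len(set(labels)) == 1:
        return {"leaf": labels[0]}
    best = None
    for t in sorted({x for x, _ in data})[1:]:  # candidate thresholds
        left = [(x, y) for x, y in data if x < t]
        right = [(x, y) for x, y in data if x >= t]
        if len(left) < min_leaf or len(right) < min_leaf:
            continue
        # Score a split by how many points a majority vote misclassifies.
        err = sum(y != majority([l for _, l in left]) for _, y in left)
        err += sum(y != majority([r for _, r in right]) for _, y in right)
        if best is None or err < best[0]:
            best = (err, t, left, right)
    if best is None:
        return {"leaf": majority(labels)}  # no legal split: stop here
    _, t, left, right = best
    return {"threshold": t,
            "left": build_tree(left, min_leaf),
            "right": build_tree(right, min_leaf)}

def predict(tree, x):
    while "leaf" not in tree:
        tree = tree["left"] if x < tree["threshold"] else tree["right"]
    return tree["leaf"]

# One "a" at x=10 sits inside the "b" region, like a noisy observation.
data = [(1, "a"), (2, "a"), (3, "a"), (8, "b"), (9, "b"), (10, "a")]
full = build_tree(data, min_leaf=1)    # grows until leaves are pure
pruned = build_tree(data, min_leaf=3)  # stops early
print(predict(full, 10), predict(pruned, 10))
```

The full tree memorises the stray "a" at x = 10; the pruned tree classifies that whole region as "b", which is the overfitting trade-off the paragraph above describes.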
Random Forests are another technique used to overcome the problem of overfitting. Here, many bootstrap samples of the training data are drawn using sampling with replacement, and a separate decision tree is built on each sample. The number of trees formed is huge, which means that the bias in any single sample is averaged out over the ensemble. The overall model classifies an observation by taking a majority vote across the decision trees.
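The two ingredients, bootstrap sampling and majority voting, can be sketched with a deliberately weak learner (a one-split "stump") standing in for a full decision tree. Everything here is illustrative: the data, the number of trees, and the stump learner itself.

```python
# Minimal random-forest-style sketch: draw bootstrap samples (sampling
# with replacement), fit a weak one-split classifier on each, and
# classify new points by majority vote. Illustrative data and settings.
import random

def fit_stump(data):
    # Pick the threshold on x that misclassifies the fewest sample points.
    ys_all = [y for _, y in data]
    default = max(set(ys_all), key=ys_all.count)
    best = None
    for t, _ in data:
        def side_label(ge):
            ys = [y for x, y in data if (x >= t) == ge]
            return max(set(ys), key=ys.count) if ys else default
        lo, hi = side_label(False), side_label(True)
        err = sum(y != (hi if x >= t else lo) for x, y in data)
        if best is None or err < best[0]:
            best = (err, t, lo, hi)
    _, t, lo, hi = best
    return lambda x: hi if x >= t else lo

def random_forest(data, n_trees=25, seed=0):
    rng = random.Random(seed)
    stumps = []
    for _ in range(n_trees):
        sample = [rng.choice(data) for _ in data]  # bootstrap sample
        stumps.append(fit_stump(sample))
    def predict(x):
        votes = [s(x) for s in stumps]
        return max(set(votes), key=votes.count)  # majority vote
    return predict

data = [(1, 0), (2, 0), (3, 0), (8, 1), (9, 1), (10, 1)]
predict = random_forest(data)
print(predict(2), predict(9))
```

Individual stumps trained on different bootstrap samples can disagree, but the majority vote smooths out the quirks of any single sample, which is the averaging effect described above.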
Support Vector Machines can be used as a Prediction Algorithm in Machine Learning as well as a Classification Algorithm in Machine Learning; however, they are usually used for classification. In this method, the observations are plotted in an n-dimensional space, where n is the number of dimensions in the dataset. The ideal hyperplane is then chosen, that is, the one that best divides the observations into two classes. While SVM was developed as a Classification Algorithm in Machine Learning for just two classes, it has since been extended to handle more classes.
The dividing surface in the SVM also does not need to be linear in the original space. It is possible to transform the data using a technique called the kernel trick, which makes a linear division possible for a much wider range of datasets.
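The intuition can be shown with a toy feature map. The made-up data below (an inner ring and an outer ring) cannot be split by any straight line in two dimensions, but after adding the feature z = x² + y² a single linear threshold separates them. A real SVM never computes such a mapping explicitly; it evaluates a kernel function (such as the RBF kernel) instead, but the geometric idea is the same.

```python
# Sketch of the idea behind the kernel trick: lift the data into a
# higher-dimensional space where a linear boundary works. Data made up.

inner = [(1, 0), (0, 1), (-1, 0), (0, -1)]   # class -1, radius 1
outer = [(3, 0), (0, 3), (-3, 0), (0, -3)]   # class +1, radius 3

def feature_map(p):
    x, y = p
    return (x, y, x * x + y * y)  # add a squared-radius coordinate

# In the lifted space, the plane z = 5 separates the rings perfectly,
# even though no line in the original 2-D space could.
threshold = 5.0
def classify(p):
    return 1 if feature_map(p)[2] >= threshold else -1

inner_pred = [classify(p) for p in inner]
outer_pred = [classify(p) for p in outer]
print(inner_pred, outer_pred)
```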
Naive Bayes is a Classification Algorithm in Machine Learning that assumes all the dimensions are independent of each other (even if they are in fact interlinked). The classification model is built on this assumption.
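The independence assumption shows up directly in code as a product of per-feature likelihoods. Below is a minimal sketch over categorical features with made-up weather-style data; the add-one smoothing is a common simple choice to avoid zero probabilities, not a fixed part of the algorithm.

```python
# Minimal naive Bayes sketch for categorical features. The class score
# is prior * product of per-feature likelihoods -- the product is where
# the independence assumption lives. Toy data; add-one smoothing.
from collections import Counter, defaultdict

def train(rows):
    # rows: list of (features_tuple, label)
    labels = Counter(label for _, label in rows)
    counts = defaultdict(Counter)  # (feature_index, label) -> value counts
    for feats, label in rows:
        for i, v in enumerate(feats):
            counts[(i, label)][v] += 1
    return labels, counts

def predict(model, feats):
    labels, counts = model
    total = sum(labels.values())
    best = None
    for label, n in labels.items():
        p = n / total  # prior P(label)
        for i, v in enumerate(feats):
            c = counts[(i, label)]
            # Independence assumption: multiply the per-feature
            # likelihoods, smoothed so unseen values are not zero.
            p *= (c[v] + 1) / (sum(c.values()) + len(c) + 1)
        if best is None or p > best[0]:
            best = (p, label)
    return best[1]

rows = [(("sunny", "hot"), "no"), (("sunny", "mild"), "no"),
        (("rain", "mild"), "yes"), (("rain", "cool"), "yes"),
        (("overcast", "hot"), "yes")]
model = train(rows)
print(predict(model, ("rain", "mild")))
```

Even though weather and temperature are clearly interlinked in reality, the model scores them as if they were independent, which is exactly the simplification the paragraph above describes.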
Logistic Regression is a modification of the Linear Regression technique. While regression techniques are generally used as Prediction Algorithms in Machine Learning, Logistic Regression is used as a Classification Algorithm in Machine Learning. Another distinction is that logistic regression predicts a nominal (categorical) outcome, while linear regression predicts a continuous value. The model still runs as a regression, but the predicted value is passed through the logistic function, and the output is 0 or 1 depending on how the result compares to a set threshold. This threshold is 0.5 by default, but it can be changed depending on the application, for example when the costs of the two kinds of misclassification differ.
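The linear-score-plus-threshold structure can be sketched end to end. This toy version fits a single weight and bias by plain stochastic gradient descent on made-up one-dimensional data; the essential point is the last step, where the sigmoid output is compared against the threshold.

```python
# Minimal logistic-regression sketch: a linear score passed through the
# logistic (sigmoid) function, thresholded at 0.5 to yield a 0/1 class.
# Trained by simple per-sample gradient updates on toy data.
import math

def sigmoid(z):
    return 1 / (1 + math.exp(-z))

def fit(xs, ys, lr=0.1, epochs=2000):
    w, b = 0.0, 0.0
    for _ in range(epochs):
        for x, y in zip(xs, ys):
            p = sigmoid(w * x + b)         # predicted probability
            w += lr * (y - p) * x          # gradient step on log-likelihood
            b += lr * (y - p)
    return w, b

def predict(w, b, x, threshold=0.5):
    # The regression output is a probability; the threshold turns it
    # into a hard 0/1 class decision.
    return 1 if sigmoid(w * x + b) >= threshold else 0

xs = [1, 2, 3, 8, 9, 10]
ys = [0, 0, 0, 1, 1, 1]
w, b = fit(xs, ys)
print([predict(w, b, x) for x in xs])
```

Raising the threshold above 0.5 makes the model more reluctant to output class 1, which is how the trade-off between the two kinds of misclassification is tuned in practice.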
Classification Algorithms are among the most important techniques in Machine Learning since they have huge potential for implementation. The most common application of Classification Algorithms in Machine Learning is in marketing, where segmentation of the target audience is carried out with their help.
Apart from this application, Classification Algorithms in Machine Learning are also used in banks to determine the level of risk each potential customer represents when deciding whether or not to give the person a loan.
There are several other applications of Classification Algorithms in Machine Learning, and they are only going to grow in the future.