Support vector machine (SVM)

The basis Support vector machine (SVM) is a supervised learning method. It can be used for regression or classification purposes. SVM is useful for hyper-text categorization, classification of images, recognition of characters… The basic visual idea is the creation of planes (lines) to separate features. The position of these planes can be adjusted to maximized the … Read more

Machine learning, source of errors

Before to start What is an error? Observation prediction error = Target – Prediction = Bias + Variance + Noise The main sources of errors are Bias and Variability (variance). Underfitting or overfitting. Underclustering or overclustering. Improper validation (after the training). It could be that comes from the wrong validation set. It is important to divide … Read more

k-means clustering

The basis K-means clustering is an unsupervised learning method. The aim is to find clusters and the CentroIDs that can potentially identify the What is a cluster? a set of data points grouped in the same category. What is a CentroID? center or average of a given cluster. What is “k”? the number of CentroIDs … Read more

Naive Bayes classification

The basis It’s based on Bayes’ theorem (check the wikipedia link, and see how complex the decision trees could be). Assumes predictors contribute independently to the classification. Works well in supervised learning problems. Works with continuous and discrete data. Can work with discrete data sets. It is not sensitive to non-correlating “predictors”. Naives Bayes plot Example: … Read more

Understanding Logistics regression

The basis Logistics or Logit regression. It’s a regression model where the dependent variable (DV) is categorical. Outcome follows a Bernoulli distribution. Success = 1 , failure = 0. Natural logarithm of odds ratio ln (p/1-p)… logit(p). Inverse log curve gives a nice “s” curve to work with. Equate logarithm of odds ratio with regression line equation. Solve for probability … Read more

Linear regression example

I was looking for a simple example of a regression and how to calculate it by hand. I found this one: least squares example. the main formula to calculate the linear regression is   y = Ḇo + Ḇ1xcontinue learning the basis !

Overfitting

What is overfitting? when you are preparing a machine learning solution, you work basically with data sets that contains: Data: relevant and/or important data. Noise: inrelevant and/or non important data. With this data you want to identify a trigger, a signal that responds to your target pattern you want that your code identifies. So you … Read more

Understanding machine learning

I have watched this video from @TessFerrandez: Machine Learning for Developers. The video explains how the process of building a machine learning solution is. She explains it in plain English and with very nice examples easy to remind the concepts. The video helped me to link a lot of technical ideas explained in the courses with … Read more