Machine learning, sources of error

Before we start: what is an error? For a single observation, prediction error = Target – Prediction. Averaged over many training sets, the expected squared error decomposes into Bias² + Variance + Noise. The main sources of error are bias and variability (variance), which show up as underfitting or overfitting (underclustering or overclustering in the unsupervised case). Improper validation after the training is another source: the error may come from the wrong validation set. It is important to divide …
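The decomposition into bias, variance and noise can be checked numerically. The following is a minimal sketch, not taken from the original post: it assumes a deliberately biased estimator (a sample mean shrunk by a hypothetical factor `shrink`) and Gaussian noise, and all names and default values are illustrative. It compares the average squared error with the sum of squared bias, variance and noise variance.

```python
import random
import statistics


def decompose_error(true_value=2.0, noise_sd=1.0, shrink=0.8,
                    sample_size=10, trials=10000, seed=0):
    """Simulate many training sets and split the squared error into parts.

    All parameters are hypothetical values chosen for illustration.
    """
    rng = random.Random(seed)
    predictions, noises, squared_errors = [], [], []
    for _ in range(trials):
        # A fresh "training set" drawn around the true value.
        sample = [true_value + rng.gauss(0, noise_sd) for _ in range(sample_size)]
        # Shrinking the sample mean makes the estimator biased on purpose.
        prediction = shrink * statistics.fmean(sample)
        # A new noisy target that the model has not seen.
        noise = rng.gauss(0, noise_sd)
        target = true_value + noise
        predictions.append(prediction)
        noises.append(noise)
        squared_errors.append((target - prediction) ** 2)
    return {
        "mse": statistics.fmean(squared_errors),
        "bias_sq": (statistics.fmean(predictions) - true_value) ** 2,
        "variance": statistics.pvariance(predictions),
        "noise": statistics.pvariance(noises),
    }


result = decompose_error()
total = result["bias_sq"] + result["variance"] + result["noise"]
print(f"mean squared error:      {result['mse']:.3f}")
print(f"bias² + variance + noise: {total:.3f}")
```

The two printed numbers should be close but not identical, because the decomposition holds in expectation and the simulation uses a finite number of trials.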

k-means clustering

The basics: K-means clustering is an unsupervised learning method. The aim is to find clusters and the centroids that can potentially identify the … What is a cluster? A set of data points grouped in the same category. What is a centroid? The center, or average, of a given cluster. What is “k”? The number of centroids …
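To make the terms cluster, centroid and “k” concrete, here is a minimal sketch of the usual assign-then-average loop (Lloyd's algorithm) in plain Python. It is an illustration under assumptions, not the code from the original post: the function name `k_means`, the sample points and the starting centroids are all made up, and k is simply the number of starting centroids passed in.

```python
import math


def k_means(points, centroids, max_iter=100):
    """Cluster `points` around len(centroids) centers (k = len(centroids))."""
    centroids = [tuple(c) for c in centroids]
    clusters = [[] for _ in centroids]
    for _ in range(max_iter):
        # Assignment step: each point joins the cluster of its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: math.dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: each centroid becomes the average of its cluster.
        # An empty cluster keeps its previous centroid.
        new_centroids = [
            tuple(sum(coord) / len(cluster) for coord in zip(*cluster))
            if cluster else centroids[i]
            for i, cluster in enumerate(clusters)
        ]
        if new_centroids == centroids:
            break  # assignments are stable, so the algorithm has converged
        centroids = new_centroids
    return centroids, clusters


# Hypothetical 2-D data with two visible groups, and k = 2 starting centroids.
points = [(1, 1), (1.5, 2), (1, 0.5), (8, 8), (9, 9.5), (8.5, 9)]
centroids, clusters = k_means(points, centroids=[(1, 1), (8, 8)])
print(centroids)
print(clusters)
```

The result depends on the starting centroids, which is why practical implementations usually run several random initializations and keep the best one.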

Naive Bayes classification

The basics: It’s based on Bayes’ theorem (check the Wikipedia link, and see how complex the decision trees could be). It assumes that predictors contribute independently to the classification. It works well in supervised learning problems and handles both continuous and discrete data. It is not sensitive to non-correlating “predictors”. Naive Bayes plot. Example: …
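The independence assumption means the score for a class is just its prior multiplied by one likelihood per predictor. A minimal sketch for discrete data follows; it is not the example from the original post. The weather-style rows and labels are invented, the function names `train` and `predict` are hypothetical, and add-one (Laplace) smoothing is an assumption added so that unseen values do not produce a zero probability.

```python
import math
from collections import Counter, defaultdict


def train(rows, labels):
    """Count classes and, per class, the values seen for each predictor."""
    class_counts = Counter(labels)
    feature_counts = defaultdict(Counter)  # (label, feature index) -> value counts
    values = defaultdict(set)              # feature index -> distinct values seen
    for row, label in zip(rows, labels):
        for i, value in enumerate(row):
            feature_counts[(label, i)][value] += 1
            values[i].add(value)
    return class_counts, feature_counts, values


def predict(model, row):
    """Return the class with the highest posterior score for `row`."""
    class_counts, feature_counts, values = model
    total = sum(class_counts.values())
    scores = {}
    for label, count in class_counts.items():
        # Work in log space: log prior + sum of log likelihoods.
        log_p = math.log(count / total)
        for i, value in enumerate(row):
            # Add-one smoothing (an assumption of this sketch).
            numerator = feature_counts[(label, i)][value] + 1
            denominator = count + len(values[i])
            log_p += math.log(numerator / denominator)
        scores[label] = log_p
    return max(scores, key=scores.get)


# Invented toy data: (outlook, temperature) -> play outside?
rows = [("sunny", "hot"), ("sunny", "mild"), ("rainy", "mild"),
        ("rainy", "cool"), ("overcast", "hot"), ("overcast", "cool")]
labels = ["no", "no", "yes", "yes", "yes", "yes"]
model = train(rows, labels)
print(predict(model, ("sunny", "hot")))
print(predict(model, ("rainy", "cool")))
```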

Understanding Logistic regression

The basics: logistic, or logit, regression is a regression model where the dependent variable (DV) is categorical. The outcome follows a Bernoulli distribution: success = 1, failure = 0. The natural logarithm of the odds, ln(p / (1 − p)), is logit(p). The inverse of the logit curve gives a nice “S” curve to work with. Equate the logarithm of the odds with the regression line equation, then solve for the probability …
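Setting logit(p) equal to a regression line b0 + b1·x and solving for p gives p = 1 / (1 + e^−(b0 + b1·x)), the S-shaped curve. The short sketch below shows that round trip. The function names and the coefficient values are hypothetical and chosen only for illustration; in practice the coefficients would be fitted to data.

```python
import math


def logit(p):
    """Natural logarithm of the odds p / (1 - p), for 0 < p < 1."""
    return math.log(p / (1 - p))


def sigmoid(z):
    """Inverse of the logit: maps any real number back to a probability."""
    return 1 / (1 + math.exp(-z))


def predict_probability(x, b0, b1):
    """Probability of success (1) when logit(p) = b0 + b1 * x."""
    return sigmoid(b0 + b1 * x)


# The two functions undo each other.
print(sigmoid(logit(0.8)))

# Hypothetical coefficients: probabilities follow an S curve as x grows.
for x in (-4, 0, 2, 4, 8):
    print(x, round(predict_probability(x, b0=-1.0, b1=0.5), 3))
```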

Linear regression example

I was looking for a simple example of a regression and how to calculate it by hand. I found this one: least squares example. The main formula to calculate the linear regression is y = β₀ + β₁x. Continue learning the basics!
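The by-hand calculation can be sketched in a few lines. This is an illustration with made-up numbers, not the data from the linked example: the slope is the sum of the products of the deviations from the means divided by the sum of the squared x deviations, and the intercept follows from the means. The names `fit_line`, `b0` and `b1` are assumptions standing in for the two coefficients of the formula.

```python
def fit_line(xs, ys):
    """Ordinary least squares for one predictor: returns (b0, b1)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations, and sum of squared x deviations.
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    s_xx = sum((x - mean_x) ** 2 for x in xs)
    b1 = s_xy / s_xx             # slope
    b0 = mean_y - b1 * mean_x    # intercept: the line passes through the means
    return b0, b1


# Made-up data points for illustration.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
b0, b1 = fit_line(xs, ys)
print(f"y = {b0:.2f} + {b1:.2f}x")
```

For these numbers the means are 3 and 4, the deviation products sum to 6 and the squared x deviations sum to 10, so the slope is 0.6 and the intercept is 4 − 0.6 × 3 = 2.2.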

Serverless

Serverless is a buzzword. It points to the future of software development in a post-cloud world. The idea of “Serverless” is NOT about removing the servers completely (or you couldn’t use the internet at all), but essentially about paying for services in which someone else manages the servers for you, thereby reducing the maintenance load. It’s …

Kepner Tregoe Method

The Kepner Tregoe Method, also known as the KT-method, was developed by Charles Kepner and Benjamin Tregoe. When should we use it? The Kepner Tregoe method is a problem analysis model which fits well when the “problem” is disconnected from the “decision”. The Kepner-Tregoe Matrix comprises 4 steps. How to use it

Alexa,

The event: this Christmas I went to a friend’s home, and they had Alexa there. At one point I got curious about how it worked and what kinds of things you can ask Alexa: play music, ask about the weather, add products to the shopping cart, set an alarm to get up… What stunned me the …

Change behaviors

This is a list of changes I have made and changes I should still make. I DECREASED OR STOPPED USING: Facebook (once a week only), LinkedIn (once a day only), Twitter (once a day only), television (since 2012), sugar (no sugar at home since 2016), fried food (limited since 2007), smartphones during meals (removed since October …

Guru99, python and more

There are so many tutorials about almost everything; this time I found this one: https://www.guru99.com Guru99 Vision: Fun & Free Education for ALL. Guru99 Mission: To bring all feasible courses online. Python was the first one I looked at, but I quickly moved on to other tutorials. I am now immersed in an SAP HCM project, so …