18

Oct

By Benjamin | No Comments

Spam comment analysis in R
Imagine login into your blog and find out more than hundred spam messages, not cool!. I am not letting the spammers win, so I decided to crack some patterns and try to understand/learn something about these little annoying bots.
For this post, I am performing spam...

30

Apr

Introduction to text mining in R
I was checking some Machine Learning challenges at Hackerrank and found a particular challenge which consist on document classification. The source is over here. I downloaded the dataset and decided to make my own text mining analysis instead. The dataset...

21

Feb

By Benjamin | No Comments

If the typing monkeys have met Mr Markov: probabilities of spelling "omglolbbq" after the digital monkeys have read Dracula
Introduction
The infinite monkey theorem states that a monkey hitting keys at random on a typewriter keyboard for an infinite amount of time will almost surely type a...