joy of data » real-life

↧

Image may be NSFW.
Clik here to view.

The tf-idf-Statistic For Keyword Extraction

February 27, 2014, 6:45 am

The tf-idf-statistic (“term frequency – inverse document frequency”) is a common tool for the purpose of extracting keywords from a document by not just considering a single document but all documents...

View Article

Image may be NSFW.
Clik here to view.

Titanic challenge on Kaggle with decision trees (party) and SVMs (kernlab)

March 28, 2014, 10:30 am

The Titanic challenge on Kaggle is about inferring from a number of personal details whether a passenger survived the disaster or did not. I gave two algorithms a try, which are decision trees using R...

View Article

Image may be NSFW.
Clik here to view.

Germans used to have more Sex in Summer!

January 1, 2015, 7:56 am

Wow – what a headline … okay, I admit it’s phrased quite sensational given that it anticipates just one possible interpretation of increasingly more births around summer / autumn compared to in spring...

View Article

Image may be NSFW.
Clik here to view.

Humor is a powerful, alternative Method for processing Data and reporting...

January 7, 2015, 10:15 am

“Je n’ai pas peur des représailles. Je n’ai pas de gosses, pas de femme, pas de voiture, pas de crédit. Ça fait sûrement un peu pompeux, mais je préfère mourir debout que vivre à genoux.” (“I am not...

View Article

Image may be NSFW.
Clik here to view.

As a Data Scientist it is my Obligation to support #nobagida, #nopegida and...

January 19, 2015, 8:00 am

Political Opinion on a Scale from 0 to 2π par(mfrow=c(2,1),mar=c(2,4,4,1)) barplot(rep(1,10), main="") mtext("#[a-z]{2}gida = ...", side=3, adj=.1, line=1, cex=2, col=rgb(.4,.4,.4))...

View Article