Normal procedure
Ask friends and family
Problem
That's inferring statistics from a sample n=1
Better approach
Data based decisions
How to name my brother's new baby?
Colombian National Registry
- Me: Hey, do you have data of Colombians' names?
- CNR: Sure of course!
- Me: Great, can I have all of them
- CNR: Of course, it is just $0.40 per name
- Me: π€¦ββοΈ
Choosing schools for my nephew
- National exam results
- Is open data!
- 3 hrs later
Shiny packaged black boxes
How is Rappi doing on Twitter?
- 30k tweets in a week of 2019
π‘π πππππ₯°?
- Machine learning π©! ???
- Detects sentiment ! ???
I looked at 180 of these tweets
Other lessons
- Self appointed data scientists
- Everyone wants a piece of it
- Lack of focus on insights
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from structured and unstructured data
Wikipedia
The purpose of visualization is insight, not pictures
What's the first thing you do with a new dataset?
MLExplore.js
- Interpret and interact with TSNE+Kmeans
Lessons Learned
- Open data
- ππΌ Insights! ππΌ
- ML is just another tool
- Visual Analytics empowers users