πŸ‡¨πŸ‡΄ @guerravis
πŸ‡ΊπŸ‡Έ @duto_guerra

Intro to Machine Learning
From definitions to examples


John Alexis Guerra GΓ³mez
πŸ‡¨πŸ‡΄@guerravis
πŸ‡ΊπŸ‡Έ@duto_guerra

https://johnguerra.co/viz/introToML



Use spacebar and the arrows to advance slides

Data Science

What is it?

Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from structured and unstructured data
Wikipedia
... to work effectively with heterogeneous, real-world data and to extract insights from the data using the latest tools and analytical methods.
UC Berkeley MIDS program brochure

However

When you search online this is what you see

Google Trends

Data Science is way more than Machine Learning!

The purpose of visualization is insight, not pictures

The purpose of data analytics is insight, not (just) models

Classical programming: data+rules = answers. Machine Learning data+answers=rules

What part of ML do you want to use?

ML Training
ML Prediction

Prediction

ML Prediction

How to use it (prediction)?

  • Op1. Use models (apis) trained by Google, Amazon, IBM or Microsoft
  • Op2. Download models pretrained by others and run them yourself

What can you use it for?

  • Photos πŸ–Ό
  • Videos πŸ“Ή
  • Document/Text Processing πŸ“ƒ
  • Speech πŸ‘„πŸ‘‚πŸΌ
  • Structured data πŸ’Ύ?

Photos πŸ–Ό and Videos πŸ“Ή

What can I detect on photos πŸ–Ό?

  • Objects 🐈 πŸ• 🍎
  • Faces πŸ‘±πŸ½β€β™‚οΈπŸ‘±β€β™€οΈ
  • Celebrities 🍾
  • Landmarks πŸ—Ό
  • Text in images πŸ—Ό
Video πŸ“Ή is about the same but on streaming

How can I use it?

Develop locally

Pose Detection

https://johnguerra.co/viz/mlPose/

Object Detection

https://johnguerra.co/viz/mlObject/

How can I use it?

Demos

Document/Text processing πŸ“ƒ

What can I do with documents πŸ“ƒ?

  • OCR πŸ–Ό β†’ πŸ”€
  • Sentiment analysis πŸ˜†πŸ˜‘
  • Topic extraction 🟑🟠🟣
  • Entities detection
  • Political Affiliation? πŸ‘”πŸŽ‰
  • Psychological Profile?

Demos

Speech πŸ‘„πŸ‘‚πŸΌ

What can I do with Speech πŸ‘„πŸ‘‚πŸΌ?

  • Speech recognition πŸ‘‚πŸΌ
  • Speech generation πŸ‘„

Data Analysis

Let's compare them with a real world example

How is Rappi doing on Twitter?

  • 30k tweets in a week of 2019

Approach 1

πŸ˜‘πŸ˜ πŸ˜’πŸ˜πŸ˜πŸ˜ƒπŸ₯°?

  • Machine learning 🎩! ???
  • Detects sentiment ! ???

I hired a data πŸ’ (might be me)

Analyzed 180 tweets

  • πŸ˜‘πŸ˜ πŸ˜’πŸ˜πŸ˜πŸ˜ƒπŸ₯°

Here are some of them

Rappi tweet
😐 -10%
Rappi tweet
😑 -80%
Rappi tweet
πŸ₯° 80%
Rappi tweet
😐 -10%
Rappi tweet
😐 -20%
Rappi tweet
πŸ₯° 90%
Rappi tweet
πŸ˜’ -40%
Rappi tweet
πŸ˜’ -30%

Would you hire this data πŸ’?

Well, actually

  • It wasn't a data πŸ’
  • It was a πŸ’»
  • Would you use it?

Well, actually, actually

Will you trust it?

I don't

Approach 2

Approach 3

It's up to you!

  • Interactivity πŸ‘‰ Ask questions
  • Slice and dice
  • Overview first, Zoom/Filter, then details on demand

Rappi Dashboard Link πŸ˜‰

Β‘No coma Machine Learning, coma πŸ–!

My insights toolset?

Training

ML Training

Tensorflow Models

https://www.tensorflow.org/js/models
ML Training

Tensorflow.js

Multivariate Data?

-> Dimensionality Reduction + Clustering

MLExplore.js

FabiΓ‘n PeΓ±a

Visual Analytics Applications

  • Present information
  • Discover insights
  • Preprocess + Explore
  • Open the black box

Present information

  • Prediscovered insight
  • General public

Discover insights

  • For experts
  • Navigate the data

Preprocess

  • Understand new data
  • Cleanup

Cleanup

Explore

Open the black box

MLExplore.js

  • Interpret and interact with TSNE+Kmeans
FabiΓ‘n PeΓ±a