Get New Netflix Movies by a Recommendation System

Photo by Thibault Penin on Unsplash

Dataset

Data Exploration

Data summary 
--------------------------------------------------
Movie total count: 17770
Users total count: 480189
Rating total count: 100480507
Image by Author
Elapsed time: 0:07:08.215719
Image by Author
Average Ratings
------------------------------
weekday
Friday 3.585274
Monday 3.577250
Saturday 3.591791
Sunday 3.594144
Thursday 3.582463
Tuesday 3.574438
Wednesday 3.583751
Name: rating, dtype: float64
Image by Author
user
305344 17112
2439493 15896
387418 15402
1639792 9767
1461435 9447
Name: rating, dtype: int64
Image by Author
0.00        1
0.05 7
0.10 15
0.15 21
0.20 27
0.25 34
0.30 41
0.35 50
0.40 60
0.45 73
0.50 89
0.55 109
0.60 133
0.65 163
0.70 199
0.75 245
0.80 307
0.85 392
0.90 520
0.95 749
1.00 17112
Name: rating, dtype: int64
Image by Author

Sparse Matrix

Photo by cmdline on cmdlinetips
Sparse Matrix created. Shape is: (user, movie):  (2649430, 17771)
{'global': 3.582890686321557}
User Rating Average: 4.25
Movie Rating Average 32: 3.9922680412371134

Cold Start Problem

Users total count: 480189
Total training users count: 405041
Total users not count in training: 75148 (15.65%)
Movies total count: 17770
Total training movies count: 17424
Total movies not count in training: 346 (1.95%)

Movie Similarity Matrix

# Shape
movie_sparse_matrix.shape
(17771, 17771)

Recommendation

  • Movie id
  • Movie release Year
  • Movie title
movie_titles.head()
Image by Author
Movie: Toy Story
Total User Ratings = 4785.
We find 17342 movies that are similar to this one and we'll print the most similar ones.
Image by Author
Image by Author

Conclusion

Gif by Giphy on Giphy

--

--

--

Mechanical Engineer Business Intelligence developer, passionate about technology, I have knowledge and experience to create a BI architecture and much more 📚.

Love podcasts or audiobooks? Learn on the go with our new app.

Recommended from Medium

Graphs and ML: Linear Regression

Pro Data Scientist

Crop and Identify Specific Objects in an Image or Video in the Browser using Machine Learning

The Curse of Dimensionality!

Contributing to the Good of Travel Analysis

PII. The Great Taboo of Data Science

“Polynomial Regression” in 200 words.

Apache Spark On Dataproc vs Google-BigQuery

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Guilherme Matheus

Guilherme Matheus

Mechanical Engineer Business Intelligence developer, passionate about technology, I have knowledge and experience to create a BI architecture and much more 📚.

More from Medium

S2DS: Real-World Experience to Jumpstart your Career

How Data Science and Machine Learning Can change the pricing methods in Diamond Industry.

What Is AutoML and How Can I Use It?

Human Autonomy: Using Technology to Achieve our Goals