Movie lens 100k Recommender Data

This dataset (ml-latest) describes 5-star rating and free-text tagging activity from MovieLens, a movie recommendation service. This dataset was generated on September 26, 2018.

Users were selected at random for inclusion. All selected users had rated at least 1 movie. No demographic information is included. Each user is represented by an id, and no other information is provided. The data are contained in the files links.csv, movies.csv, ratings.csv and tags.csv.

Feedback type: Explicit

Rating scale: 1 to 5


Dataset Link

https://grouplens.org/datasets/movielens/latest/


Date Range

January 09, 1995 - September 26, 2018.


Data Size

Total data size: 265 MB

Basic Statistics

No. of users: 283k

No. of movies: 58k

No. of ratings: 27 million


Netflix Recommendation System

This is the official data set used in the Netflix Prize competition. The data consists of about 100 million movie ratings, and the goal is to predict missing entries in the movie-user rating matrix.

Feedback type: Explicit

Rating scale: 1 to 5


Dataset Link

http://academictorrents.com/details/9b13183dc4d60676b773c9e2cd6de5e5542cee9a


Date Range

October 1998 - December 2005


Data Size

Total data size: 700 MB


Basic Statistics

No. of users: 480k

No. of movies: 17k

No. of ratings: 100 million


Anime Ratings Dataset

This data set contains information on user preference data from users on anime. Each user is able to add anime to their completed list and give it a rating and this data set is a compilation of those ratings.

Feedback type: Explicit

Rating scale: -1 to 10


Dataset Link

https://www.kaggle.com/CooperUnion/anime-recommendations-database


Date Range

NA


Data Size

Anime rating data: 914.51 KB

Anime metadata: 106.24 MB


Basic Statistics

No. of users: 73.5k

No. of anime:  12.3k

No. of ratings: 7.81 million

Filmtrust Dataset

It is a small dataset crawled from the entire Filmtrust website which is currently down.

Feedback type: Explicit

Rating scale:  0 to 5 (0.5 increment)


Dataset Link

https://www.librec.net/datasets/filmtrust.zip


Data Range

NA


Data Size

93 KB


Basic Statistics

No of user: 1.5k

No of movie: 2k

No of rating: 25.4k


CiaoDVD Dataset

This is the bipartite user–movie rating network of the site http://dvd.ciao.co.uk/ from 2013. The dataset contains 303 instances of multiple edges. The timestamps are only precise up to one day

Feedback type: Explicit

Rating Scale: 1 to 5 (1 increment)

Dataset Link:

https://www.librec.net/datasets/CiaoDVD.zip

Data Range

NA


Data Size

Movie-rating: 2.5MB

Review-rating: 20.7MB


Basic Statistics

No of users: 17.6k

No of movie: 16.1k

No of rating: 72.6k


MovieTweetings Dataset

MovieTweetings is a dataset consisting of ratings on movies that were contained in well-structured tweets on Twitter. This dataset is the result of research conducted by [Simon Dooms] (http://scholar.google.be/citations?user=owaD8qkAAAAJ) (Ghent University, Belgium) and has been presented on the CrowdRec 2013 workshop which is co-located with the ACM RecSys 2013 conference.

Feedback type: Explicit

Rating Scale: 0 to 10 (1 increment)

Dataset Link:
https://github.com/sidooms/MovieTweetings

Data Range

NA


Data Size

10k: 509 KB

200k: 7.1 MB


Basic Statistics

No of users: 68.1k

No of movies: 35.8k

No of rating: 874k

See instant AI recommendation results with caboom
Start with your data or a sample for instant results right away.
Request Access