Contents




Datasets


In HetRec 2011, we make available several datasets with various types of user preferences about resources belonging to three domains -movies, Web pages, and music tracks- and having diverse meta-information.

  • hetrec2011-movielens-2k

    This is an extension of MovieLens10M dataset, which contains personal ratings and tags about movies. From the original dataset, only those users with both ratings and tags have been mantained.

    In the dataset, the movies are linked to Internet Movie Database (IMDb) and RottenTomatoes (RT) movie review systems. Each movie does have its IMDb and RT identifiers, English and Spanish titles, picture URLs, genres, directors, actors (ordered by "popularity"), RT audience' and experts' ratings and scores, countries, and filming locations.

  • hetrec2011-delicious-2k

    This dataset has been obtained from Delicious social bookmarking system. Its users are interconnected in a social network generated from Delicious "mutual fan" relations.

    Each user has bookmarks, tag assignments, i.e. tuples [user, tag, bookmark], and contact relations within the dataset social network. Each bookmark has a title and URL.
  • hetrec2011-lastfm-2k

    This dataset has been obtained from Last.fm online music system. Its users are interconnected in a social network generated from Last.fm "friend" relations.

    Each user has a list of most listened music artists, tag assignments, i.e. tuples [user, tag, artist], and friend relations within the dataset social network. Each artist has a Last.fm URL and a picture URL.

The datasets are kindly hosted by GroupLens research group at University of Minnesota.

Participants are encouraged to evaluate their recommendation approaches with the provided data, and enrich it with additional data, which could be published at the workshop website for future use.

Moreover, in this edition, we are very interested in providing such heterogeneous datasets from different domains to encourage researchers to develop cross-domain recommendation approaches and user studies. User judgements of resource recommendations would be very valuable for futher research, and could be made publicly available as a result of the workshop.


2nd International Workshop on Information Heterogeneity and Fusion in Recommender Systems (HetRec 2011)
5th ACM Conference on Recommender Systems (RecSys 2011)
23rd-27th October 2011 | Chicago, IL, USA