HOME

TheInfoList



OR:

MovieLens is a web-based
recommender system A recommender system, or a recommendation system (sometimes replacing 'system' with a synonym such as platform or engine), is a subclass of information filtering system that provide suggestions for items that are most pertinent to a particular ...
and
virtual community A virtual community is a social network of individuals who connect through specific social media, potentially crossing geographical and political boundaries in order to pursue mutual interests or goals. Some of the most pervasive virtual communi ...
that recommends movies for its users to watch, based on their film preferences using
collaborative filtering Collaborative filtering (CF) is a technique used by recommender systems.Francesco Ricci and Lior Rokach and Bracha ShapiraIntroduction to Recommender Systems Handbook Recommender Systems Handbook, Springer, 2011, pp. 1-35 Collaborative filtering ...
of members' movie ratings and movie reviews. It contains about 11 million ratings for about 8500 movies. MovieLens was created in 1997 by GroupLens Research, a research lab in the Department of Computer Science and Engineering at the
University of Minnesota The University of Minnesota, formally the University of Minnesota, Twin Cities, (UMN Twin Cities, the U of M, or Minnesota) is a public land-grant research university in the Twin Cities of Minneapolis and Saint Paul, Minnesota, United States. ...
, in order to gather research data on personalized recommendations.


History

MovieLens was not the first recommender system created by GroupLens. In May 1996, GroupLens formed a commercial venture called Net Perceptions, which served clients that included
E! Online E! (an initialism for Entertainment Television) is an American basic cable channel which primarily focuses on pop culture, celebrity focused reality shows, and movies, owned by the NBCUniversal Television and Streaming division of NBCUniver ...
and
Amazon.com Amazon.com, Inc. ( ) is an American multinational technology company focusing on e-commerce, cloud computing, online advertising, digital streaming, and artificial intelligence. It has been referred to as "one of the most influential econo ...
. E! Online used Net Perceptions' services to create the recommendation system for Moviefinder.com, while Amazon.com used the company's technology to form its early recommendation engine for consumer purchases. When another movie recommendation site, eachmovie.org, closed in 1997, the researchers who built it publicly released the anonymous rating data they had collected for other researchers to use. The GroupLens Research team, led by Brent Dahlen and Jon Herlocker, used this
data set A data set (or dataset) is a collection of data. In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record of the ...
to jumpstart a new movie recommendation site, which they chose to call MovieLens. Since its inception, MovieLens has become a very visible research platform: its data findings have been featured in a detailed discussion in a New Yorker article by
Malcolm Gladwell Malcolm Timothy Gladwell (born 3 September 1963) is an English-born Canadian journalist, author, and public speaker. He has been a staff writer for ''The New Yorker'' since 1996. He has published seven books: '' The Tipping Point: How Little ...
, as well as a report in a full episode of ABC Nightline. Additionally, MovieLens data has been critical for several research studies, including a collaborative study between Carnegie Mellon University, University of Michigan, University of Minnesota, and University of Pittsburgh, "Using Social Psychology to Motivate Contributions to Online Communities". During Spring in 2015, a search for "movielens" produced 2,750 results in Google Books and 7,580 in Google Scholar.


Recommendations

MovieLens bases its recommendations on input provided by users of the website, such as movie ratings. The site uses a variety of recommendation algorithms, including
collaborative filtering Collaborative filtering (CF) is a technique used by recommender systems.Francesco Ricci and Lior Rokach and Bracha ShapiraIntroduction to Recommender Systems Handbook Recommender Systems Handbook, Springer, 2011, pp. 1-35 Collaborative filtering ...
algorithms such as item-item, user-user, and regularized
SVD ''Svenska Dagbladet'' (, "The Swedish Daily News"), abbreviated SvD, is a daily newspaper published in Stockholm, Sweden. History and profile The first issue of ''Svenska Dagbladet'' appeared on 18 December 1884. During the beginning of the ...
. In addition, to address the cold-start problem for new users, MovieLens uses preference elicitation methods. The system asks new users to rate how much they enjoy watching various groups of movies (for example, movies with dark humor, versus romantic comedies). The preferences recorded by this survey allow the system to make initial recommendations, even before the user has rated a large number of movies on the website. For each user, MovieLens predicts how the user will rate any given movie on the website. Based on these predicted ratings, the system recommends movies that the user is likely to rate highly. The website suggests that users rate as many fully watched films as possible, so that the recommendations given will be more accurate, since the system would then have a better sample of the user's film tastes. However, MovieLens' rating incentive approach is not always particularly effective, as researchers found more than 20% of the movies listed in the system have so few ratings that the recommender algorithms cannot make accurate predictions about whether subscribers will like them or not. The recommendations on movies cannot contain any marketing values that can tackle the large number of movie ratings as a "seed dataset". In addition to movie recommendations, MovieLens also provides information on individual films, such as the list of actors and directors of each film. Users may also submit and rate tags (a form of metadata, such as "based on a book", "too long", or "campy"), which may be used to increase the film recommendations system's accuracy.


Reception

By September 1997, the website had reached over 50,000 users. When the '' Akron Beacon Journal''s Paula Schleis tried out the website, she was surprised at how accurate the website was in terms of recommending new films for her to watch based on her film tastes. Outside of the realm of movie recommendations, data from MovieLens has been used by Solution by Simulation to make Oscar predictions. Hickey, Walt. "Do Your Oscar Predictions Stack Up? Here's What The Data Says." FiveThirtyEight. N.p., 18 Feb. 2016. Web. 08 Mar. 2016.


Research

In 2004, a collaborative study with researchers from Carnegie Mellon University,
University of Michigan , mottoeng = "Arts, Knowledge, Truth" , former_names = Catholepistemiad, or University of Michigania (1817–1821) , budget = $10.3 billion (2021) , endowment = $17 billion (2021)As o ...
,
University of Minnesota The University of Minnesota, formally the University of Minnesota, Twin Cities, (UMN Twin Cities, the U of M, or Minnesota) is a public land-grant research university in the Twin Cities of Minneapolis and Saint Paul, Minnesota, United States. ...
and
University of Pittsburgh The University of Pittsburgh (Pitt) is a public state-related research university in Pittsburgh, Pennsylvania. The university is composed of 17 undergraduate and graduate schools and colleges at its urban Pittsburgh campus, home to the univers ...
designed and tested incentives derived from the social psychology principles of
social loafing In social psychology, social loafing is the phenomenon of a person exerting less effort to achieve a goal when they work in a group than when working alone. It is seen as one of the main reasons groups are sometimes less productive than the combin ...
and goal-setting on MovieLens users. The researchers saw that under-contribution seemed to be a problem for the community and set up a study to discern the most effective way to motivate users to rate and review more films. The study executed two field experiments; one involved email messages that reminded users of the uniqueness of their contributions and the benefits that follow from them, and the other gave users a range of individual or group goals for contribution. The first experiment, based on the analysis of the MovieLens community’s cumulative response, found that users were more likely to contribute to the community when they were reminded of their uniqueness, leading them to think that their contributions are not duplicates of what other users are able to provide. Contrary to the researchers’ hypothesis, they also found that users were less likely to contribute when it was made salient to them the benefit they receive from rating or the benefit others receive when they rate. Lastly, they found no support for the relationship between uniqueness and benefit. The second experiment found that users were also more likely to contribute when they were given specific and challenging goals and were led to believe that their contributions were needed in order to accomplish the group’s goal. The study found that in this particular context, giving users group-level goals actually increased contributions compared to individual goals, where the researchers predicted that the reverse would be true due to the effects of social loafing. The relationship between goal difficulty and user contributions in both the group and individual cases gave weak evidence that beyond a certain difficulty threshold, performance drops, instead of plateaus as previously hypothesized in Locke and Latham’s goal-setting theory.


Datasets

GroupLens Research, a human-computer interaction research lab at the
University of Minnesota The University of Minnesota, formally the University of Minnesota, Twin Cities, (UMN Twin Cities, the U of M, or Minnesota) is a public land-grant research university in the Twin Cities of Minneapolis and Saint Paul, Minnesota, United States. ...
, provides the rating data sets collected from MovieLens website for research use. The full data set contains 26,000,000 ratings and 750,000 tag applications applied to 45,000 movies by 270,000 users. It also includes tag genome data with 12 million relevance scores across 1,100 tags (Last updated 8/2017). There are many types of research conducted based on the MovieLens data sets. Liu et al. used MovieLens data sets to test the efficiency of an improved random walk algorithm by depressing the influence of large-degree objects. GroupLens has terms of use for the dataset, an
it accepts requests via the internet


References


External links

* {{official Online film databases Recommender systems American film review websites