Imagine a movie with a female vampire, with a lot of chasing and stabbing, some axes and swords, maybe someone wearing leather pants who died at the end from leukaemia, and above all that it’s a sequel, based on a video game and you can’t recognize any of the actors. This is called a recipe for a disaster movie.
No one wants to spend a movie night with popcorn and drinks ready, to watch a bad movie. But how can you tell if it’s bad or good in advance? Well, thanks to Data Mining, now you can!
In the context of the participation to the Linked Data Mining Challenge , My colleague (Emir Munoz) and I ran an experiment on a dataset of 2000 movies to find movie features useful to predict whether a movie is good or bad. With the help of Linked Open Data: DBpedia, IMDB , Metacritics , we discovered that the most important feature is the critics review. If the critics love a movie, it’s great. That simple.
But what if I don’t like the critics’ point of view? They can complicate things sometimes and make a nice simple movie turn out to be a horrible one. That’s where more features come to help. The next important thing in a movie – which we all usually look for- would be the plot of a movie. A simple analysis from our study case showed that good movies plots contain keywords like: “family relationships”, “moral ambiguity”, “very little dialogue” and interestingly, people like drama. Deep dark drama, so keywords like: “frustration”, “crying”, “sleeplessness”, “schizophrenic”, “melancholy” are in the recipe of a good movie.
In addition, we discovered that the most popular genre for a movie is “PG-13”, which is something between “NC-17” movies with highly sexual/violent content and “G” movies for general audience and children appropriate content. Also, movies with famous awarded actors and directors are most likely to succeed if they contained the previous features.
In Summary, you can discover many interesting stuff with mining Linked Data sources, as we did with best movies features, which you can use later on with the aid of Machine Learning techniques to create a classifier for good/bad movies that could save you a disappointing movie nights and much more.
Our approach  will be presented in ESWC’15  as part of the Know@lod workshop. See you there!