Prediction of Movie Success Using Classification

Authors

  •   Ritu Jhalani Assistant Professor (Computer Science), International School of Informatics & Management, Sector-12 , Mahaveer Marg, Mansarovar, Jaipur - 302020, Rajasthan
  •   Harshita Virwani M.C.A. Student, The IIS University, Sector-12, Mahaveer Marg, Mansarovar, Jaipur - 302020, Rajasthan
  •   Divyanshi Goyal M.C.A. Student, The IIS University, Sector-12, Mahaveer Marg, Mansarovar, Jaipur - 302020, Rajasthan
  •   Somya Vashishtha M.C.A. Student, The IIS University, Sector-12, Mahaveer Marg, Mansarovar, Jaipur - 302020, Rajasthan

DOI:

https://doi.org/10.17010/ijcs/2018/v3/i6/141443

Keywords:

Classification

, Data Mining, Decision Tree, Naïve Bayes, Orange Tool.

Manuscript received September 2

, 2018, revised September 20, accepted October 10, 2018. Date of publication November 6, 2018

Abstract

In the film industry, the largest producer of films in the world is India. The Indian film industry was established in 1913 and is the second oldest in the world. India was the third largest with box office revenue of US $ 2.18 billion in 2017. The Indian film industry is multi-lingual. Hindi film Industry is the largest film industry in India and is mostly based in Mumbai (Bombay), which is referred to as "Bollywood". This paper attempts to predict whether an upcoming movie would be a blockbuster, neutral or a flop. By predicting this, it can help production houses in advertising and to find the best time period to release a movie by looking at the overall environment. This paper proposes making use of classification technique of Data Mining i.e. Naïve Bayes Theorem and Decision Tree on Data Mining Tool named Orange. Data mining is a process to transform raw data into useful information. By applying data mining, we can discover a large set of patterned data. Machine Learning Statistics and Database Systems are involved in Data Mining. In Knowledge Discovery (KDD) process, data mining is an analysis step. Classification helps us to classify the data according to the attributes of the data with respect to a predefined set of classes. Naive Bayes is a theorem of data mining which is able to predict categorical class labels (blockbuster, neutral, and flop) that classifies data according to rating, month, year of release, genres such as drama, action, romance, comedy, mystery, thriller, and other attributes, and values to classify an upcoming movie. Decision tree helps in supervised learning by creating a training model which can help us predict class values by learning decisions from prior data. To perform the research, we used a Data Mining Tool named Orange which is an open source component-based Visual Programming Software Package for data visualization, machine learning, data mining, and data analysis.

Downloads

Download data is not yet available.

Downloads

Published

2018-12-23

How to Cite

Jhalani, R., Virwani, H., Goyal, D., & Vashishtha, S. (2018). Prediction of Movie Success Using Classification. Indian Journal of Computer Science, 3(6), 7–12. https://doi.org/10.17010/ijcs/2018/v3/i6/141443

References

"List of Bollywood films of 2018," Wikipedia. [Online]. Available: https://en.wikipedia.org/wiki/List_of_Bollywood_films_of_2018

"List of Bollywood films," Wikipedia. [Online]. Available: https://en.wikipedia.org/wiki/Lists_of_Bollywood_films

V. R. Nithin, M. Pranav, P. B. Sarath, and Lijiya, "Predicting movie success based on IMDB Data," 2014.

J. Ericson and J. Goodman, "A predictor for movie success," 2013.

S. Pramod, A. Joshi and A. G. Mary, "Prediction of movie success for real world movie dataset," Int. J. of Advance Res., Ideas and Innovations in Technol., vol. 3, no. 3, 2017. [Online]. Available: https://www.ijariit.com/manuscripts/v3i3/V3I3-1228.pdf

M. H. Latif and H. Afzal, "Prediction of movies popularity using machine learning techniques," Int. J. of Comput. Sci. and Network Security, vol. 16, no. 8, 127-131, 2016.

M. Saraee, S. White and J. Eccleston, "A data mining approach to analysis and prediction of movie ratings," in The 5th Int. Conf. on Data Mining, Text Mining, and their Bus. Appl., 2004. [Online]: Available: http://usir.salford.ac.uk/18838/1/Wessex_movie.pdf