Skip to content

I analyzed Spider-Man Movie reviews from IMDb. I employed basic NLP techniques like TF-IDF, Sentiment Analysis and Topic Modelling and I shared the results with solid visualizations. All done with R.

Notifications You must be signed in to change notification settings

Okancan-Balci/IMDB_Spider-Man_Text_Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 

Repository files navigation

IMDB Spider-Man Movies User Reviews EDA/LDA

Data Acquisition

The data was collected with a custom Selenium Webdriver web scraper for IMDb reviews which could be found here.

Data Cleaning

The initial cleaning was done in Python Jupyter Notebook environment and then the cleaned data was stored in a .csv file. Further cleaning and analysis were done with R and the steps could be followed on Table of Contents below.

The rendered notebook is on Kaggle and it can be accessed from here.

Data Analysis

I analyzed Spider-Man movie reviews from IMDb. As well as text analysis I also analyzed review related variables such as Review Rating, Review Helpfulness and Review Date.

In text analysis I used basic Natural Language Processing techniques such as:

  • Counts and Frequency of the Words in a review
  • TF-IDF Analysis
  • Sentiment Analysis
  • Topic Modelling with Latent Dirichlet Allocation (LDA)

Table of Contents

  • Introduction
  • Non-Textual Analysis
  • Text Analysis
  • Topic Modelling
  • Conclusion
  • About

    I analyzed Spider-Man Movie reviews from IMDb. I employed basic NLP techniques like TF-IDF, Sentiment Analysis and Topic Modelling and I shared the results with solid visualizations. All done with R.

    Topics

    Resources

    Stars

    Watchers

    Forks

    Releases

    No releases published

    Packages

    No packages published