This Project for Udacity course of Data Analysis Nonodgree. Second Project
Project: TMDb Movie Data Analysis
We always find behind each art or product someone who works with infatuation and passion and shows to us what we do not see, therefore the author is that person who has a great interest in films and watching. And He asks us: What shows us whether the film is good or bad? there are should be factors that influence this. Through that, I tried to present something good through this project and find some a logical analysis that adds some good notes and I present them to author.
A note specifying which dataset you analyzed
Depending on link of Google doc. Then, I chose TMDb movie data. This data set contains information more than 10,000 movies collected from The Movie Database (TMDb), including user ratings and revenue.
In the beginning, I knew the dataset, and then to questin what it was written and what the head of each column was and what if we could discover and see logical information, for example the value of revenue each movies in the year of issue etc. therefore decided my steps the following:
Input varibles id imdb_id popularity budget revenue original_title cast homepage director tagline kewords overview runtime genres production_companies release_date vote_count vote_average release_year budget_adj revenue_adj A statement of the question(s) you posed
I set brainstorming session some questions I could answer using the data set I chose, then I start answering those questions.
- What are ten movies Vs highest profit for each one?
- We note Avatar movie is top one of profit where is more than billion and half.
- Which movie title had the longest run time?
- There is Band of Brothers longest run time
- What is year has the muchest recording of movies?
- There is 558 poducted movies at year of 2015
- A description of what you did to investigate those questions
Top revenue
Top revenue | original_title | id |
---|---|---|
2781505847 | Avatar | 19995 |
Top profitable
Top profit | original_title | runtime |
---|---|---|
2544505847 | Avatar | 162 |
Top number release movies at year
Top release_year | Number of Movies |
---|---|
2011 | 219 |
Top runtime movie
Top runtime | original_title |
---|---|
705 | Band of Brothers |
1.https://mode.com/example-gallery/python_horizontal_bar/
2.https://stackoverflow.com/search?q=sort+number+with+df+python
3.https://pandas.pydata.org/pandas-docs/stable/getting_started/basics.html?highlight=delete%20row
4.https://stackoverflow.com/questions/11285613/selecting-multiple-columns-in-a-pandas-dataframe
5.https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.plot.bar.html