w3resource

Python Pandas IMDb Movies Data: Exercises, Practice, Solution

[An editor is available at the bottom of the page to write and execute the scripts.]

Sample Table (based on IMDb - movies csv):
Download movies_metadata.csv table (50) records from here.

Pandas IMDb Movies Data Analysis [17 exercises with solution]

1. Write a Python Pandas program to get the columns of the DataFrame (movies_metadata.csv file). Go to the editor
Click me to see the sample solution

2. Write a Pandas program to get the information of the DataFrame (movies_metadata.csv file)including data types and memory usage. Go to the editor
Click me to see the sample solution

3. Write a Pandas program to get the details of the third movie of the DataFrame (movies_metadata.csv file). Go to the editor
Click me to see the sample solution

4. Write a Pandas program to count the number of rows and columns of the DataFrame (movies_metadata.csv file). Go to the editor
Click me to see the sample solution

5. Write a Pandas program to get the details of the columns title and genres of the DataFrame. Go to the editor
Click me to see the sample solution

6. Write a Pandas program to get the details of the movie with title 'Grumpier Old Men'. Go to the editor
Click me to see the sample solution

7. Write a Pandas program to get the details of  fifth movie of the DataFrame. Go to the editor
Click me to see the sample solution

8. Write a Pandas program to create a smaller DataFrame with a subset of all features. Go to the editor
Click me to see the sample solution

9. Write a Pandas program to display the first 10 rows of the DataFrame. Go to the editor
Click me to see the sample solution

10. Write a Pandas program to sort the DataFrame based on release_date. Go to the editor
Click me to see the sample solution

11. Write a Pandas program to access those movies, released after 1995-01-01.Go to the editor
Click me to see the sample solution

12. Write a Pandas program to sort movies on runtime in descending order. Go to the editor
Click me to see the sample solution

13. Write a Pandas program to get those movies whose revenue more than 2 million and spent less than 1 million. Go to the editor
Click me to see the sample solution

14. Write a Pandas program to get the longest runtime and shortest runtime. Go to the editor
Click me to see the sample solution

15. Write a Pandas program to calculate the number of votes garnered by the 70% movie. Go to the editor
Click me to see the sample solution

16. Write a Pandas program to display the movies (title, runtime) longer than 30 minutes and shorter than 360 minutes. Go to the editor
Click me to see the sample solution

17. Write a Pandas program to display the movies (title, number of votes) that received specified number of votes. Go to the editor
Click me to see the sample solution

Python Code Editor:


More to Come !

Do not submit any solution of the above exercises at here, if you want to contribute go to the appropriate exercise page.

Test your Python skills with w3resource's quiz



Python: Tips of the Day

Calling an external command from Python:

The subprocess module in the standard library:

Example:

import subprocess
subprocess.run(["ls", "-l"])

The advantage of subprocess vs. system is that it is more flexible (you can get the stdout, stderr, the "real" status code, better error handling, etc...).

The official documentation recommends the subprocess module over the alternative os.system():

The subprocess module provides more powerful facilities for spawning new processes and retrieving their results; using that module is preferable to using this function [os.system()].

The Replacing Older Functions with the subprocess Module section in the subprocess documentation may have some helpful recipes.

For versions of Python before 3.5, use call:

import subprocess
subprocess.call(["ls", "-l"])

Ref: https://bit.ly/2ANeuPR