Pandas
Purpose
The purpose of this final project is to have you use Python language constructs and modules that you have learned throughout the semster to accurately complete this final lab.
To show a complete understanding of the concepts and constructs of the Python programming language. The main focus of the assignment will be on Pandas, but will also include concepts from throughout the semster.
Partners
You are encouraged to work with one other partner on this assignment.
If you work with a partner, you must both be in the same lab section.
Put your names at the top of your assignment and include your lab section.
Starting materials
Download the movies.py file onto your lab computer.
You will also need to download movie_metadata.csv and place it in the same folder.
Lasly, download sample_output.txt to see example output for all the tasks.
Final Project
In this assignment, you will be using python Pandas library to manipulate data from a CSV file. The data that you will be manipulating holds information from over 5000 movies in IMDB. How can we tell what the greatest movie of all time is? You will be able to anwser this question once you have completed this final project. For this assignment, you will be writing your code in the provided movies.py file.
- Task1:
Start by implementing the functions new_csv(). import_csv() is used to import the file movies.csv which is a cleaned up version made by new_csv() from the original movie_metadata.csv file. The new csv file will only contain columns that are used in the rest of the program.
- Task2:
Next implement the function top10movies() which will give back the top 10 movies of all times ranked by either budget, gross, IMDB score, or facebook likes, depending on the user choice. The top 10 ranking must be printed in a neat manner within python shell.
- Task3:
Now write the function showMovieInfo() that will print out the information for a specfic movie such as "Interstellar", for example. The summary should tell the user who directed the movie, where the movie was released, the year it was made, the budget for the film, how much of a profit it made, the IMDB score, and the amount of likes that it has on Facebook.
- Task4:
Implement the function analyseMovies() which will prompt the user for a Country and a Year, and then based on that print a short summary about the movies filmed in that country in the specified year. The summary should contain at least: Number of movies filmed, movie with largest budget, and movie with smallest budget.
- Task5:
Lastly, implement the function topDirectors() which will print out a list of 10 directors that directed the most movies. It will also print out information about the movies which that director directed: average budget, average imdb score, and average number of likes
- Note:
You will find more informations on the functions in the comments in the starting code.
Grading - 100 points possible
- 10 points.
Task 1
- 15 points.
Task 2
- 15 points.
Task 3
- 25 points.
Task 4
- 25 points.
Task 5
- 10 points.
Ten points will be given for adding comments explaining implementation of your program.
Helpful Links