Last Updated on April 30, 2023 by mishou

I. Where is IMDb Datasets?

IMDb Datasets are here:https://www.imdb.com/interfaces/https://datasets.imdbws.com/

II. Download a …tsv file and extract it

Run the command on Google Colaboratory when you want to download the name.basics.tsv.gz file.

!wget https://datasets.imdbws.com/name.basics.tsv.gz !wget https://datasets.imdbws.com/title.principals.tsv.gz

Run the command to extract GZ file:

!gunzip name.basics.tsv.gz !gunzip title.principals.tsv.gz

III. Load the data with Pandas

import pandas as pd df_name = pd.read_csv('/content/name.basics.tsv', sep = '\t') df_principals = pd.read_csv('/content/title.principals.tsv', sep = '\t')

IV. Calculate ages

You can see all the scripts here on Google Colaboratory: https://colab.research.google.com/drive/1MMWbMtZFHJmeN93t14UZiEYWrDctmIeO?usp=sharingTo be continued.

By mishou

Leave a Reply

Your email address will not be published. Required fields are marked *