IMDb

The IMDb database: moderately large, real database of movies.

Original source: kt.ijs.si

Versions

  • Imdb_ijs (by Janez Kranjc)

    • Added foreign key constrains by removal of violating samples. Note that accented chars are missing in "movies"."name" column. Unfortunatelly, accented chars are already missing in the original sql dump.
  • IMDB_1R (by Oliver Schulte)

  • Imdb_small (by Jan Motl)

Dataset details

Associated task:
Classification
Domain:
Entertainment
Data types:
Size:
477.1 MB
Count of tables:
7
Count of rows:
5,694,919
Count of columns:
21
Missing values:
Yes
Compound keys:
No
Loops:
No
Type:
Real
Instance count:
738,576
Target table:
actors
Target column:
gender
Target ID:
?
Target timestamp:
?

How to download the dataset

The datasets are publicly available directly from MariaDB database.

  1. Open your favourite MariaDB client (MySQL Workbench works, but see FAQ)
  2. Use following credentials:
    • hostname: db.relational-data.org
    • port: 3306
    • username: guest
    • password: relational
  3. Export "imdb_ijs" database (or other version of the dataset, if available) in your favourite format (e.g. CSV or SQL dump).