Sakila

The venerable sakila test database: small, fake database of movies.

Original source: dev.mysql.com

Versions

  • Sakila (by Jan Motl)

Dataset details

Associated task:
Regression
Domain:
Retail
Data types:
Size:
6.4 MB
Count of tables:
16
Count of rows:
47,010
Count of columns:
89
Missing values:
Yes
Compound keys:
No
Loops:
Yes
Type:
Synthetic
Instance count:
15,991
Target table:
payment
Target column:
amount
Target ID:
payment_id
Target timestamp:
payment_date

Algorithms

Dataset versionTargetAlgorithmAuthor textMeasureValue
sakilaamountPredictor FactoryPredictor FactoryAccuracy0.5909

How to download the dataset

The datasets are publicly available directly from MariaDB database.

  1. Open your favourite MariaDB client (MySQL Workbench works, but see FAQ)
  2. Use following credentials:
    • hostname: db.relational-data.org
    • port: 3306
    • username: guest
    • password: relational
  3. Export "sakila" database (or other version of the dataset, if available) in your favourite format (e.g. CSV or SQL dump).