Pubs

The pubs sample database is modeled after a book publishing company.

Original source: www.codeproject.com

Versions

  • Pubs (by Jan Motl)

    • Removed user defined data types

Dataset details

Associated task:
Regression
Domain:
Retail
Data types:
Size:
400 KB
Count of tables:
11
Count of rows:
255
Count of columns:
64
Missing values:
Yes
Compound keys:
No
Loops:
No
Type:
Synthetic
Instance count:
18
Target table:
titles
Target column:
ytd_sales
Target ID:
title_id
Target timestamp:
pubdate

Algorithms

Dataset versionTargetAlgorithmAuthor textMeasureValue
pubsytd_salesPredictor FactoryPredictor FactoryAccuracy0.15

How to download the dataset

The datasets are publicly available directly from MariaDB database.

  1. Open your favourite MariaDB client (MySQL Workbench works, but see FAQ)
  2. Use following credentials:
    • hostname: db.relational-data.org
    • port: 3306
    • username: guest
    • password: relational
  3. Export "pubs" database (or other version of the dataset, if available) in your favourite format (e.g. CSV or SQL dump).