VOC

VOC database provides a peephole view into the administrative system of an early multi-national company, the Vereenigde geoctrooieerde Oostindische Compagnie (VOC for short - The (Dutch) East Indian Company) established on March 20, 1602.

Original source: www.monetdb.org

Versions

  • Voc (by Jan Motl)

    • Typos in dates were fixed because MySQL doesn't support so wide range of dates as MonetDB. The rest of the typos (identified by OpenRefine) were not fixed.

Dataset details

Associated task:
Classification
Domain:
Retail
Data types:
Size:
2.7 MB
Count of tables:
8
Count of rows:
29,067
Count of columns:
89
Missing values:
Yes
Compound keys:
Yes
Loops:
No
Type:
Real
Instance count:
8,073
Target table:
voyages
Target column:
arrival_harbour
Target ID:
number, number_sup
Target timestamp:
arrival_date

How to download the dataset

The datasets are publicly available directly from MariaDB database.

  1. Open your favourite MariaDB client (MySQL Workbench works, but see FAQ)
  2. Use following credentials:
    • hostname: db.relational-data.org
    • port: 3306
    • username: guest
    • password: relational
  3. Export "voc" database (or other version of the dataset, if available) in your favourite format (e.g. CSV or SQL dump).