Hepatitis

The PKDD'02 Hepatitis dataset describes 206 instances of Hepatitis B, contrasted against 484 cases of Hepatitis C.

Original source: www.cs.sfu.ca (BibTeX)

Versions

  • Hepatitis_std (by Oliver Schulte)

    • It is unknown why only 500 samples are present (instead of 690). Numerical attributes were discretized and typed as varchar.

Dataset details

Associated task: Classification
Domain: Medicine
Data types:
Size: 2.2 MB
Count of tables: 7
Count of rows: 12,927
Count of columns: 26
Missing values: No
Compound keys: No
Loops: No
Type: Real
Instance count: 500
Target table: dispat
Target column: Type
Target ID: m_id
Target timestamp: ?

How to download the dataset

The datasets are publicly available directly from a MariaDB database.

  1. Open your favourite MariaDB client (MySQL Workbench works, but see FAQ)
  2. Use the following credentials:
    • hostname: db.relational-data.org
    • port: 3306
    • username: guest
    • password: relational
  3. Export the "Hepatitis_std" database (or another version of the dataset, if available) in your favourite format (e.g. CSV or SQL dump).
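
For those who prefer a script to a graphical client, the steps above can be sketched in Python. This is a minimal sketch, not an official export tool: it assumes the third-party PyMySQL package is installed, and the names CONNECTION, export_table, export_database and the output directory hepatitis_csv are hypothetical helpers introduced here for illustration. Only the hostname, port, username, password and database name come from the steps above.

```python
import csv
from pathlib import Path

# Credentials exactly as listed in the steps above.
CONNECTION = {
    "host": "db.relational-data.org",
    "port": 3306,
    "user": "guest",
    "password": "relational",
    "database": "Hepatitis_std",
}


def export_table(conn, table, out_dir):
    """Write one table to <out_dir>/<table>.csv via a DB-API 2.0 connection."""
    out_path = Path(out_dir) / f"{table}.csv"
    cur = conn.cursor()
    try:
        # Backticks quote the table name in MariaDB/MySQL.
        cur.execute(f"SELECT * FROM `{table}`")
        header = [col[0] for col in cur.description]
        with out_path.open("w", newline="", encoding="utf-8") as fh:
            writer = csv.writer(fh)
            writer.writerow(header)
            writer.writerows(cur.fetchall())
    finally:
        cur.close()
    return out_path


def export_database(out_dir="hepatitis_csv"):
    """Export every table of the database to CSV (hypothetical helper)."""
    # Assumption: PyMySQL is installed (pip install pymysql).
    import pymysql

    Path(out_dir).mkdir(parents=True, exist_ok=True)
    conn = pymysql.connect(**CONNECTION)
    try:
        with conn.cursor() as cur:
            cur.execute("SHOW TABLES")
            tables = [row[0] for row in cur.fetchall()]
        return [export_table(conn, name, out_dir) for name in tables]
    finally:
        conn.close()
```

Calling export_database() should write one CSV file per table into the chosen directory, provided the server is reachable from your network. The import of PyMySQL is kept inside export_database so that export_table can be reused with any other DB-API connection.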