Seznam

Seznam.cz is a web portal and search engine in the Czech Republic. The data represent online advertisement expenditures from Seznam's "wallet". Table description: client: location and domain field of the client (anonymized) dobito: prepaid into a wallet in Czech currency probehnuto: charged from the wallet in Czech currency probehnuto_mimo_penezenku: charged in Czech currency, but not from the wallet

Original source: datafestak-us.s3.amazonaws.com

Versions

  • Seznam (by Jan Motl)

Dataset details

Associated task:
Regression
Domain:
Retail
Data types:
Size:
146.8 MB
Count of tables:
4
Count of rows:
2,681,983
Count of columns:
14
Missing values:
Yes
Compound keys:
No
Loops:
No
Type:
Real
Instance count:
1,458,233
Target table:
probehnuto
Target column:
kc_proklikano
Target ID:
client_id, sluzba
Target timestamp:
month_year_datum_transakce

Algorithms

Dataset versionTargetAlgorithmAuthor textMeasureValue
Seznamkc_proklikanoFastPropgetML: Feature Learning with AutoML to build end-to-end prediction pipelinesR20.7822
Seznamkc_proklikanoDeep Feature SynthesisfeaturetoolsR20.6324
Seznamkc_proklikanoFastPropgetML: Feature Learning with AutoML to build end-to-end prediction pipelinesRMSE24480
Seznamkc_proklikanoDeep Feature SynthesisfeaturetoolsRMSE31655
Seznamkc_proklikanoFastPropgetML: Feature Learning with AutoML to build end-to-end prediction pipelinesMAE3160
Seznamkc_proklikanoDeep Feature SynthesisfeaturetoolsRMSE5167

How to download the dataset

The datasets are publicly available directly from MariaDB database.

  1. Open your favourite MariaDB client (MySQL Workbench works, but see FAQ)
  2. Use following credentials:
    • hostname: db.relational-data.org
    • port: 3306
    • username: guest
    • password: relational
  3. Export "Seznam" database (or other version of the dataset, if available) in your favourite format (e.g. CSV or SQL dump).