Walmart

Walmart challenges participants to accurately predict the sales of 111 potentially weather-sensitive products (like umbrellas, bread, and milk) around the time of major weather events at 45 of their retail locations.

Original source: www.kaggle.com

Versions

  • Walmart (by Jan Motl)

    • Changed the coding of missing values from "M" to NULL

Dataset details

Associated task:
Regression
Domain:
Retail
Data types:
Size:
167.3 MB
Count of tables:
4
Count of rows:
4,628,497
Count of columns:
27
Missing values:
Yes
Compound keys:
No
Loops:
No
Type:
Real
Instance count:
4,607,680
Target table:
train
Target column:
units
Target ID:
store_nbr, item_nbr
Target timestamp:
date

How to download the dataset

The datasets are publicly available directly from MariaDB database.

  1. Open your favourite MariaDB client (MySQL Workbench works, but see FAQ)
  2. Use following credentials:
    • hostname: db.relational-data.org
    • port: 3306
    • username: guest
    • password: relational
  3. Export "Walmart" database (or other version of the dataset, if available) in your favourite format (e.g. CSV or SQL dump).