Walmart
Walmart challenges participants to accurately predict the sales of 111 potentially weather-sensitive products (like umbrellas, bread, and milk) around the time of major weather events at 45 of their retail locations.
Original source: www.kaggle.com
Versions
Walmart (by Jan Motl)
- Changed the coding of missing values from "M" to NULL
Dataset details
- Associated task:
- Regression
- Domain:
- Retail
- Data types:
- Size:
- 167.3 MB
- Count of tables:
- 4
- Count of rows:
- 4,628,497
- Count of columns:
- 27
- Missing values:
- Yes
- Compound keys:
- No
- Loops:
- No
- Type:
- Real
- Instance count:
- 4,607,680
- Target table:
- train
- Target column:
- units
- Target ID:
- store_nbr, item_nbr
- Target timestamp:
- date
How to download the dataset
The datasets are publicly available directly from MariaDB database.
- Open your favourite MariaDB client (MySQL Workbench works, but see FAQ)
- Use following credentials:
- hostname: db.relational-data.org
- port: 3306
- username: guest
- password: relational
- Export "Walmart" database (or other version of the dataset, if available) in your favourite format (e.g. CSV or SQL dump).