ConsumerExpenditures
Alternative names: PUMD
The Consumer Expenditure Survey (CE) collects data on expenditures, income, and demographics in the United States. The public-use microdata (PUMD) files provide this information for individual respondents without any information that could identify respondents. PUMD files include adjustments for information that is missing because respondents were unwilling or unable to provide it. The files also have been adjusted to reduce the likelihood of identifying respondents, either directly or through inference. The task is to predict, whether the expenditure is a gift or not. Household ids change from year to year - this is a property of the data source.
Original source: www.bls.gov
Versions
ConsumerExpenditures (by Patrick Urbanke)
- Imported from csv. Irrelevant attributes were removed.
Dataset details
- Associated task:
- Classification
- Domain:
- Retail
- Data types:
- Size:
- 337.6 MB
- Count of tables:
- 3
- Count of rows:
- 2,241,548
- Count of columns:
- 24
- Missing values:
- Yes
- Compound keys:
- No
- Loops:
- No
- Type:
- Real
- Instance count:
- 2,047,961
- Target table:
- EXPENDITURES
- Target column:
- GIFT
- Target ID:
- EXPENDITURE_ID
- Target timestamp:
- ?
How to download the dataset
The datasets are publicly available directly from MariaDB database.
- Open your favourite MariaDB client (MySQL Workbench works, but see FAQ)
- Use following credentials:
- hostname: db.relational-data.org
- port: 3306
- username: guest
- password: relational
- Export "ConsumerExpenditures" database (or other version of the dataset, if available) in your favourite format (e.g. CSV or SQL dump).