ConsumerExpenditures

Alternative names: PUMD

The Consumer Expenditure Survey (CE) collects data on expenditures, income, and demographics in the United States. The public-use microdata (PUMD) files provide this information for individual respondents without any information that could identify them. PUMD files include adjustments for information that is missing because respondents were unwilling or unable to provide it. The files have also been adjusted to reduce the likelihood of identifying respondents, either directly or through inference. The task is to predict whether an expenditure is a gift. Household IDs change from year to year; this is a property of the data source.

Original source: www.bls.gov

Versions

  • ConsumerExpenditures (by Patrick Urbanke)

    • Imported from CSV. Irrelevant attributes were removed.

Dataset details

Associated task: Classification
Domain: Retail
Data types:
Size: 337.6 MB
Count of tables: 3
Count of rows: 2,241,548
Count of columns: 24
Missing values: Yes
Compound keys: No
Loops: No
Type: Real
Instance count: 2,047,961
Target table: EXPENDITURES
Target column: GIFT
Target ID: EXPENDITURE_ID
Target timestamp: ?

How to download the dataset

The dataset is publicly available directly from a MariaDB database.

  1. Open your favourite MariaDB client (MySQL Workbench works, but see FAQ)
  2. Use the following credentials:
    • hostname: db.relational-data.org
    • port: 3306
    • username: guest
    • password: relational
  3. Export the "ConsumerExpenditures" database (or another version of the dataset, if available) in your favourite format (e.g. CSV or SQL dump).
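
For a scripted alternative to a graphical client, the steps above can be sketched in Python. This is a minimal sketch, not an official export tool: it assumes the third-party PyMySQL driver is installed (`pip install pymysql`) and that the server is reachable with the credentials listed above. The function names `export_table` and `export_database` and the output directory name are made up for illustration. Table names are discovered with `SHOW TABLES` rather than hard-coded, since only EXPENDITURES is named on this page.

```python
import csv
from pathlib import Path

# Credentials exactly as listed in the steps above.
DB_CONFIG = {
    "host": "db.relational-data.org",
    "port": 3306,
    "user": "guest",
    "password": "relational",
    "database": "ConsumerExpenditures",
}


def export_table(conn, table, out_dir):
    """Write all rows of `table` to <out_dir>/<table>.csv; return the row count.

    `conn` can be any DB-API 2.0 connection (hypothetical helper, not part
    of any library).
    """
    out_path = Path(out_dir) / f"{table}.csv"
    cur = conn.cursor()
    try:
        # The table name is expected to come from SHOW TABLES, not user input.
        cur.execute(f"SELECT * FROM `{table}`")
        header = [col[0] for col in cur.description]
        n_rows = 0
        with open(out_path, "w", newline="", encoding="utf-8") as fh:
            writer = csv.writer(fh)
            writer.writerow(header)
            while True:
                # Write in batches so rows are not all held in one Python list.
                batch = cur.fetchmany(10_000)
                if not batch:
                    break
                writer.writerows(batch)
                n_rows += len(batch)
    finally:
        cur.close()
    return n_rows


def export_database(out_dir="ConsumerExpenditures_csv"):
    """Export every table of the database to CSV; return {table: row count}."""
    import pymysql  # assumption: PyMySQL is installed; imported lazily

    Path(out_dir).mkdir(parents=True, exist_ok=True)
    conn = pymysql.connect(**DB_CONFIG)
    try:
        cur = conn.cursor()
        cur.execute("SHOW TABLES")
        tables = [row[0] for row in cur.fetchall()]
        cur.close()
        return {t: export_table(conn, t, out_dir) for t in tables}
    finally:
        conn.close()


# export_database()  # uncomment to run; needs network access and PyMySQL
```

Because `export_table` only relies on the generic DB-API cursor interface, it can be tried against a local database before pointing it at the public server. The full export downloads roughly the 337.6 MB listed above, so it may take a while.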