Microdata
- Microdata are the individual records or observations within a dataset, each describing a single entity.
- Each record contains multiple variables (e.g., employment status, education level, income).
- Microdata enable detailed analysis of relationships and variation across subgroups, and help control for confounding factors.
Definition
Section titled “Definition”Microdata is a term used to refer to the individual observations or records in a dataset. These observations typically consist of multiple variables or pieces of information about a single entity, such as an individual person, household, or business.
Explanation
Section titled “Explanation”Microdata records contain several variables describing one entity. Because they represent individual observations rather than aggregates, researchers can analyze how variables relate to one another and how they vary across subgroups. Microdata are commonly collected through surveys or other data-collection methods, such as census data. Using microdata allows researchers to control for confounding factors (for example, controlling for size and location when studying home sale prices) and to detect trends or patterns that may not appear in aggregate statistics.
Examples
Section titled “Examples”Individual-level survey data
Section titled “Individual-level survey data”A dataset containing information about the employment status, education level, and income of a sample of individuals in a particular city. Each individual in the dataset is represented by a single record or observation, with their specific values for the variables of interest.
Housing transaction data
Section titled “Housing transaction data”A dataset containing information about the prices and characteristics of homes sold in a particular neighborhood. Each record in the dataset represents a single home, with variables such as the sale price, number of bedrooms and bathrooms, square footage, and type of heating system.
Use cases
Section titled “Use cases”- Statistical analysis and survey research using individual-level records.
- Data collected via surveys or census data for studying characteristics and behaviors of entities.
- Exploring relationships among variables and variation across subgroups.
- Controlling for confounding variables to better understand influences on outcomes (for example, size and location when analyzing sale prices).
Related terms
Section titled “Related terms”- Dataset
- Observation
- Record
- Variable
- Aggregate statistics
- Survey
- Census