Econometrics I
Department of Economics, WU Vienna
Partly based on a slide set Simon Heß, with additional thanks to Gustav Pirich, Lucas Unterweger and Fynn Lohre for their inputs
March 6, 2025
Econometrics is a subfield of economics.
We deal with economic questions.
Econometrics is a kind of applied statistics.
We use statistical methods to test hypotheses.
Econometrics is a subfield of economics.
We deal with economic questions.
Econometrics is a kind of applied statistics.
We use statistical methods to test hypotheses.
Econometrics differs from mathematical statistics mainly in its focus on the problems associated with the use of non-experimental data.
What does such an economic question look like?
We need to think carefully about what specific question we want to investigate.
We could formulate our question like this:
If a worker takes advantage of educational leave, does their wage increase over the course of their career?
We assume the following model:
\[ \mathrm{Wage} = f\left(\mathrm{Education},\mathrm{Experience},\mathrm{Talent},\mathrm{EducationalLeave},\dots\right) \]
Wages depend on our variable of interest, the use of educational leave, but also on a number of other factors.
How does the variable education differ from the variable talent?
Test and falsify economic theories
Do households save more when interest rates rise?
Do countries converge to a common equilibrium?
Quantify relationships between economic variables
What is the causal effect of education on wages?
How large is the average gender pay gap?
Evaluate policy measures
Does a minimum wage reduce unemployment?
Does a reduction in class size have different effects on male and female students?
Predictions and forecasts
How much will GDP grow next year?
How volatile will stock markets be next week?
Suppose we are tasked with investigating:
Does the average class size in a district influence test performance?
… and if so, by how much?
As before, we can assume there are both observed and unobserved influencing factors.
CASchools
is a dataset on math and reading test scores from 420 California schools in 1999. So let’s create a plot.
First, we prepare our data.
What does this plot tell us? Not much.
We face two major problems with this analysis:
In Econometrics I, Econometrics II, and Applied Econometrics, we learn step by step how to address these issues. By the end of these three courses, we are able to independently answer econometric research questions.
One additional year of education leads to an average 20% increase in wages.
People who have one more year of education earn on average 20% more.
Do these two statements mean the same thing? No. 🙃
As economists, we are often interested in causal effects, where one variable affects another variable.
Informal Definition: Causality
We speak of a causal effect when the isolated change of a variable has a direct, measurable effect on another variable.
Let’s take the example of fertilizer and agricultural yields. How could we isolate a causal effect here?
Let’s take the example of fertilizer and agricultural yields. How could we isolate a causal effect here?
Let’s take the example of fertilizer and agricultural yields. How could we isolate a causal effect here?
Randomized Controlled Trials (RCTs)
We assign an intervention to a randomly selected study group. A control group does not receive the intervention. Such a study approximates a natural science experiment.
Under certain assumptions, our results are valid:
We can’t always run an experiment (RCT or lab experiment). There are
Coville et al. (2020) want to find out if people who haven’t paid their water bills pay faster when their water is shut off.
Coville et al. (2020) want to find out if people who haven’t paid their water bills pay faster when their water is shut off.
Cohen & Dupas (2008) examine whether co-payments for malaria nets reduce “wasteful” usage.
Cohen & Dupas (2008) examine whether co-payments for malaria nets reduce “wasteful” usage.
In many cases, conducting an experiment is unrealistic. In other cases, it is ethically questionable.
So we often rely on observational data.
Experiments are becoming more common in economic research, but like other social sciences, we usually work with observational data.
Back to our model for educational leave:
\[ \mathrm{Wage} = f\left(\mathrm{Education},\mathrm{Experience},\mathrm{Talent},\mathrm{EducationalLeave,\dots}\right) \]
What would a dataset look like for studying such a question?
? | Wage | Education | Experience | Educational Leave |
---|---|---|---|---|
1 | 15 | 12 | 9 | Yes |
2 | 21 | 14 | 2 | No |
3 | 14 | 11 | 7 | No |
4 | 18 | 9 | 22 | No |
5 | … | … | … | … |
In this dataset, columns are variables and rows are observations.
Individuals | Wage | Education | Experience | Educational Leave |
---|---|---|---|---|
i = 1 | 15 | 12 | 9 | Yes |
i = 2 | 21 | 14 | 2 | No |
i = 3 | 14 | 11 | 7 | No |
i = 4 | 18 | 9 | 22 | No |
i = 5 | … | … | … | … |
Cross-Sectional Data
Cross-sectional data consists of a sample of individuals, households, firms, cities, countries, etc., for which data is collected at one point in time. We use index \(i\) for each observation. The number of observations is denoted \(N\).
Time Points | Wage | Education | Experience | Educational Leave |
---|---|---|---|---|
t = 2021 | 0 | 8 | 0 | No |
t = 2022 | 0 | 9 | 0 | No |
t = 2023 | 12 | 10 | 1 | No |
t = 2024 | 14 | 10 | 2 | Yes |
t = 2025 | … | … | … | … |
Time Series Data
Time series data consists of a sequence of time points at which data is collected on the same individual or unit. We use index \(t\) for each observation. The number of observations is denoted \(T\).
Individuals | Time Points | Wage | Education | Experience | Educational Leave |
---|---|---|---|---|---|
i = 1 | t = 2023 | 20 | 14 | 1 | No |
i = 2 | t = 2023 | 12 | 10 | 1 | No |
i = 1 | t = 2024 | 21 | 14 | 2 | No |
i = 2 | t = 2024 | 14 | 10 | 2 | No |
i = 1 | t = 2025 | … | … | … | … |
Panel Data
Panel data includes both a cross-sectional and time component. Each observation is indexed by \(i\) and \(t\). We observe \(N\) units over \(T\) periods, for a total of \(NT\) observations.