when we observe many observational units over several time periods.
we can remove omitted variables that break the exogeneity assumption, which makes causal analysis possible
wide is where we have one row per unit, with the time periods spread across columns; common in cross-sectional data. Long is where we have one row per unit and time period, with the observations sorted by unit (e.g. A A A, B B B) and then by time period
it describes the evolution of unit characteristics over time for many units
it is the process of rearranging data, for example from wide format to long format
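a minimal sketch of the wide-to-long reshape in Stata, assuming a unit identifier called unit and wide variables such as tax1970, tax1998, fr1970, fr1998 (the identifier name is my assumption):
* go from one row per unit (wide) to one row per unit-year (long)
reshape long tax fr, i(unit) j(year)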
it means that the rows are statistically independent, i.e. period 2 has no "memory" of what happened in period 1; their correlation is zero.
it means that every row is sampled from the same population.
there are omitted variables that are important, but they are unobserved. Since they are important, they sit in the error term and correlate with the regressor and the outcome variable. We therefore split the error into two parts, A and U: A contains the unobserved variables that remain fixed and do not change over time, and the remaining U no longer correlates with the regressors, so our exogeneity assumption holds. We can then use the two different transformations to get rid of A and estimate the causal effect.
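in model form (my notation, anticipating the tax/fr example used further down): fr_it = B1*tax_it + A_i + U_it, where A_i collects the unobserved factors that are fixed over time for unit i and U_it is the remaining time-varying error.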
it is "i" that contains every same "i" in t periods. It is called observing one unit at two different times.
by jumping from unit to unit.
there can be correlation within units, e.g. the observations for unit A in periods t = 1, 2, ... can be correlated with each other, but unit A cannot be correlated with unit B. We can therefore think of one unit as one cross-sectional observation.
When we subtract the regression model for period 1 from the model for period 2, the variables without a time subscript (the fixed effect A) disappear
change in Y = B1*change in variable of interest + change in u
1) A is eliminated, so we can estimate the model with OLS
2) B1 from OLS reflects the causal effect, provided the assumptions hold; we have to check them:
- exogeneity assumption E[change in u | change in x1] = 0
- full rank assumption: var(change in x1) > 0, which states that there is positive probability that x1 changes between the two time periods
- random sampling: we observe the change in Y for every cross-sectional unit, and these units are independent due to the random sampling, so it holds
we could observe more than two time periods to get more information, although because of potential serial correlation each extra period may add less information than an independent observation would
we want a large cross-section dimension, so we want lots of units
that the fixed-effect assumption is plausible, i.e. we have to study an economic environment where it is reasonable that the unobserved omitted factors are fixed over time.
generate new variables for the change in the outcome and the change in the variable of interest, and then run the regression with these differenced variables instead.
gen d_tax = tax1998-tax1970
gen d_fr = fr1998-fr1970
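with the differenced variables in place, the regression itself is one line (a minimal sketch using the variables generated above):
* OLS on the first-differenced data: change in fr on change in tax
regress d_fr d_tax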
you compute the averages within each unit and then subtract the unit-specific mean from each time-period observation within that unit.
the formula then becomes:
fr = B1*tax + A + U
-
fr_bar = B1*tax_bar + A + U_bar
=
fr_tilde = B1*tax_tilde + U_tilde
we get rid of the fixed effect A and obtain a B1 that can estimate the causal effect, but we have to check that it is a good estimate
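a minimal sketch of the same estimation done directly in Stata, assuming long-format data with identifier variables called unit and year (those names are my assumption):
* declare the panel structure
xtset unit year
* within (fixed-effects) estimator: fr on tax, which eliminates A
xtreg fr tax, fe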
The first-difference transformation gives only one regression equation per unit (change in Y on change in x1), while the fixed-effect transformation gives one regression equation per unit for each time period, i.e. two per unit here: the ordinary equation minus the averaged equation.
Stata will compute standard errors under the assumption that all observations from one unit are statistically independent across time periods, t ≠ t', which also says that cov(U_t, U_t') = 0, so there is no correlation between the error terms
that a positive shock today does not increase the probability of a (continued) positive shock tomorrow. Under this assumption such persistence does not occur, and the condition cov(U_t, U_t') = 0 rules out serial correlation
it is better to assume cross-sectional independence, i.e. that U and U' are statistically independent between units i ≠ i', rather than across time periods within a unit. So we now assume instead that there is no correlation between units, but correlation is allowed within units.
it is when standard errors are computed under the assumption that certain blocks of observations exhibit correlation. These blocks of correlated observations are called "clusters"
that all observations (across t) of one unit form a cluster, so each unit is one cluster consisting of all of that unit's observations.
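a minimal sketch of how the clustering is requested in Stata, again assuming the panel identifiers unit and year from the sketch above:
* cluster the standard errors at the unit level, allowing correlation within a unit over time
xtreg fr tax, fe vce(cluster unit)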
an additional time period may be correlated with the time periods already observed, so it does not provide much new information.
With clustered standard errors, Stata will account for however much correlation there actually is within each cluster!
less information -> less precise -> larger SE
then we could be too optimistic about our relatively small SEs, since OLS assumes statistically independent observations, and believe that adding more time periods gives us more information, when in fact it does not.