Data Archive - Download

The Stanford Education Data Archive (SEDA) includes a number of publicly available data files, listed below. Most files are available in Stata (v13) and .csv formats. Codebooks and documentation are available for download as well.

In publications, please cite the data as:

Sean F. Reardon, Demetra Kalogrides, Andrew Ho, Ben Shear, Kenneth Shores, Erin Fahle. (2016). Stanford Education Data Archive (Version 1.1 File Title). http://purl.stanford.edu/db586ns4974.

If you have questions or note errors in the data, please contact us at sedasupport@stanford.edu

Notes

The data that are currently available include district level racial/ethnic achievement gaps, district level average achievement, and district level demographic/socioeconomic data. The most recent release (currently, Version 1.1) should always be used for reporting and analysis. Previous versions of the data are still available to facilitate research replication

District level average achievement data were reported in the New York Times Upshot article in "grade equivalent'" units. Achievement gap data available for download here are reported in standard deviation or "effect size'" units. Achievement gaps are not currently available in the metric of grade equivalent units, and race-specific mean scores are forthcoming. For interpretability, roughly speaking, 1 standard deviation is (very roughly) equivalent to approximately 3 grade equivalents.

We have added a data file that shows which schools are included in each district in our estimates. Interested parties can download this document and check for themselves whether the school has been properly assigned to its geographic district. While every effort has been made to place schools in their proper geographic district, please contact sedasupport@stanford.edu if you note an error.

Version 1.0
Data description Download Documentation
This file contains district level means in grade equivalent units. There are multiple observations per district; one for each year, grade and subject. Stata Excel CSV Codebook
This file contains district level means in grade equivalent units. There are multiple observations per district, one for each subject; values are averaged across years and grades. Stata Excel
This file contains district level means in grade equivalent units. There is one observations per district; values are averaged across years, grades and subjects. Stata Excel
This file contains district level means in constant population standard deviation units. There are multiple observations per district; one for each year, grade and subject. Stata Excel CSV
This file contains district level means in constant population standard deviation units. There are multiple observations per district; one for each subject; values are averaged across years and grades. Stata Excel
This file contains district level means in constant population standard deviation units. There is one observations per district; values are averaged across years, grades and subjects. Stata Excel
This file contains district level means in NAEP-referenced units. Estimates are comparable between states. There are multiple observations per district; one for each year, grade and subject. Stata Excel CSV
This file contains district level means in state-referenced units. Estimates are comparable within states. There are multiple observations per district; one for each year, grade and subject. Stata Excel CSV
This file contains district level white-black and white-Hispanic achievement gaps. There are multiple observations per district; one for each year, grade and subject. Stata Excel CSV
This file contains district level white-black and white-Hispanic achievement gaps. There are multiple observations per district; one for each subject; values are averaged across years and grades. Stata Excel
This file contains district level white-black and white-Hispanic achievement gaps. There is one observations per district; values are averaged across years, grades and subjects. Stata Excel
This file contains district level covariates (socioeconomic, demographic, school level data). There are multiple observations per district; one for each year and grade. Stata CSV Codebook
This file contains district level covariates (socioeconomic, demographic, school level data). There are multiple observations per district; one for each year. Stata CSV
This file contains district level covariates (socioeconomic, demographic, school level data). There is one observation per district. Stata Excel
This file contains a unique school identifier, an identifier indicating its NCES ID (the district to which it legally belongs), and the district in which it is included in our estimates. There is one observation per school. Stata CSV Documentation
Version 1.1

Technical Documentation

Assessment Outcomes: Means and Standard Errors 
Data Description Disaggregated by Download Documentation
File Title Description Metric Geographic Level Year Grade Subject      
MeanA_V1.1 This file contains district level means in grade equivalent units. There are multiple observations per district; one for each year, grade and subject. Grade Equivilant Units District x x x Stata CSV Codebook
MeanB_V1.1 This file contains district level means in grade equivalent units. There are multiple observations per district, one for each subject; values are averaged across years and grades. Grade Equivilant Units  District     x Stata CSV
MeanC_V1.1 This file contains district level means in grade equivalent units. There is one observations per district; values are averaged across years, grades and subjects. Grade Equivilant Units District       Stata CSV
MeanD_V1.1 This file contains district level means in constant population standard deviation units. There are multiple observations per district; one for each year, grade and subject. Standard Deviation Units District x x x Stata CSV
MeanE_V1.1 This file contains district level means in constant population standard deviation units. There are multiple observations per district; one for each subject; values are averaged across years and grades. Standard Deviation Units District     x Stata CSV
MeanF_V1.1 This file contains district level means in constant population standard deviation units. There is one observations per district; values are averaged across years, grades and subjects. Standard Deviation Units District       Stata CSV
MeanG_V1.1 This file contains district level means in NAEP-referenced units. Estimates are comparable between states. There are multiple observations per district; one for each year, grade and subject. NAEP  District x x x Stata CSV
MeanH_V1.1 This file contains district level means in state-referenced units. Estimates are comparable within states. There are multiple observations per district; one for each year, grade and subject. State Referenced District x x x Stata CSV
Assessment Outcomes: Achievement Gaps
Data Description  Disaggregated by Dowload  Documentation
File Title Description Metric Geographic Level Year Grade Subject      
GapA_V1.1 This file contains district level white-black and white-Hispanic achievement gaps. There are multiple observations per district; one for each year, grade and subject. Standard Deviation Units District x x x Stata CSV Codebook
GapB_V1.1 This file contains district level white-black and white-Hispanic achievement gaps. There are multiple observations per district; one for each subject; values are averaged across years and grades. Standard Deviation Units District     x Stata CSV
GapC_V1.1 This file contains district level white-black and white-Hispanic achievement gaps. There is one observations per district; values are averaged across years, grades and subjects. Standard Deviation Units District       Stata CSV
Covariates 
Data Description  Disaggregated by Dowload  Documentation
File Title Description Metric  Geographic Level Year Grade Subject      
CovA_V1.1 This file contains district level covariates (socioeconomic, demographic, school level data). There are multiple observations per district; one for each year and grade. - District x x   Stata CSV Codebook
CovB_V1.1 This file contains district level covariates (socioeconomic, demographic, school level data). There are multiple observations per district; one for each year. - District x     Stata CSV
CovC_V1.1 This file contains district level covariates (socioeconomic, demographic, school level data). There is one observation per district. - District       Stata CSV
Ancillary Files
Data Description Disaggregated by Dowload  Documentation
File Title Description Metric Geographic Level Year Grade Subject      
AncillaryA_V1.1 This file contains a unique school identifier, an identifier indicating its NCES ID (the district to which it legally belongs), and the district in which it is included in our estimates. There is one observation per school. - District       Stata CSV  
AncillaryB_V1.1 This file contains the shape file that corresponds to the district crosswalk.  - National       File