Add Health

Initiated in 1994 and supported by five program project grants from the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) with co-funding from 23 other federal agencies and foundations, Add Health is the largest, most comprehensive, nationally-representative longitudinal survey of adolescents ever undertaken. Beginning with an in-school questionnaire administered to a nationally representative sample of students in grades 7-12, the study followed up with a series of in-home interviews conducted in 1995, 1996, 2001-02, 2008, and 2016-18. Add Health participants are now full-fledged adults, aged 33-44, and will soon be moving into midlife. Over the years, Add Health has added a substantial amount of additional data for users, including contextual data on the communities and states in which participants reside, genomic data and a range of biological health markers of participants, and parental survey data.

Wave I Data

The public-use dataset for Wave I contains information collected in 1994–95 from Add Health’s nationally representative sample of adolescents. This dataset consists of one-half of the core sample, and one-half of the oversample of African-American adolescents with a parent who has a college degree, chosen at random. The total number of Wave I respondents in this dataset is 6,504.

The Wave I public-use dataset includes information from each of the following sources (as available):

  • Wave I In-Home Data File, includes Wave I In-School Questionnaire Data, Wave I Parent Questionnaire Data and Add Health Picture Vocabulary Test Scores
  • Contextual data
  • In-school network data
  • Weights

Topics

  • Demographic Characteristics
  • Contextual Data
  • Family
  • Education
  • SES, Labor Market & Occupation
  • Physical Health
  • Psychological Well-being & Cognition
  • Friends & Social Network
  • Medication & Substance Use and Abuse
  • Reproductive Health
  • Crime/Delinquency/Victimization
  • Romantic Relationships
  • Risk Behavior

Wave II Data

The public-use dataset for Wave II contains information collected in 1996 from Add Health’s nationally representative sample of adolescents. The interview was generally similar to that at Wave I. Questions about attributes that should not change, such as ethnic background, were not repeated. A total of 4,834 of the original Wave I respondents were re-interviewed between April through August 1996.

The Wave II public-use dataset includes information from each of the following sources (as available):

  • Wave II In-Home Interview
  • Contextual data
  • Weights

Topics

  • Demographic Characteristics
  • Contextual Data
  • Family
  • Education
  • SES, Labor Market & Occupation
  • Physical Health
  • Psychological Well-being & Cognition
  • Friends & Social Network
  • Medication & Substance Use and Abuse
  • Reproductive Health
  • Crime/Delinquency/Victimization
  • Romantic Relationships
  • Risk Behavior

Wave III Data

Wave III

The Wave III public-use data are helpful in analyzing the transition between adolescence and young adulthood. A total of 4,882 of the original Wave I respondents were re-interviewed between August 2001 and April 2002. Wave III respondents were between 18 and 26 years old.

The In-home interview Wave III dataset includes the following data files (as available):

  • Main Respondent File: includes the In-Home Questionnaire data, AHPVT scores, and bio specimen data
  • Education Data: High School Exit Status and Math & Science Transcript data
  • Weights

Topics

  • Demographic Characteristics
  • Contextual Data
  • Family
  • Education
  • SES, Labor Market & Occupation
  • Physical Health
  • Psychological Well-being & Cognition
  • Friends & Social Network
  • Medication & Substance Use and Abuse
  • Reproductive Health
  • Crime/Delinquency/Victimization
  • Romantic Relationships
  • Risk Behavior

Wave IV Data

Wave IV was designed to study the developmental and health trajectories across the life course of adolescence into young adulthood. The Wave IV public-use file contains data on 5,114 respondents, aged 24 to 32*. In Wave IV, biological data was also gathered in an attempt to acquire a greater understanding of pre-disease pathways, with a specific focus on obesity, stress, and health risk behavior.

The In-home interview Wave IV dataset includes the following data files:

  • Wave IV In-home Interview File: variables from the in-home interview, including anthropometric measures
  • Wave IV Biomarker Data: Measures of EBV and hsCRP, Measures of Glucose homeostasis, and Lipids Measures
  • Weights

*17 respondents in the Wave IV public use sample were 33 years old at the time of the interview

Topics

  • Demographic Characteristics
  • Contextual Data
  • Family
  • Education
  • SES, Labor Market & Occupation
  • Physical Health
  • Psychological Well-being & Cognition
  • Friends & Social Network
  • Medication & Substance Use and Abuse
  • Reproductive Health
  • Crime/Delinquency/Victimization
  • Romantic Relationships
  • Risk Behavior

Sort by:
Wave I In-Home Interview Data (part of Core Files bundle)

A merged file containing the Wave I In-Home Interview data, Parent Questionnaire data and the Add Health Picture Vocabulary data, collected in 1994–1995. N= 20,745

Wave I School Administrator Questionnaire Data (part of Core Files bundle)

Information from the Wave I (self-administered) questionnaires answered by administrators at the sampled schools. N=172

Wave I In-School Questionnaire Data (part of Core Files bundle)

Adolescent responses to the In-School Questionnaire administered September 1994 through April 1995. N= 90,118

Wave I School Administrator Weights (part of Core Files bundle)

Variables needed to correct for design effects and weight the Wave I school administrator data. N=164

Wave I Grand Sample Weights (part of Core Files bundle)

Variables needed to correct for design effects and weight the Wave I in-school data. N=90,118

Wave II School Administrator Questionnaire Data (part of Core Files bundle)

Information from the Wave II (phone administered) questionnaires answered by administrators at the sampled schools. N=128

Wave II Grand Sample Weights (part of Core Files bundle)

Variables needed to correct for design effects and weight the Wave II in-home data. N=14,738

Wave III In-Home Interview Data (part of Core Files bundle)

Respondent-level data collected during the 2001?2002 in-home interview includes field interviewer characteristics, AHPVT. N=15,197

Wave III Grand Sample Weights (part of Core Files bundle)

Variables needed to correct for design effects and weight the Wave III in-home data, including longitudinal and cross-sectional weights. N=15,197

Wave IV Grand Sample Weights (part of Core Files bundle)

Variables needed to correct for design effects and weight the Wave IV in-home data, including longitudinal and cross-sectional weights. N=15,701

Wave V Sample 2B Data Add-on (Core Files Add-on)

(For Existing Contracts Only) We will update your existing contract to include Wave V Sample 2B and any other data that were released into the Core Files Bundle if you do not have them yet. It could include Wave V Mixed-Mode Survey Data.

Wave IV Consent (Wave IV Biomarker Files)

This file contains variables indicating the types of consent (archive, no archive, refused, incarcerated) obtained for the Wave IV blood spot and saliva DNA collections. N= 15,701

Wave V Demographics - Home Exam (Wave V Biomarker Files)

This file contains various demographic variables in regards to the Wave V home exam, including the date of the home exam, number of days between the Wave V survey and the home exam, the home exam completion status, as well as the respondent’s age, biological sex, pregnancy status, medication use and blood draw status.

Variables: 12

Observations: 5,381

WaveVDemographics-HomeExamCodebook.pdf

Wave V Anthropometrics (Wave V Biomarker Files)

This file contains anthropometric variables constructed from the measurements taken at the Wave V home exam. The measurements include arm circumference, height, weight, and waist circumference. The file also contains BMI, as well as classification variables for BMI and waist circumference.

Variables: 13

Observations: 5,381

WaveVAnthropometrics.zip

Wave V Cardiovascular Measures (Wave V Biomarker Files)

This file contains cardiovascular measures constructed from the three serial measurements of blood pressure and pulse rate collected at the Wave V home exam. The measures include systolic and diastolic blood pressure, pulse rate, pulse pressure and mean arterial pressure. Other variables included are classifications of blood pressure, flags based on self-reported medical history, an antihypertensive medication use flag and a hypertensive joint classification variable.

Variables: 31

Observations: 5,381

WaveVCardiovascularMeasures.zip

Wave V Medications - Home Exam (Wave V Biomarker Files)

This file provides the therapeutic classification codes, as well as numerous medication flag variables, to identify the types of medication (both prescription and over-the-counter) used by respondents as reported at the Wave V home exam.

Variables: 35

Observations: 11,105

WaveVMedications-HomeExam.zip

Wave V Glucose Homeostasis (Wave V Biomarker Files)

This file contains assay results of glucose and hemoglobin A1c (HbA1c) based on venous blood collected via phlebotomy at the Wave V home exam. There are classifications for fasting glucose and non-fasting glucose, as well as an HbA1C classification. Other variables included are flags based on a self-reported diabetes medical history, anti-diabetic medication use, and a diabetes joint classification variable.

Variables: 11

Observations: 5,381

WaveVGlucoseHomeostasis.zip

Wave V Lipids (Wave V Biomarker Files)

This file contains constructed measures designed to facilitate analysis and interpretation of lipids results based on venous blood collected via phlebotomy at the Wave V home exam. In addition to the lipid assay results, there are classifications according to both the NCEP/ATP III and AHA/ACC guidelines, a flag for antihyperlipidemic medication use, and two hyperlipidemia joint classification variables.

Variables: 26

Observations: 5,381

WaveVLipids.zip

Wave V Renal Function (Wave V Biomarker Files)

This file contains constructed measures designed to facilitate analysis and interpretation of renal function based on venous blood collected via phlebotomy at the Wave V home exam. Assay results for creatinine and cystatin-c are available, as well as three different estimations of the glomerular filtration rate (GFR) using either the creatinine concentration, the cystatin C concentration or both concentrations. Classifications according to both clinical and KDIGO guidelines are available as well.

Variables: 14

Observations: 5,381

WaveVRenalFunction.zip

Wave V Inflammation and Immune Function (Wave V Biomarker Files)

This file contains constructed measures designed to facilitate analysis and interpretation of inflammation and immune function based on venous blood collected via phlebotomy at the Wave V home exam. In addition to the assay results for high sensitivity C-reactive protein (hsCRP), there is an AHA/CDC classification, counts of subclinical symptoms and common infectious or inflammatory diseases, and various anti-inflammatory medication use flags.

Variables: 26

Observations: 5,381

WaveVInflammation_ImmuneFunction.zip

Wave V Biomarker Weight (Wave V Biomarker Files)

This file contains the Wave V biomarker sample weight.

Variables: 2

Observations: 5,381

WaveVBiomarkerSampleWeight.zip

Wave I - II: Family Structure Array (Constructed Data Files)

Nineteen constructed family structure array variables are available for Add Health respondents (Wave I or II) from birth to age at latest adolescent follow-up, with a maximum age of 18 years old. The family structure array variables can be used to construct measures such as the number of family structure changes experienced from birth to a given age (family instability), or the amount of time spent in different family structures.

The following citation should be used for papers and publications using this data: Gaydosh, L. & Harris, K.M. 2018. Family Instability and Young Adult Health. Journal of Health and Social Behavior, 59(2).

Variables: 19

Observations: 20,745

Family Structure (Constructed Data Files)

This file contains four variables, utilizing five, seven, eight, and fourteen categories, to describe the household parental structure at Wave I.

The following citation should be used for papers and publications using this data: Harris, K. M. 1999. The Health Status and Risk Behavior of Adolescents in Immigrant Families. In Hernandez, D.J. (Ed.), Children of Immigrants: Health, Adjustment, and Public Assistance (pp. 286-347). Washington, D.C.: National Academy Press.

Variables: 4

Observations: 20,745

Race (Constructed Data Files)

Included in this file are a single race variable and a multiracial variable constructed from the Wave I race questions.

The following citation should be used for papers and publications using this data: Udry, J.R., Li, R.M., & Hendrickson-Smith, J. 2003. Health and Behavior Risks of Adolescents with Mixed-Race Identity. American Journal of Public Health, 93(11), 1865-1870.

Variables: 1

Observations: 20,246

Wave IV Constructed Variables (Constructed Data Files)

This file contains constructed variables on stress, depression, mastery, personality, arrest history, sexual behavior, smoking, and substance abuse created by Wave IV collaborators.

Variables: 49

Observations: 15,701

Constructed SES Variables (Constructed Data Files)

These data include variables for social origins measured from information about their families collected from Add Health participants’ parents at the Wave I interview, social attainments measured from their occupations reported on the Wave IV survey, and neighborhood-level socioeconomic disadvantage using Census-tract-level data linked to Add Health participants’ addresses from Wave I and Wave IV.

Variables: 5

Observations: 20,745

Wave IV Constructed Current Relationship Status (Constructed Data Files)

This dataset contains variables that describe the current relationship each respondent had by Wave IV.

Variables: 3

Observations: 15,701

Wave I In-School Friendship Nominations (Friendship Files)

Identification numbers of the friends that the respondent nominated during the in-school interview. N=90,118

Wave I In-Home Friendship Nominations (Friendship Files)

Identification numbers of the friends that the respondent nominated during the Wave I in-home interview. N=20,745

Wave II In-Home Friendship Nominations (Friendship Files)

Identification numbers of the friends that the respondent nominated during the Wave II in-home interview. N=14,738

Wave III Friend IDs (Friendship Files)

Add Health respondents who were in the 7th or 8th grade at Wave I were asked at Wave III to identify, from a list of 10 computer-generated names, which ones were current friends or which ones were their friends when they were in school together. This dataset contains the altered identification numbers (AIDs) of the 10 computer-generated names. N=3,572

Adolescent Pairs Data (Sibling Files)

Information that links and describes the sibling pairs. N=3,139

Wave III Sibling IDs (Sibling Files)

At Wave III, Add Health respondents were asked questions about their siblings who also participated in the Wave I or II in-home interviews. This dataset contains the AIDs for these siblings. N= 4,368

Wave I Contextual Data (Contextual Data Files)

Community contextual variables based on state, county, tract, and block group levels derived from the Wave I addresses. N=20,745

Wave I Spatial Analysis Data (Contextual Data Files)

Pseudo coordinates that can be used to calculate distances between friends in a school community. N=20,301

Wave I Neighborhood Data (Contextual Data Files)

Pseudo state, county, tract, and block group variables that allow grouping of Add Health respondents geographically (based on Wave I addresses). N=20,745

Wave II Contextual Data (Contextual Data Files)

Community contextual variables based on state, county, tract, and block group levels derived from the Wave II addresses. N=14,738

Wave II Neighborhood Data (Contextual Data Files)

Pseudo state, county, tract, and block group variables that allow grouping of Add Health respondents geographically (based on Wave II addresses). N=14,738

Wave III Contextual Data (Contextual Data Files)

Community contextual variables based on state, county, tract, and block group levels derived from the Wave III addresses. N=15,197

Wave III Sex Ratio (Contextual Data Files)

In this file are constructed variables at the county-level for sex ratios for males to females ages 18 to 29, 30-34, and 18 to 34, the proportion of males and females ages 18 to 34, and the sex ratio of employed males 16 years and older to the total number of females 16 years and older for the white population and the black or African American population. N= 15,197

Wave III Grouping File Data (Contextual Data Files)

Pseudo state, county, tract, and block group variables in FIPS code format that allow grouping of Add Health respondents geographically (based on Wave III addresses).N=15,197

Wave III Region (Contextual Data Files)

This file contains the Census region codes for the respondents’ Wave III residential locations. N=15,197

Wave III Supplemental Tract-Level Contextual Data (Contextual Data Files)

This file contains supplemental Wave III contextual data that include transportation and commuting measures, climate descriptors, amenities, and state-level tobacco control influences. These variables are available at the census tract-level unless otherwise specified. N= 15,197

Wave IV Region (Contextual Data Files)

This file contains the Census region codes for the respondents’ Wave IV residential locations. N=15,197

Wave IV Grouping Data (Contextual Data Files)

The pseudo FIPS codes in this file allow you to geographically group respondents by their Wave IV locations. N=15,701

Wave IV Supplemental Tract-Level Contextual Data (Contextual Data Files)

This file contains tract-level measures, based on the Wave IV respondent locations, reported by the U.S. Census Bureau's 2009 American Community Survey (ACS), the Climate Atlas of the United States, the USDA Economics Research Service, Esri Data and Maps, ImpacTeen Tobacco Control policy and Prevalence Data, and the Uniform Crime Reports. When tract level measures were not available or appropriate, state and county level variables were used. N=15,701

Wave III ASHA Call Data (Wave III Supplemental Files)

To receive the results of their STD assays, Wave III respondents called an Add Health dedicated number at the American Social Health Association. This dataset provides information on who called the results hotline and the date and time of the call. N=4,279

Wave III BEM Scores Data (Wave III Supplemental Files)

The masculinity and femininity raw and standard scores from the 30 item short form BEM Sex-Role Inventory are available in this file. N=15,197

Wave III Mentor Data (Wave III Supplemental Files)

For Wave III respondents who reported having a mentor, the open-ended responses to the question “How did {HE/SHE} help you?” have been coded and are available in this file. N=15,197

Wave III Academic Courses (Wave III Education Files)

These files contain academic status and/or performance indicators for math, science, foreign language, English, history, social sciences, physical education, and a combined overall category. N= 12,237

Wave III Academic Networks (Wave III Education Files)

The Network files provide information on social networks based on the respondents’ course-taking patterns. N varies by file

Wave III Context (Wave III Education Files)

School level contextual data are from the Common Core of Data (CCD), Private School Survey (PSS), the 1990 and 2000 Census, and the Office of Civil Rights. N varies by file

Wave III Course Level (Wave III Education Files)

The data in this file are needed for merging the course-level curriculum data with other Education Files. N=516,146

Wave III Curriculum (Wave III Education Files)

These math and science curriculum data are derived from coding the textbooks schools reported using for each course offered in these two subjects. N varies by file

Wave III Linking (Wave III Education Files)

This file contains variables designed to link transcript data to academic or school years and to Add Health. N= 12,237

Wave III Primary (Wave III Education Files)

The Primary Component contains several types of indicators based on information collected from participating schools and listed directly on student transcripts such as student exit or graduation status and materials gathered from schools during the data collection process. N varies by file

Wave III Transition (Wave III Education Files)

This file contains variables explaining the respondents’ movement through the educational system. N=12,217

Wave III Education Weights (Wave III Education Files)

Files contain weights for the education data along with the school weights needed for HLM analysis. N varies by file

Wave I In-Home Weight Components (Additional Weight Files)

A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave I. N= 20,745

In-School Weight Components (Additional Weight Files)

A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights. N= 90,118

Wave II In-Home Weight Components (Additional Weight Files)

A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave II. N= 14,738

Wave III In-Home Weight Components (Additional Weight Files)

A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave III. N=15,197

Wave IV Weight Components (Additional Weight Files)

A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave IV. N= 15,701

Add Health School Weights (Additional Weight Files)

The initial weights for the school are in this file. N=132

Wave I School Network Data (Network Files)

Network variables constructed from the in-school questionnaire data and friendship nominations. N=75,871

Wave I and II Disposition File (Disposition Files)

This file contains the types of data available for the Wave I respondents along with the outcome of the 16,706 respondents selected for Wave II. N= 20,745

National Death Index File (Disposition Files)

This file contains the underlying cause of death and days alive after Wave I interview. N=227

Wave III Disposition File (Disposition Files)

This file contains the final outcome of the 20,058 cases fielded at Wave III. N=20,058

Wave IV Disposition File (Disposition Files)

This file contains the final outcome of the 19,962 cases fielded at Wave IV. N=19,962

Wave I Climate File (ONE: Obesity and Neighborhood Environment Files)

This file contains the climate data for Wave I respondents based on the nearest climate station. Information is available on precipitation, total snowfall, sky cover, temperature, and total hours of sunshine. N=20,745

Wave I Street Connectivity File (ONE: Obesity and Neighborhood Environment Files)

These files contain road network connectivity measures within 1, 3, 5, and 8.05 km (5 miles) of the Wave I respondent locations. N= 20,745

Wave I Crime File (ONE: Obesity and Neighborhood Environment Files)

The county-level crime data in these files are based on the Wave I respondent locations. N= 20,745

Wave I Geocode Source (ONE: Obesity and Neighborhood Environment Files)

The data source of the Wave I respondent residential geocodes (latitude and longitude) are provided in these files. N= 20,745

Wave I Land Cover Data (ONE: Obesity and Neighborhood Environment Files)

These files contain land cover metrics within 1, 3, 5, and 8.05 km (5 miles) of Wave I respondent locations. N= 20,745

Wave I Parks Data (ONE: Obesity and Neighborhood Environment Files)

The counts of public parks within a Euclidean distance of 1, 3, 5, and 8.05 kilometers (5 miles) of each respondent at Wave I are in these files. N=20,745

Wave I Resources Data (ONE: Obesity and Neighborhood Environment Files)

These Add Health files provide data on the presence of various physical activity (PA) resources situated near respondent residences at Wave I. N=20,745

Wave I Urban Distances (ONE: Obesity and Neighborhood Environment Files)

Contains Euclidean distances to both 1990 and 2000 U.S. Census Urbanized Areas (UAs) for each Wave I respondent. N=20,745

Wave I Weather Data (ONE: Obesity and Neighborhood Environment Files)

This file contains weather data for Wave I respondents based on the nearest weather station reporting data for the corresponding survey month and year. N=20,745

Wave I Population Density Data (ONE: Obesity and Neighborhood Environment Files)

The Wave I population density file contains the proportion of 1990 U.S. Census block group population and are (in square meters) within 1, 3, 5, and 8.04672 km (5 mi) of each Wave I respondent. N=20,745

Wave I School Distance Measures (ONE: Obesity and Neighborhood Environment Files)

This file contains the distance between the geocoded point locations of each respondent’s Wave I location and that respondent’s school. N=20,745

Wave I ACCRA Data (ONE: Obesity and Neighborhood Environment Files)

These data files report ACCRA Cost of Living Index data for Wave I based on respondent location and the year and quarter of the Add Health interview. N=20,745

Wave I Employment Files (ONE: Obesity and Neighborhood Environment Files)

Certain county-level employment data from the U.S. Bureau of Labor Statistics are attached to Wave I respondent locations. N=20,745

Wave I Length of Day (ONE: Obesity and Neighborhood Environment Files)

These data files contain the number of hours of daylight at each Wave I respondent location based on that respondent’s latitude and survey date. N=20,745

Wave I Road Type Length Data (ONE: Obesity and Neighborhood Environment Files)

Road type length calculations within radii of 1, 3, 5, and 8.05 kilometers (5 miles) of Wave I respondent locations. N=20,745

Wave I Rural-Urban Commuting Area (RUCA) Codes (ONE: Obesity and Neighborhood Environment Files)

These data files define the rural-urban commuting characteristics of Wave I respondent locations at the U.S. Census tract-level using the 1990 and 2000 RUCA codes developed by the U.S. Department of Agriculture’s Economic Research Service. N=20,745

Wave I Grouping File (ONE: Obesity and Neighborhood Environment Files)

This file is for use with the Obesity and Neighborhood Environment (ONE) data only.  It is based on recoded Wave I addresses and contains pseudo state, county, tract, and block group variables in FIPS code format that allow for grouping of Add Health respondents geographically. N: 20,745

Wave III Street Connectivity Files (ONE: Obesity and Neighborhood Environment Files)

These files contain road network connectivity measures within 1, 3, 5, and 8.05 km (5 miles) of the Wave III respondent locations. N= 15,197

Wave III Crime File (ONE: Obesity and Neighborhood Environment Files)

The county-level crime data in these files are based on the Wave III respondent locations. N= 15,197

Wave III Geocode Source (ONE: Obesity and Neighborhood Environment Files)

The data source of the Wave III respondent residential geocodes (latitude and longitude) are provided in these files. N= 15,197

Wave III Land Cover Data (ONE: Obesity and Neighborhood Environment Files)

These files contain land cover metrics within 1, 3, 5, and 8.05 km (5 miles) of Wave III respondent locations. N= 15,197

Wave III Parks Data (ONE: Obesity and Neighborhood Environment Files)

The counts of public parks within a Euclidean distance of 1, 3, 5, and 8.05 kilometers (5 miles) of each respondent at Wave III are in these files. N= 15,197

Wave III Resources Data (ONE: Obesity and Neighborhood Environment Files)

These Add Health files provide data on the presence of various physical activity (PA) resources situated near respondent residences at Wave III. N=15,197