Add Health

Initiated in 1994 and supported by five program project grants from the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) with co-funding from 23 other federal agencies and foundations, Add Health is the largest, most comprehensive, nationally-representative longitudinal survey of adolescents ever undertaken. Beginning with an in-school questionnaire administered to a nationally representative sample of students in grades 7-12, the study followed up with a series of in-home interviews conducted in 1995, 1996, 2001-02, 2008, and 2016-18. Add Health participants are now full-fledged adults, aged 33-44, and will soon be moving into midlife. Over the years, Add Health has added a substantial amount of additional data for users, including contextual data on the communities and states in which participants reside, genomic data and a range of biological health markers of participants, and parental survey data.



Access to the Add Health Data


Restricted-Use Data will be distributed by contract only to certified researchers (this includes researchers located outside of the US) who commit themselves to maintaining limited access. To be eligible to enter into an Add Health contract, researchers must complete a Contract Application which includes the following requirements (as well as others):

  • Security plan
  • IRB approval letter
  • NEW contracts: $1000 payment by check or credit card as arranged with Add Health Contracts
  • Additional justification statement for each dataset ouside the “Core Files” category (see below for data listing).

Romantic Pairs Data will be distributed by contract only to certified researchers (this includes researchers located outside of the US) who commit themselves to maintaining limited access. To be eligible to enter into an Add Health contract, researchers must complete a Contract Application which includes the following requirements (as well as others):

  • Security plan
  • IRB approval letter
  • NEW contracts: $1000 payment by check or credit card as arranged with Add Health Contracts
  • Additional justification statement for each dataset ouside the “Core Files” category (see below for data listing).
  • Additional justification statement for each Romantic Pairs data file requested

    NOTE: The Romantic Pairs contract includes access to all the data available to the Restricted-Use contract plus the Romantic Pairs data.

Initiating an application for a new Add Health contract

For some notes on initiating an application for an Add Health contract, please see the document Getting Started.


For some hints for submitting your application, please see the document New Contract Information Packet.


Renewing an existing Add Health contract

Contact Add Health Contracts for instructions.

Forms


Wave I Data

The public-use dataset for Wave I contains information collected in 1994–95 from Add Health’s nationally representative sample of adolescents. This dataset consists of one-half of the core sample, and one-half of the oversample of African-American adolescents with a parent who has a college degree, chosen at random. The total number of Wave I respondents in this dataset is 6,504.

The Wave I public-use dataset includes information from each of the following sources (as available):

  • Wave I In-Home Data File, includes Wave I In-School Questionnaire Data, Wave I Parent Questionnaire Data and Add Health Picture Vocabulary Test Scores
  • Contextual data
  • In-school network data
  • Weights

Topics

  • Demographic Characteristics
  • Contextual Data
  • Family
  • Education
  • SES, Labor Market & Occupation
  • Physical Health
  • Psychological Well-being & Cognition
  • Friends & Social Network
  • Medication & Substance Use and Abuse
  • Reproductive Health
  • Crime/Delinquency/Victimization
  • Romantic Relationships
  • Risk Behavior

Wave II Data

The public-use dataset for Wave II contains information collected in 1996 from Add Health’s nationally representative sample of adolescents. The interview was generally similar to that at Wave I. Questions about attributes that should not change, such as ethnic background, were not repeated. A total of 4,834 of the original Wave I respondents were re-interviewed between April through August 1996.

The Wave II public-use dataset includes information from each of the following sources (as available):

  • Wave II In-Home Interview
  • Contextual data
  • Weights

Topics

  • Demographic Characteristics
  • Contextual Data
  • Family
  • Education
  • SES, Labor Market & Occupation
  • Physical Health
  • Psychological Well-being & Cognition
  • Friends & Social Network
  • Medication & Substance Use and Abuse
  • Reproductive Health
  • Crime/Delinquency/Victimization
  • Romantic Relationships
  • Risk Behavior

Wave III Data

Wave III

The Wave III public-use data are helpful in analyzing the transition between adolescence and young adulthood. A total of 4,882 of the original Wave I respondents were re-interviewed between August 2001 and April 2002. Wave III respondents were between 18 and 26 years old.

The In-home interview Wave III dataset includes the following data files (as available):

  • Main Respondent File: includes the In-Home Questionnaire data, AHPVT scores, and bio specimen data
  • Education Data: High School Exit Status and Math & Science Transcript data
  • Weights

Topics

  • Demographic Characteristics
  • Contextual Data
  • Family
  • Education
  • SES, Labor Market & Occupation
  • Physical Health
  • Psychological Well-being & Cognition
  • Friends & Social Network
  • Medication & Substance Use and Abuse
  • Reproductive Health
  • Crime/Delinquency/Victimization
  • Romantic Relationships
  • Risk Behavior

Wave IV Data

Wave IV was designed to study the developmental and health trajectories across the life course of adolescence into young adulthood. The Wave IV public-use file contains data on 5,114 respondents, aged 24 to 32*. In Wave IV, biological data was also gathered in an attempt to acquire a greater understanding of pre-disease pathways, with a specific focus on obesity, stress, and health risk behavior.

The In-home interview Wave IV dataset includes the following data files:

  • Wave IV In-home Interview File: variables from the in-home interview, including anthropometric measures
  • Wave IV Biomarker Data: Measures of EBV and hsCRP, Measures of Glucose homeostasis, and Lipids Measures
  • Weights

*17 respondents in the Wave IV public use sample were 33 years old at the time of the interview

Topics

  • Demographic Characteristics
  • Contextual Data
  • Family
  • Education
  • SES, Labor Market & Occupation
  • Physical Health
  • Psychological Well-being & Cognition
  • Friends & Social Network
  • Medication & Substance Use and Abuse
  • Reproductive Health
  • Crime/Delinquency/Victimization
  • Romantic Relationships
  • Risk Behavior

Wave V Data

Add Health re-interviewed cohort members in a Wave V follow-up during 2016-2018 to collect social, environmental, behavioral, and biological data with which to track the emergence of chronic disease as the cohort moved through their fourth decade of life. Wave V public-use file contains 4,196 respondents.

Mixed Mode Survey

Wave V data collection employed a mixed mode survey design, including an embedded component to analyze mode effects. The Wave V survey was shorter than the Wave IV interview, which lasted 90 minutes.

Topics

  • Demographic Characteristics
  • Family, siblings, friends
  • Education, work, military
  • Physical and Mental Health
  • Daily activities and sleep
  • Relationships
  • Sexual & Fertility Histories
  • Substance Use & Abuse
  • Involvement with Criminal Justice System
  • Work attitudes and characteristics
  • Religion
  • Economics
  • Expectations
  • Personality
  • Stressors
  • Children and Parenting
  • Civic Participation
  • Cognitive Function
  • Psychosocial factors
  • Retrospective Childhood Health & Socioeconomic Status

Sort by:
Wave I In-Home Interview Data (part of Core Files bundle)

A merged file containing the Wave I In-Home Interview data, Parent Questionnaire data and the Add Health Picture Vocabulary data, collected in 1994–1995. N= 20,745

Wave I School Administrator Questionnaire Data (part of Core Files bundle)

Information from the Wave I (self-administered) questionnaires answered by administrators at the sampled schools. N=172

Wave I In-School Questionnaire Data (part of Core Files bundle)

Adolescent responses to the In-School Questionnaire administered September 1994 through April 1995. N= 90,118

Wave I School Administrator Weights (part of Core Files bundle)

Variables needed to correct for design effects and weight the Wave I school administrator data. N=164

Wave II School Administrator Questionnaire Data (part of Core Files bundle)

Information from the Wave II (phone administered) questionnaires answered by administrators at the sampled schools. N=128

Wave III In-Home Interview Data (part of Core Files bundle)

Respondent-level data collected during the 2001?2002 in-home interview includes field interviewer characteristics, AHPVT. N=15,197

Wave III Grand Sample Weights (part of Core Files bundle)

Variables needed to correct for design effects and weight the Wave III in-home data, including longitudinal and cross-sectional weights. N=15,197

Wave IV Grand Sample Weights (part of Core Files bundle)

Variables needed to correct for design effects and weight the Wave IV in-home data, including longitudinal and cross-sectional weights. N=15,701

Wave V Mixed-Mode Survey Data (part of Core Files bundle)

Respondent data from the Wave V mixed-mode survey. This release includes survey, weights, disposition, survey medications and Section 16b data sets. N=12,300

WaveVMixedModeSurvey.zip

Wave V Sample 2B Data Add-on (Core Files Add-on)

(For Existing Contracts Only) We will update your existing contract to include Wave V Sample 2B and any other data that were released into the Core Files Bundle if you do not have them yet. It could include Wave V Mixed-Mode Survey Data.

Wave IV Consent (Wave IV Biomarker Files)

This file contains variables indicating the types of consent (archive, no archive, refused, incarcerated) obtained for the Wave IV blood spot and saliva DNA collections. N= 15,701

consent4.pdf

Wave V Demographics - Home Exam (Wave V Biomarker Files)

This file contains various demographic variables in regards to the Wave V home exam, including the date of the home exam, number of days between the Wave V survey and the home exam, the home exam completion status, as well as the respondent’s age, biological sex, pregnancy status, medication use and blood draw status.

Variables: 12

Observations: 5,381

WaveVDemographics-HomeExamCodebook.pdf

Wave V Anthropometrics (Wave V Biomarker Files)

This file contains anthropometric variables constructed from the measurements taken at the Wave V home exam. The measurements include arm circumference, height, weight, and waist circumference. The file also contains BMI, as well as classification variables for BMI and waist circumference.

Variables: 13

Observations: 5,381

WaveVAnthropometrics.zip

Wave V Cardiovascular Measures (Wave V Biomarker Files)

This file contains cardiovascular measures constructed from the three serial measurements of blood pressure and pulse rate collected at the Wave V home exam. The measures include systolic and diastolic blood pressure, pulse rate, pulse pressure and mean arterial pressure. Other variables included are classifications of blood pressure, flags based on self-reported medical history, an antihypertensive medication use flag and a hypertensive joint classification variable.

Variables: 31

Observations: 5,381

WaveVCardiovascularMeasures.zip

Wave V Medications - Home Exam (Wave V Biomarker Files)

This file provides the therapeutic classification codes, as well as numerous medication flag variables, to identify the types of medication (both prescription and over-the-counter) used by respondents as reported at the Wave V home exam.

Variables: 35

Observations: 11,105

WaveVMedications-HomeExam.zip

Wave V Glucose Homeostasis (Wave V Biomarker Files)

This file contains assay results of glucose and hemoglobin A1c (HbA1c) based on venous blood collected via phlebotomy at the Wave V home exam. There are classifications for fasting glucose and non-fasting glucose, as well as an HbA1C classification. Other variables included are flags based on a self-reported diabetes medical history, anti-diabetic medication use, and a diabetes joint classification variable.

Variables: 11

Observations: 5,381

WaveVGlucoseHomeostasis.zip

Wave V Lipids (Wave V Biomarker Files)

This file contains constructed measures designed to facilitate analysis and interpretation of lipids results based on venous blood collected via phlebotomy at the Wave V home exam. In addition to the lipid assay results, there are classifications according to both the NCEP/ATP III and AHA/ACC guidelines, a flag for antihyperlipidemic medication use, and two hyperlipidemia joint classification variables.

Variables: 26

Observations: 5,381

WaveVLipids.zip

Wave V Renal Function (Wave V Biomarker Files)

This file contains constructed measures designed to facilitate analysis and interpretation of renal function based on venous blood collected via phlebotomy at the Wave V home exam. Assay results for creatinine and cystatin-c are available, as well as three different estimations of the glomerular filtration rate (GFR) using either the creatinine concentration, the cystatin C concentration or both concentrations. Classifications according to both clinical and KDIGO guidelines are available as well.

Variables: 14

Observations: 5,381

WaveVRenalFunction.zip

Wave V Inflammation and Immune Function (Wave V Biomarker Files)

This file contains constructed measures designed to facilitate analysis and interpretation of inflammation and immune function based on venous blood collected via phlebotomy at the Wave V home exam. In addition to the assay results for high sensitivity C-reactive protein (hsCRP), there is an AHA/CDC classification, counts of subclinical symptoms and common infectious or inflammatory diseases, and various anti-inflammatory medication use flags.

Variables: 26

Observations: 5,381

WaveVInflammation_ImmuneFunction.zip

Wave V Biomarker Weight (Wave V Biomarker Files)

This file contains the Wave V biomarker sample weight.

Variables: 2

Observations: 5,381

WaveVBiomarkerSampleWeight.zip

Wave I - II: Family Structure Array (Constructed Data Files)

Nineteen constructed family structure array variables are available for Add Health respondents (Wave I or II) from birth to age at latest adolescent follow-up, with a maximum age of 18 years old. The family structure array variables can be used to construct measures such as the number of family structure changes experienced from birth to a given age (family instability), or the amount of time spent in different family structures.

The following citation should be used for papers and publications using this data: Gaydosh, L. & Harris, K.M. 2018. Family Instability and Young Adult Health. Journal of Health and Social Behavior, 59(2).

Variables: 19

Observations: 20,745

Family Structure (Constructed Data Files)

This file contains four variables, utilizing five, seven, eight, and fourteen categories, to describe the household parental structure at Wave I.

The following citation should be used for papers and publications using this data: Harris, K. M. 1999. The Health Status and Risk Behavior of Adolescents in Immigrant Families. In Hernandez, D.J. (Ed.), Children of Immigrants: Health, Adjustment, and Public Assistance (pp. 286-347). Washington, D.C.: National Academy Press.

Variables: 4

Observations: 20,745

Race (Constructed Data Files)

Included in this file are a single race variable and a multiracial variable constructed from the Wave I race questions.

The following citation should be used for papers and publications using this data: Udry, J.R., Li, R.M., & Hendrickson-Smith, J. 2003. Health and Behavior Risks of Adolescents with Mixed-Race Identity. American Journal of Public Health, 93(11), 1865-1870.

Variables: 1

Observations: 20,246

Wave IV Constructed Variables (Constructed Data Files)

This file contains constructed variables on stress, depression, mastery, personality, arrest history, sexual behavior, smoking, and substance abuse created by Wave IV collaborators.

Variables: 49

Observations: 15,701

Constructed SES Variables (Constructed Data Files)

These data include variables for social origins measured from information about their families collected from Add Health participants’ parents at the Wave I interview, social attainments measured from their occupations reported on the Wave IV survey, and neighborhood-level socioeconomic disadvantage using Census-tract-level data linked to Add Health participants’ addresses from Wave I and Wave IV.

Variables: 5

Observations: 20,745

Wave IV Constructed Current Relationship Status (Constructed Data Files)

This dataset contains variables that describe the current relationship each respondent had by Wave IV.

Variables: 3

Observations: 15,701

Wave IV BMI Genetic Risk Score (Genetic Files)

This file contains the BMI genetic risk score for Add Health twin and full sibling respondents who provided saliva samples at Wave IV. N= 1,886

GRS_BMI.pdf

Wave I and II Disposition File (Disposition Files)

This file contains the types of data available for the Wave I respondents along with the outcome of the 16,706 respondents selected for Wave II. N= 20,745

Wave III Academic Courses (Wave III Education Files)

These files contain academic status and/or performance indicators for math, science, foreign language, English, history, social sciences, physical education, and a combined overall category. N= 12,237

Wave III ASHA Call Data (Wave III Supplemental Files)

To receive the results of their STD assays, Wave III respondents called an Add Health dedicated number at the American Social Health Association. This dataset provides information on who called the results hotline and the date and time of the call. N=4,279

Parents (2015-2017) (Parents (2015-2017))

The parent data files contain social, demographic, behavioral, and health data collected in 2015-2017 on a probability sample of Add Health parents who were originally interviewed in 1995 and coincide with Wave V of Add Health. Data for 2,013 Wave I parents, connected to 2,244 Add Health respondents, are available. Additionally, 988 current spouse/partner interviews are available. These data can be linked with Wave I parent data, and corresponding Add Health respondents at Waves I – V. Includes weight files.

Parents_2015-2017.zip

Wave I In-School Friendship Nominations (Friendship Files)

Identification numbers of the friends that the respondent nominated during the in-school interview. N=90,118

dbGaP Linking File (GID_link.xpt) (dbGaP Files)

The dbGaP Linking File contains the ID crosswalk to link the Add Health dbGaP genotype data to the restricted-use phenotype data for 9,974 Wave IV respondents.

The following items are required before this request can be approved. Please submit, as necessary, the documents listed below.

  1. An approved Add Health Restricted-Use Data Contract.
  2. The Add Health Linkage File Agreement.
  3. An approved dbGaP Add Health Data Use Certification Agreement.
  4. The NIH-approved dbGaP NIH Data Access Request.

Wave I Climate File (ONE: Obesity and Neighborhood Environment Files)

This file contains the climate data for Wave I respondents based on the nearest climate station. Information is available on precipitation, total snowfall, sky cover, temperature, and total hours of sunshine. N=20,745

Wave_I_and_III_Climate_Files.zip

Wave I In-Home Weight Components (Additional Weight Files)

A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave I. N= 20,745

Adolescent Pairs Data (Sibling Files)

Information that links and describes the sibling pairs. N=3,139

Wave III Cotinine Assay Data (Wave III Biomarker Files)

This file contains the cotinine and 3-hydroxycotinine assay values for 963 Wave III respondents. N=963

Cotinine.pdf

Wave I School Network Data (Network Files)

Network variables constructed from the in-school questionnaire data and friendship nominations. N=75,871

HUD-Assisted Housing Supplementary Data (Contextual Data Files)

The supplementary datafile identifies Add Health respondents who lived in HUD-assisted housing at any point between 1995 and 2017. For these Add Health respondents, the supplementary datafile provides unique Add Health respondent identifiers (AID) and household-level information about the characteristics of their HUD housing residence. The supplementary datafile is a hierarchical (i.e., long-format data file), with each row representing a unique HUD administrative record that is linked to an AID. In total, the hierarchical file includes a total of 8,587 HUD records on 1,159 unique Add Health respondents identified through the linkage. N=8,587

Due to the sensitive nature of the HUD-Assisted Housing Supplementary data, approved users can only access the data on the UNC Secure Remote Workspace (UNC SRW) server for a limited timeframe (1 year).

The following items are required before this request can be approved. Please submit, as necessary, the documents listed below.

  1. Data Analysis Plan (maximum one page; indicate time for completion)
  2. Completed Affiliate Form (will be provided upon receipt of this order form)
  3. Remote Access Form

HUD-Assisted_Housing_Supplementary_Data.zip

Wave I State Demographic Characteristics, Exclusionary Indices, and Inclusionary Indices (Contextual Data Files)

These data provide measures of punishment regime variation in state-based policies, practices, and programs, both in their punitive and non-punitive forms, and some additional state demographic control variables. These data were gathered to use with Add Health for multilevel analyses. N=20,745

WaveIStateCharacteristics_ExclusionaryIndices_andInclusionaryIndices.zip

Contextual Wave IV Database (Contextual Data Files)

The data provide an important update to the contextual variables already available in Wave IV by including information at the state- and county-levels. Additionally, there are a few new variables in the present database that are absent from other waves. N=15,701

ContextualWaveIVDatabase.zip

Contextual Wave V Database (Contextual Data Files)

This contextual database further expands the extensive contextual data currently available to users of the National Longitudinal Study of Adolescent to Adult Health (Add Health) through the provision of numerous measures reported by the U.S. Census Bureau’s American Community Survey (ACS), rural-urban commuting area codes, U.S. Climate Atlas, and Uniform Crime Report. N=12,300

ContextualWaveVDatabase.zip

Wave V County Health and Mobility Data (Contextual Data Files)

The Wave V County Health and Mobility database summarizes the socioeconomic, health, and mobility characteristics of the environments in which Add Health participants were living at the time of their Wave V interview. County-level data describe (1) levels of and trends in chronic disease (hypertension, type-2 diabetes) and health risk behaviors (obesity, smoking, alcohol use); and (2) economic opportunity and inequality. N=12,300

WaveVCountyHealthandMobilityData.zip

Wave V - ACA Medicaid Expansion Data (Contextual Data Files)

This data file provides the year in which states expanded Medicaid under the Affordable Care Act. These dates are attached to the location in which Add Health participants were living at Wave V. N=12,300

WaveV-ACAMedicaidExpansionData.zip

Waves I, IV, and V - The Opportunity Atlas: Mapping the Childhood Roots of Social Mobility Data (Contextual Data Files)

This file enhances the existing Add Health contextual database through the addition of measures essential to understanding the determinants and sequelae of socioeconomic mobility. Specifically, it aims to characterize the socioeconomic mobility of Add Health participants at Wave I, IV, and V. WI N=20,745 WIV N=15,701 WV N=12,300

WavesI_IV__V-TheOpportunityAtlas.zip

Wave IV City Crime Rate Data (Contextual Data Files)

This Crime Rate Data file facilitates research examining the impact of community violence on the health trajectories of Wave IV Add Health participants by providing police department crime data from 13 U.S. cities with high crime rates. N=15,701

WaveIVCityCrimeRateData.zip

Wave V City Crime Rate Data (Contextual Data Files)

This Crime Rate Data file facilitates research examining the impact of community violence on the health trajectories of Wave V Add Health participants by providing police department crime data from 13 U.S. cities with high crime rates. N=12,300

WaveVCityCrimeRateData.zip

Historical Neighborhood Redlining (Contextual Data Files)

This contextual database allows researchers to identify potential long-term consequences of redlining for contemporary inequities in neighborhood environments, and individual health and socioeconomic attainment over the life course. N=20,706

Historical_Neighborhood_Redlining.zip

Waves III-V Multi-year Air Pollution Exposure Estimates (Contextual Data Files)

The air pollution data described here provide longer-term estimates of air pollution exposure that can be used to address a broad range of research questions related to how air pollution exposure over time may relate to a variety of health outcomes. N=20,745

Waves_III-V_Multi-year_Air_Pollution_Exposure_Estimates.zip

Wave I & II School Desegregation Disparities (Contextual Data Files)

This file contains data on the levels of school racial segregation experienced by Add Health respondents during their school-age years, related school district characteristics, and measures of tract-level residential segregation present in adulthood (Waves III-V). N=84,166

Wave_I___II_School_Desegregation_Disparities.zip

Wave I & II School District Grouping Data (Contextual Data Files)

To facilitate clustering by school district, the school district identifiers comprising this file are based on the Local education agency identification numbers (LEAID) of the school districts in which the Wave I school, Wave I residence, and Wave II residence were situated. The first two characters of this LEAID represent state and reflect state codes assigned by Add Health in other disseminated data similarly intended for clustering at different geographic areas. N=84,166

Wave_I___II_School_District_Grouping_Data_Codebook.pdf

Wave I Contextual Data (Contextual Data Files)

Community contextual variables based on state, county, tract, and block group levels derived from the Wave I addresses. N=20,745

Wave I Spatial Analysis Data (Contextual Data Files)

Pseudo coordinates that can be used to calculate distances between friends in a school community. N=20,301

Wave I Neighborhood Data (Contextual Data Files)

Pseudo state, county, tract, and block group variables that allow grouping of Add Health respondents geographically (based on Wave I addresses). N=20,745

Wave II Contextual Data (Contextual Data Files)

Community contextual variables based on state, county, tract, and block group levels derived from the Wave II addresses. N=14,738

Wave II Neighborhood Data (Contextual Data Files)

Pseudo state, county, tract, and block group variables that allow grouping of Add Health respondents geographically (based on Wave II addresses). N=14,738

Wave III Contextual Data (Contextual Data Files)

Community contextual variables based on state, county, tract, and block group levels derived from the Wave III addresses. N=15,197

Wave III Sex Ratio (Contextual Data Files)

In this file are constructed variables at the county-level for sex ratios for males to females ages 18 to 29, 30-34, and 18 to 34, the proportion of males and females ages 18 to 34, and the sex ratio of employed males 16 years and older to the total number of females 16 years and older for the white population and the black or African American population. N= 15,197

Wave III Grouping File Data (Contextual Data Files)

Pseudo state, county, tract, and block group variables in FIPS code format that allow grouping of Add Health respondents geographically (based on Wave III addresses).N=15,197

Wave III Region (Contextual Data Files)

This file contains the Census region codes for the respondents’ Wave III residential locations. N=15,197

Wave III Supplemental Tract-Level Contextual Data (Contextual Data Files)

This file contains supplemental Wave III contextual data that include transportation and commuting measures, climate descriptors, amenities, and state-level tobacco control influences. These variables are available at the census tract-level unless otherwise specified. N= 15,197

Wave IV Region (Contextual Data Files)

This file contains the Census region codes for the respondents’ Wave IV residential locations. N=15,197

Wave IV Grouping Data (Contextual Data Files)

The pseudo FIPS codes in this file allow you to geographically group respondents by their Wave IV locations. N=15,701

Wave IV Supplemental Tract-Level Contextual Data (Contextual Data Files)

This file contains tract-level measures, based on the Wave IV respondent locations, reported by the U.S. Census Bureau's 2009 American Community Survey (ACS), the Climate Atlas of the United States, the USDA Economics Research Service, Esri Data and Maps, ImpacTeen Tobacco Control policy and Prevalence Data, and the Uniform Crime Reports. When tract level measures were not available or appropriate, state and county level variables were used. N=15,701

Wave I Political Context Data (Contextual Data Files)

The Add Health Political Context files provide an array of measures that describe the political environments in which Add Health respondents reside. These contextual variables include measures of commuting, election results for gubernatorial, presidential, and senatorial races, and voter registration law. N= 20,745

Wave II Political Context Data (Contextual Data Files)

The Add Health Political Context files provide an array of measures that describe the political environments in which Add Health respondents reside. These contextual variables include measures of commuting, election results for gubernatorial, presidential, and senatorial races, and voter registration law. N= 14,738

Wave III Political Context Data (Contextual Data Files)

The Add Health Political Context files provide an array of measures that describe the political environments in which Add Health respondents reside. These contextual variables include measures of commuting, election results for gubernatorial, presidential, and senatorial races, and voter registration law. N=15,197

Wave III Alcohol Outlet Density Data (Contextual Data Files)

This Add Health data file measures the prevalence of alcohol outlets in respondent communities by reporting the tract-level density of establishments possessing on- and/or off-premise alcohol licenses. N=15,197

Wave IV Modified Retail Food Environment (mRFEI) Data (Contextual Data Files)

The Wave IV mRFEI file includes the mRFEI for each respondent based on their Wave IV residential location (n = 15,701).

Wave IV Ambient Air Pollutants Data (Contextual Data Files)

These files include 365 daily exposure estimates of the following ambient air pollutants for each Add Health study participant in Wave IV:

  • Particulate Matter:  PM2.5, PM10, SO4, NO3, NH4, organic carbon (OC), elemental carbon (EC), Fe, Al, Si, Ti, Ca, Mg, K, Mn, Na, and Cl
  • Gases:  O3, NO2, HNO3, HONO, H2O2, CO, and SO2; 2007 only: acrolein, acetaldehyde, benzene, butadiene, ethanol, formaldehyde, and naphthalene

Wave III & IV Sexual Minority Policy Data (Contextual Data Files)

This database includes indicators of the policy context for sexual minorities (a population typically defined based on co-residence with a same-sex partner or self-identification as gay, lesbian, or bisexual) in Waves III & IV of Add Health. These data will enable researchers to examine how state-level policies in young adulthood are associated with a wide range of indicators of health and well-being among sexual minorities. Wave III - N: 15,197 and Wave IV – N: 15,701

Wave IV College Characteristics Data (Contextual Data Files)

At Wave IV of the Add Health survey, respondents were asked if they had received a bachelor’s degree. This is a degree-level file. College-level data were linked to each degree based on the institution from which the respondent reported receiving each degree. N:15,810.

Wave III & IV College Mobility Data (Contextual Data Files)

These data were collected from a sample of college students who were born between 1980 and 1982 and who attended a college or university in the early 2000’s. At Wave III of the Add Health survey, respondents were asked if they were currently enrolled in a postsecondary institution.  At Wave IV of the Add Health survey, respondents were asked if they had received a bachelor’s degree. Respondents who answered in the affirmative were then asked to report the institution from which they were currently enrolled or received this degree. Wave III - N: 15,197 and Wave IV - N: 15,810

Wave I & IV County Health, Mobility, and Tobacco Tax Data (Contextual Data Files)

The Wave I & IV County Health and Mobility database summarizes the socioeconomic, health, and mobility characteristics of the environments in which Add Health participants were living at the time of their Wave I & IV interview. These variables are available at the county or state level. Wave I - N: 20,745 and Wave IV - N: 15,701

Wave II & III Tobacco Tax Data (Contextual Data Files)

Wave II & III Tobacco Tax Data file supplements the County Health and Mobility database available for Waves I & IV. Comprehensively, these data files provide tobacco tax for Add Health respondent state of residence from Wave I to Wave IV. Wave II - N: 14,738 and Wave III - N: 15,197

Wave I, III & IV Sunset Data (Contextual Data Files)

These data present average sunset time over the course of the year for the respondent’s Census block group. Users can input latitude and longitude coordinates and time zone information to retrieve the sunset time (along with other solar and lunar information) for any date between 1700 and 2100. . Wave I - N: 20,745, Wave III – N=15,197 and Wave IV - N: 15,701

Wave I, II, III, IV & V Grouping Data (Contextual Data Files)

These location identifiers are based on 2010 Census geographic boundaries and are longitudinally consistent across all waves. Previous Add Health waves released location identifiers based on the most recent Census for that wave, and are therefore not comparable through time. These identifiers are pseudo FIPS codes that were randomly generated from the actual Census block group FIPS codes.

Day-Night Level (DNL) Noise Exposures from 90 Major Airports (Exposome Data Files)

This data contains the estimation of aircraft noise measures around ninety major airports and aircraft noise proxies for approximately 900 additional airports. Merged with geopositioned/geocoded Add Health respondent locations over Waves I-VI, it also documents how the aircraft noise source data were acquired, as well as the protocol for quality controlling their assignment across waves. N=83,357

Aircraft_Noise_Measures_User_Guide.7z

Day-Night Level (DNL) Noise Exposures Codebook.7z

Equivalent Sound Level for a 15-Hour Day (LAEQD) Noise Exposures from 90 Major Airports (Exposome Data Files)

This data contains the estimation of aircraft noise measures around ninety major airports and aircraft noise proxies for approximately 900 additional airports. Merged with geopositioned/geocoded Add Health respondent locations over Waves I-VI, it also documents how the aircraft noise source data were acquired, as well as the protocol for quality controlling their assignment across waves. N=73,174

Aircraft_Noise_Measures_User_Guide.7z

Equivalent Sound Level for a 15-Hour Day (LAEQD) Codebook.7z

Equivalent Sound Level for a 9-Hour Night (LAEQN) Noise Exposures from 90 Major Airports (Exposome Data Files)

This data contains the estimation of aircraft noise measures around ninety major airports and aircraft noise proxies for approximately 900 additional airports. Merged with geopositioned/geocoded Add Health respondent locations over Waves I-VI, it also documents how the aircraft noise source data were acquired, as well as the protocol for quality controlling their assignment across waves. N=57,886

Aircraft_Noise_Measures_User_Guide.7z

Equivalent Sound Level for a 9-Hour Night (LAEQN) Codebook.7z

Proxies for Aircraft Noise from Other Airports: Airport Counts (Exposome Data Files)

This data contains the estimation of aircraft noise measures around ninety major airports and aircraft noise proxies for approximately 900 additional airports. Merged with geopositioned/geocoded Add Health respondent locations over Waves I-VI, it also documents how the aircraft noise source data were acquired, as well as the protocol for quality controlling their assignment across waves. N=166,195

Aircraft_Noise_Measures_User_Guide.7z

Proxies for Aircraft Noise from Other Airports- Airport Counts Codebook.7z

Proxies for Aircraft Noise from Other Airports: Mean Distances (Exposome Data Files)

This data contains the estimation of aircraft noise measures around ninety major airports and aircraft noise proxies for approximately 900 additional airports. Merged with geopositioned/geocoded Add Health respondent locations over Waves I-VI, it also documents how the aircraft noise source data were acquired, as well as the protocol for quality controlling their assignment across waves. N=256,318

Aircraft_Noise_Measures_User_Guide.7z

Proxies for Aircraft Noise from Other Airports- Mean Distances Codebook.7z

Proxies for Aircraft Noise from Other Airports: Mean Total Enplanements (Exposome Data Files)

This data contains the estimation of aircraft noise measures around ninety major airports and aircraft noise proxies for approximately 900 additional airports. Merged with geopositioned/geocoded Add Health respondent locations over Waves I-VI, it also documents how the aircraft noise source data were acquired, as well as the protocol for quality controlling their assignment across waves. N=250,740

Aircraft_Noise_Measures_User_Guide.7z

Proxies for Aircraft Noise from Other Airports- Mean Total Enplanements Codebook.7z

Rural-Urban Commuting Area (RUCA) Codes (Exposome Data Files)

Rural-urban commuting area (RUCA) codes classify U.S. census tracts using measures of population density, urbanization, and daily commuting. The data file including them is based on RUCA codes for census years 1990, 2000, and 2010. The rationale for and utility of acquiring RUCA codes, assigning them to census geographies in which Add Health respondents have resided over three decades. N=97,700

RUCA_Codes_User_Guide.7z

RUCA_Codes_Codebook.7z

Birth Records (Birth Records)

Birth record data was collected from participating states for AHSM birth years, 1974-83. When these states provided birth data for all recorded births occurring during that time interval, an AHSM-specific subset was created using Link Plus, a statistical linkage software developed by the U.S. Centers for Disease Control and Prevention (CDC), Cancer Division. One participating state performed its own AHSM linkages and provided Add Health with the linked subset of births. Add Health then performed transformations on all of the original data from the participating states to create the categorical variables present in this release. N=2,750, 15 variables

Birth_Records_Database_Codebook.7z

Birth_Records_Database_User_Guide.7z

Sexual Orientation/Gender Identity, Socioeconomic Status, and Health across the Life Course (SOGI-SES) (SOGI-SES Data Files)

This file contains new survey data to support exploration of the relationships among sexual orientation, gender identity, gender expression, romantic and sexual behaviors, socioeconomic status, and health. It contains social, demographic, behavioral, and health data collected in 2020-2021 on a sample of Add Health Wave V participants. N=2,614

SOGI-SES_Codebook.pdf

Individual Vital Status and Underlying Cause of Death File, 2019 (Wave V Surveillance Files)

This file contains one record for each of the 20,745 Add Health sample members from Wave I. It provides the vital status of each sample member as well as the National Death Index-provided underlying cause of death code in ICD-10 format for each decedent. The month and year of the most recent Add Health interview are provided for living sample members, while the month and year of death are provided for decedents.

Variables: 12

Observations: 20,745

Individual_Vital_Status_and_Underlying_Cause_of_Death_File__2019.pdf

Wave IV Polygenic Scores - Release 2 (Genetic Files)

Seventy-four constructed polygenic scores (PGSs) are available for Add Health respondents who provided archival saliva samples for genetic testing at Wave IV. Scores are available for various anthropomorphic, health, and behavioral outcomes. See the documentation for a list of all PGSs. N=9,130.

WaveIVPGSRelease2.zip

Polygenic Index Inventories (Genetic Files)

Polygenic scores computed by the SSGAC consortium for anthropometric traits, cognition/education, fertility/sexual development, health/health behaviors, and personality/well being. N=5,689.

PolygenicIndexInventories.zip

Polygenic Index Inventories - Release 2 (Genetic Files)

This data file is a 2022 update of the polygenic scores computed by the SSGAC consortium for anthropometric traits, cognition/education, fertility/sexual development, health/health behaviors, and personality/well being. N=5,689

Polygenic_Index_Inventories_-_Release_2_Codebook.pdf

Individual Vital Status and Underlying Cause of Death File, 2021 (Mortality Outcomes Surveillance, Part I Data Files)

This file contains one record for each of the 20,745 Add Health sample members from Wave I. It provides the vital status of each sample member through 2021 as well as the National Death Index-provided underlying cause of death code in ICD-10 format for each decedent. The month and year of the most recent Add Health interview are provided for living sample members, while the month and year of death are provided for decedents. N=20,745

Individual_Vital_Status_and_Underlying_Cause_of Death_File_2021_Codebook.pdf

Ordered Cause of Death File, 2021 (Mortality Outcomes Surveillance, Part I Data Files)

This file contains entity- and record-axis codes reported by the National Death Index (NDI) for each decedent in the Add Health sample through 2021. The file is arranged hierarchically, by axis code; therefore, each decedent may have multiple records depending on the maximum number of entity- and record-axis codes recorded by NDI. The sequence of the decedent’s records reflects the order in which the entity- and record-axis codes were reported in the NDI record. N=2,123

Ordered_Cause_of_Death_File_2021_Codebook.pdf

All Coded Causes of Death File, Including Entity-Axis Codes, 2021 (Mortality Outcomes Surveillance, Part I Data Files)

This file contains all underlying cause of death and entity-axis codes appearing in the National Death Index (NDI) source file through 2021. Functioning as dummy variables, zero represents the absence of a code on the decedent’s death certificate, while one denotes the presence of one. N=647

All_Coded_Causes_of_Death_File_Including_Entity-Axis_Codes_2021_Codebook.pdf

All Coded Causes of Death File, Including Record-Axis Codes, 2021 (Mortality Outcomes Surveillance, Part I Data Files)

This file contains all underlying cause of death and record-axis codes appearing in the National Death Index (NDI) source file through 2021. Functioning as dummy variables, zero represents the absence of a code on the decedent’s death certificate, while one denotes the presence of one. N=647

All_Coded_Causes_of_Death_File_Including_Record-Axis_Codes_2021_Codebook.pdf

Mortality_Outcomes_Surveillance_Part_I_Ascertaining_Decedents_User_Guide_(2021_Update).pdf

Baroreflex Sensitivity and Hemodynamic Recovery (Wave V Biomarker Files)

This file contains constructed measures for baroreflex sensitivity, heart rate recovery, and systolic blood pressure recovery for the Wave V respondents.

Variables: 4

Observations: 5381

baroreflex_sensitivity.7z

Measures of Inflammation and Immune Function (Wave V Biomarker Files)

This file contains additional measures of inflammation and immune function based on venous blood collected via phlebotomy at the Wave V home exam and then assayed for several cytokines (IL-1β; IL-6; IL-8; IL-10; TNF-α) and anti-cytomegalovirus (CMV) IgG.

Variables: 16

Observations: 5381

measures_inflammation_immune_function.7z

Neurodegeneration (NF-L and Tau) (Wave V Biomarker Files)

This file contains two measures of neurodegeneration based on venous blood collected via phlebotomy at the Wave V home exam and then assayed for neurofilament light (NfL) and tau.

Variables: 5

Observations: 5381

neurodegeneration.7z

Wave V Contextual Despair (Contextual Data Files)

This contextual data set focuses on the social, political, and resource environment of Add Health respondents at the tract, county, and state level that are relevant to the prevailing causes of death in midlife - namely alcohol-related diseases, drug overdoses and accidental poisonings, and suicide and self-inflicted harm. Most measures are specific to Wave V residential location, though several measures span multiple waves. Measures include the sociodemographic and segregation context, proximity to firearms distributors and alcohol outlets, opioid dispensing, and policies related to alcohol, drugs, and firearms. N=20745, v=266

Wave_V_Contextual_Despair_Codebook.pdf

Wave_V_Contextual_Despair_User_Guide.pdf

Individual Vital Status and Underlying Cause of Death File, 2022 (Mortality Outcomes Surveillance, Part I Data Files)

This file contains one record for each of the 20,745 Add Health sample members from Wave I. It provides the vital status of each sample member through 2022 as well as the National Death Index-provided underlying cause of death code in ICD-10 format for each decedent. The month and year of the most recent Add Health interview are provided for living sample members, while the month and year of death are provided for decedents. N=20,745

Mortality_Outcomes_Surveillance_Part_I_Ascertaining_Decedents_User_Guide__2022_Update_.7z

Underlying_Cause_of_Death_File__2022.7z

Ordered Cause of Death File, 2022 (Mortality Outcomes Surveillance, Part I Data Files)

This file contains entity- and record-axis codes reported by the National Death Index (NDI) for each decedent in the Add Health sample through 2022. The file is arranged hierarchically, by axis code; therefore, each decedent may have multiple records depending on the maximum number of entity- and record-axis codes recorded by NDI. The sequence of the decedent’s records reflects the order in which the entity- and record-axis codes were reported in the NDI record. N=2,377

Mortality_Outcomes_Surveillance_Part_I_Ascertaining_Decedents_User_Guide__2022_Update_.7z

Ordered_Cause_of_Death_File__2022.7z

All Coded Causes of Death File, Including Entity-Axis Codes, 2022 (Mortality Outcomes Surveillance, Part I Data Files)

This file contains all underlying cause of death and entity-axis codes appearing in the National Death Index (NDI) source file through 2022. Functioning as dummy variables, zero represents the absence of a code on the decedent’s death certificate, while one denotes the presence of one. N=706

Mortality_Outcomes_Surveillance_Part_I_Ascertaining_Decedents_User_Guide__2022_Update_.7z

All_Coded_Causes_of_Death_File__Including_Entity-Axis_Codes__2022.7z

All Coded Causes of Death File, Including Record-Axis Codes, 2022 (Mortality Outcomes Surveillance, Part I Data Files)

This file contains all underlying cause of death and record-axis codes appearing in the National Death Index (NDI) source file through 2022. Functioning as dummy variables, zero represents the absence of a code on the decedent’s death certificate, while one denotes the presence of one. N=706

Mortality_Outcomes_Surveillance_Part_I_Ascertaining_Decedents_User_Guide__2022_Update_.7z

All_Coded_Causes_of_Death_File__Including_Record-Axis_Codes__2022.7z

Contextual Heterosexism Database, Phase 1 (Contextual Data Files)

This contextual database, Contextual Heterosexism Database-Phase 1 (CHD1), further expands the collection of contextual data available to users of The National Longitudinal Study of Adolescent to Adult Health (Add Health) through the provision of state, county, and tract level measures from the Decennial Census of Population and Housing, American Community Survey (ACS), the Movement Advancement Project (MAP), Lax and Phillips (2009), Public Religion Research Institute (PRRI), Cooperative Election Study (CES), U.S. Religion Census, and Massachusetts Institute of Technology (MIT) Election Lab. These data include indicators of social policies, social climate, and confounding factors related to the study/measurement of structural heterosexism that correspond to Waves 3, 4, and 5. Some of these indicators are new to the Add Health contextual database and others were previously not available at all three of these waves.

Contextual_Heterosexism_User_Guide_Phase_1.7z

Contextual_Heterosexism_Database__Phase_1_Codebook.7z

Wave I In-Home Friendship Nominations (Friendship Files)

Identification numbers of the friends that the respondent nominated during the Wave I in-home interview. N=20,745

Wave II In-Home Friendship Nominations (Friendship Files)

Identification numbers of the friends that the respondent nominated during the Wave II in-home interview. N=14,738

Wave III Friend IDs (Friendship Files)

Add Health respondents who were in the 7th or 8th grade at Wave I were asked at Wave III to identify, from a list of 10 computer-generated names, which ones were current friends or which ones were their friends when they were in school together. This dataset contains the altered identification numbers (AIDs) of the 10 computer-generated names. N=3,572

Wave III Sibling IDs (Sibling Files)

At Wave III, Add Health respondents were asked questions about their siblings who also participated in the Wave I or II in-home interviews. This dataset contains the AIDs for these siblings. N= 4,368

Wave III BEM Scores Data (Wave III Supplemental Files)

The masculinity and femininity raw and standard scores from the 30 item short form BEM Sex-Role Inventory are available in this file. N=15,197

Wave III Mentor Data (Wave III Supplemental Files)

For Wave III respondents who reported having a mentor, the open-ended responses to the question “How did {HE/SHE} help you?” have been coded and are available in this file. N=15,197

Wave III Academic Networks (Wave III Education Files)

The Network files provide information on social networks based on the respondents’ course-taking patterns. N varies by file

Wave III Context (Wave III Education Files)

School level contextual data are from the Common Core of Data (CCD), Private School Survey (PSS), the 1990 and 2000 Census, and the Office of Civil Rights. N varies by file

Wave III Course Level (Wave III Education Files)

The data in this file are needed for merging the course-level curriculum data with other Education Files. N=516,146

Wave III Curriculum (Wave III Education Files)

These math and science curriculum data are derived from coding the textbooks schools reported using for each course offered in these two subjects. N varies by file

Wave III Linking (Wave III Education Files)

This file contains variables designed to link transcript data to academic or school years and to Add Health. N= 12,237

Wave III Primary (Wave III Education Files)

The Primary Component contains several types of indicators based on information collected from participating schools and listed directly on student transcripts such as student exit or graduation status and materials gathered from schools during the data collection process. N varies by file

Wave III Transition (Wave III Education Files)

This file contains variables explaining the respondents’ movement through the educational system. N=12,217

Wave III Education Weights (Wave III Education Files)

Files contain weights for the education data along with the school weights needed for HLM analysis. N varies by file

In-School Weight Components (Additional Weight Files)

A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights. N= 90,118

Wave II In-Home Weight Components (Additional Weight Files)

A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave II. N= 14,738

Wave III In-Home Weight Components (Additional Weight Files)

A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave III. N=15,197

Wave IV Weight Components (Additional Weight Files)

A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave IV. N= 15,701

Add Health School Weights (Additional Weight Files)

The initial weights for the school are in this file. N=132

National Death Index File (Disposition Files)

This file contains the underlying cause of death and days alive after Wave I interview. N=227

Wave III Disposition File (Disposition Files)

This file contains the final outcome of the 20,058 cases fielded at Wave III. N=20,058

Wave IV Disposition File (Disposition Files)

This file contains the final outcome of the 19,962 cases fielded at Wave IV. N=19,962

Wave I Street Connectivity File (ONE: Obesity and Neighborhood Environment Files)

These files contain road network connectivity measures within 1, 3, 5, and 8.05 km (5 miles) of the Wave I respondent locations. N= 20,745

Wave_I_and_III_Street_Connectivity_Files.zip

Wave I Crime File (ONE: Obesity and Neighborhood Environment Files)

The county-level crime data in these files are based on the Wave I respondent locations. N= 20,745

Wave_I_and_III_Crime_Files.zip

Wave I Geocode Source (ONE: Obesity and Neighborhood Environment Files)

The data source of the Wave I respondent residential geocodes (latitude and longitude) are provided in these files. N= 20,745

Wave_I_and_III_Geocode_Source.zip

Wave I Land Cover Data (ONE: Obesity and Neighborhood Environment Files)

These files contain land cover metrics within 1, 3, 5, and 8.05 km (5 miles) of Wave I respondent locations. N= 20,745

Wave_I_and_III_Land_Cover_Data.zip

Wave I Parks Data (ONE: Obesity and Neighborhood Environment Files)

The counts of public parks within a Euclidean distance of 1, 3, 5, and 8.05 kilometers (5 miles) of each respondent at Wave I are in these files. N=20,745

Wave_I_and_III_Parks_Data.zip

Wave I Resources Data (ONE: Obesity and Neighborhood Environment Files)

These Add Health files provide data on the presence of various physical activity (PA) resources situated near respondent residences at Wave I. N=20,745

Wave I Urban Distances (ONE: Obesity and Neighborhood Environment Files)

Contains Euclidean distances to both 1990 and 2000 U.S. Census Urbanized Areas (UAs) for each Wave I respondent. N=20,745

Wave_I_and_III_Urban_Distances.zip

Wave I Weather Data (ONE: Obesity and Neighborhood Environment Files)

This file contains weather data for Wave I respondents based on the nearest weather station reporting data for the corresponding survey month and year. N=20,745

Wave_I_and_III_Weather_Data.zip

Wave I Population Density Data (ONE: Obesity and Neighborhood Environment Files)

The Wave I population density file contains the proportion of 1990 U.S. Census block group population and are (in square meters) within 1, 3, 5, and 8.04672 km (5 mi) of each Wave I respondent. N=20,745

Wave_I_and_III_Population_Density_Data.zip

Wave I School Distance Measures (ONE: Obesity and Neighborhood Environment Files)

This file contains the distance between the geocoded point locations of each respondent’s Wave I location and that respondent’s school. N=20,745

w1schdis.pdf

Wave I ACCRA Data (ONE: Obesity and Neighborhood Environment Files)

These data files report ACCRA Cost of Living Index data for Wave I based on respondent location and the year and quarter of the Add Health interview. N=20,745

Wave_I_and_III_ACCRA_Data.zip

Wave I Employment Files (ONE: Obesity and Neighborhood Environment Files)

Certain county-level employment data from the U.S. Bureau of Labor Statistics are attached to Wave I respondent locations. N=20,745

Wave_I_and_III_Employment_Files.zip

Wave I Length of Day (ONE: Obesity and Neighborhood Environment Files)

These data files contain the number of hours of daylight at each Wave I respondent location based on that respondent’s latitude and survey date. N=20,745

Wave_I_and_III_Length_of_Day.zip

Wave I Road Type Length Data (ONE: Obesity and Neighborhood Environment Files)

Road type length calculations within radii of 1, 3, 5, and 8.05 kilometers (5 miles) of Wave I respondent locations. N=20,745

Wave_I_and_III_Road_Type_Length_Data.zip

Wave I Rural-Urban Commuting Area (RUCA) Codes (ONE: Obesity and Neighborhood Environment Files)

These data files define the rural-urban commuting characteristics of Wave I respondent locations at the U.S. Census tract-level using the 1990 and 2000 RUCA codes developed by the U.S. Department of Agriculture’s Economic Research Service. N=20,745

Wave_I_and_III_RUCA_Codes.zip

Wave I Grouping File (ONE: Obesity and Neighborhood Environment Files)

This file is for use with the Obesity and Neighborhood Environment (ONE) data only.  It is based on recoded Wave I addresses and contains pseudo state, county, tract, and block group variables in FIPS code format that allow for grouping of Add Health respondents geographically. N: 20,745

w1grpone.pdf

Wave III Street Connectivity Files (ONE: Obesity and Neighborhood Environment Files)

These files contain road network connectivity measures within 1, 3, 5, and 8.05 km (5 miles) of the Wave III respondent locations. N= 15,197

Wave_I_and_III_Street_Connectivity_Files.zip

Wave III Crime File (ONE: Obesity and Neighborhood Environment Files)

The county-level crime data in these files are based on the Wave III respondent locations. N= 15,197

Wave_I_and_III_Crime_Files.zip

Wave III Geocode Source (ONE: Obesity and Neighborhood Environment Files)

The data source of the Wave III respondent residential geocodes (latitude and longitude) are provided in these files. N= 15,197

Wave_I_and_III_Geocode_Source.zip

Wave III Land Cover Data (ONE: Obesity and Neighborhood Environment Files)

These files contain land cover metrics within 1, 3, 5, and 8.05 km (5 miles) of Wave III respondent locations. N= 15,197

Wave_I_and_III_Land_Cover_Data.zip

Wave III Parks Data (ONE: Obesity and Neighborhood Environment Files)

The counts of public parks within a Euclidean distance of 1, 3, 5, and 8.05 kilometers (5 miles) of each respondent at Wave III are in these files. N= 15,197

Wave_I_and_III_Parks_Data.zip

Wave III Resources Data (ONE: Obesity and Neighborhood Environment Files)

These Add Health files provide data on the presence of various physical activity (PA) resources situated near respondent residences at Wave III. N=15,197

Wave III Urban Distances (ONE: Obesity and Neighborhood Environment Files)

Contains the Euclidean distance to 2000 U.S. Census Bureau-defined urbanized areas (UAs) for each Wave III respondent. N= 15,197

Wave_I_and_III_Urban_Distances.zip

Wave III Weather Data (ONE: Obesity and Neighborhood Environment Files)

This file contains weather data for Wave III respondents based on the nearest weather station reporting data for the corresponding survey month and year. N=15,197

Wave_I_and_III_Weather_Data.zip

Wave III Population Density Data (ONE: Obesity and Neighborhood Environment Files)

The Wave III population density file contains the proportion of 2000 U.S. Census block group population and area (in square meters) within 1, 3, 5, and 8.04672 km (5 mi) of each Wave III respondent. N=15,197

Wave_I_and_III_Population_Density_Data.zip

Wave III Mobility Data (ONE: Obesity and Neighborhood Environment Files)

Reports the distance between each respondent’s geocoded point location for each survey wave and that respondent’s school location, along with the respondent’s move distance between each survey wave. N=20,745

w3mobind.pdf

Wave III MSA Pseudo Codes (ONE: Obesity and Neighborhood Environment Files)

The MSA pseudo code created for each respondent’s Wave III location is in this file. N=15,197

w3MSA.pdf

Wave III ACCRA Data (ONE: Obesity and Neighborhood Environment Files)

These data files report ACCRA Cost of Living Index data for Wave III based on respondent location and the year and quarter of the Add Health interview. N=15,197

Wave_I_and_III_ACCRA_Data.zip

Wave III Employment Files (ONE: Obesity and Neighborhood Environment Files)

Certain county-level employment data from the U.S. Bureau of Labor Statistics are attached to Wave III respondent locations. N=15,197

Wave_I_and_III_Employment_Files.zip

Wave III Length of Day (ONE: Obesity and Neighborhood Environment Files)

These data files contain the number of hours of daylight at each Wave III respondent location based on that respondent’s latitude and survey date. N=15,197

Wave_I_and_III_Length_of_Day.zip

Wave III Road Type Length Data (ONE: Obesity and Neighborhood Environment Files)

Road type length calculations within radii of 1, 3, 5, and 8.05 kilometers (5 miles) of Wave III respondent locations. N=15,197

Wave_I_and_III_Road_Type_Length_Data.zip

Wave III Rural-Urban Commuting Area (RUCA) Codes (ONE: Obesity and Neighborhood Environment Files)

These data files define the rural-urban commuting characteristics of Wave III respondent locations at the U.S. Census tract-level using the 1990 and 2000 RUCA codes developed by the U.S. Department of Agriculture’s Economic Research Service. N=15,197

Wave_I_and_III_RUCA_Codes.zip

Wave III HPV-MGEN Assay Data (Wave III Biomarker Files)

Assay results for human papillomavirus and mycoplasma genitalium are available for a subset of the Wave III respondents who provided a urine sample. N=5,126

hpvmgen.pdf

Wave III HPV-MGEN Assay Weights (Wave III Biomarker Files)

Sample weights for respondents with HPV and MGEN assay results are in this file. N=14,322

WHPVMGEN.pdf

Wave III Urinalysis Data (Wave III Biomarker Files)

This file contains nitrate, specific gravity, pH level, white blood cells, protein, glucose, ketone, urobilinogen, bilirubin, microalbumin, urine creatinine, and blood values from the Wave III urine specimens. N= 15,197

URINE.pdf

Wave IV Medication File (Wave IV Biomarker Files)

This file provides the therapeutic classification codes for the medications reported at Wave IV. N=10,711

w4meds.pdf

Wave IV Narcotic Drug Flag (Wave IV Biomarker Files)

This file contains a flag that identifies Wave IV respondents who report taking a medication that contains a narcotic. N= 15,701

narcotic.pdf

Wave IV Glucose (Wave IV Biomarker Files)

This file contains two measures of glucose homeostasis based on the assay of the Wave IV dried blood spots. N= 15,701

glu_a1c.pdf

Wave IV CRP-EBV (Wave IV Biomarker Files)

The results of the assays for CRP (C-reactive protein) and EBV (Epstein-Barr virus) are in this data file. N= 15,701

ebv_crp.pdf

Wave IV Lipids (Wave IV Biomarker Files)

This file contains constructed measures designed to facilitate analysis and interpretation of lipids results. N= 15,701

lipids.pdf

Wave IV Baroreceptor Sensitivity (Wave IV Biomarker Files)

This file contains constructed measures for baroreflex sensitivity, heart rate recovery, and systolic blood pressure recovery for the Wave IV respondents. N= 15,701

Baroreflex_Sensitivity.pdf

Wave IV Education Genetic Risk Score (Genetic Files)

An education genetic risk score (GRS_EDU) is available for Add Health twin and full sibling respondents who provided saliva samples at Wave IV. This variable is the weighted sum of risk alleles identified in the Rietveld et al. (2013) genome-wide association study. N= 1,886

GRS_EDU.pdf

Wave III DNA Data (Genetic Files)

Twin and full siblings interviewed at Wave III were asked to provide saliva samples for DNA analysis. This file contains the genotype values for DAT1 (dopamine transporter), DRD4 (dopamine receptor), and SLC6A4 (serotonin transporter), MAOA_V (monoamine oxidase A-uVNTR), DRD2 (dopamine D2 receptor), and CYP2A6 (cytochrome P450 2A6) from these samples. Also included are values for the following SNPs: rs2304297, rs892413, rs4950, rs13280604. N=2,574

W3DNA.pdf

Wave IV DNA Data (Genetic Files)

Contains genotyping results for all Wave IV respondents who agreed to provide a saliva sample for DNA testing. This dataset has values for DAT1 (dopamine transporter), DRD4 (dopamine receptor), MAOA (monoamine oxidase A-uVNTR), 5HTTLPR (serotonin transporter), HTTLPR La-Lg-S, triallelic activity bins for the serotonin transporter 5HTTLPR adjusted for rs25531, DRD2, s000005, s000006, DRD5, and MAOCA1. N=15,701

w4dna.pdf

Wave III Partner In-home (Romantic Pairs)

In Wave III, 1,507 partners of Add Health respondents were interviewed. These files contain the partner data for the Wave III in-home interview, AHPVT, field interviewer characteristics, and links to the Add Health respondents. N= 1,507

W3_In-home_interview-partner.zip

Wave III Partner ASHA Call (Romantic Pairs)

To receive the results of their STD assays, partners of Wave III respondents called an Add Health dedicated number at the American Social Health Association. This file provides information on who called the results hotline and the date and time of the call. N=332

callashap.pdf

Wave III Partner Urinalysis (Romantic Pairs)

This file contains nitrate, specific gravity, pH level, white blood cells, protein, glucose, ketone, urobilinogen, bilirubin, and blood values from the Wave III partner urine specimens. N=1,507

urinep.pdf

Wave III Climate Files (ONE: Obesity and Neighborhood Environment Files)

This file contains the climate data for Wave III respondents based on the nearest climate station. Information is available on precipitation, total snowfall, sky cover, temperature, and total hours of sunshine. N=15,197

Wave_I_and_III_Climate_Files.zip

Temporary data (Add Health) (Temporary data (Add Health))

Requirements:

This is a place holder for the Add Health data importing process.

Wave III Academic Transcript Social Studies and Civic Coursework Data (Wave III Education Files)

The Wave III ATRCVC data is a course-by-student-level file that includes academic transcript data related to social studies and civic coursework N = 93,651.

Wave IV Polygenic Scores - Release 1 (Genetic Files)

Thirty constructed polygenic scores (PGS) are available for Add Health respondents who provided archival saliva samples for genetic testing at Wave IV. Scores are available for coronary artery disease, myocardial infarction, plasma cortisol, LDL cholesterol, HDL cholesterol, total cholesterol, triglycerides, type II diabetes (2 measures), BMI, waist circumference, waist-to-hip ratio, height, age at menarche, age at menopause, number of children, age at first birth, ever/current smoker, number of cigarettes per day, extraversion, attention deficit disorder (2 measures), bipolar disorder, major depressive disorder (2 measures), schizophrenia, mental health cross disorder, Alzheimer’s disease, and educational attainment (2 measures). N = 9,129

Wave_IV_Polygenic_Scores_-_Release_1.zip

Wave IV Polygenic Scores – SSGAC (Genetic Files)

Polygenic scores (PGS) constructed by the Social Science Genetic Association Consortium (SSGAC) are available for Add Health respondents who provided archival saliva samples for genetic testing at Wave IV.  This data file contains educational attainment, cognitive performance, depression, neuroticism, and subjective well-being scores based on standard GWAS summary statistics and multilevel analysis (2 scores for each construct).  Additional multivariate analysis scores for highest level math taken and math ability are also included.  N=5,690

SSGACPGS.pdf

Parents (2015-2017) – Family Health History (Parents (2015-2017))

These two files contain data from the Wave I parent and spouse/partner who returned the Parents Phase 2 family health history leave-behind forms. fhhp2: N=2,247 / fhhsp2: N=1,118.

Parents_2015-2017_Family_Health_History.zip

Parents (2015-2017) – Disposition File (Parents (2015-2017))

This file contains the final disposition codes for Wave I parents selected for the Parents Phase 2 interview. N = 20,745.

dspp2.pdf

Parents (2015-2017) – National Death Index File (Parents (2015-2017))

This file contains the underlying cause of death and days alive after the Wave I interview for Wave I parents selected for the Parents Phase 2 interview who are coded as deceased in the disposition file. N = 226

ndip2.pdf

Parents (2015-2017) – Medications Files (Parents (2015-2017))

Therapeutic classifications for medications data collected (2015-17) from Wave I Parent and Spouse or Parent.

Parents_2015-2017_Medication_Files.zip

PGS Risk-Tolerance (Genetic Files)

Contains polygenic scores for general risk tolerance, adventurousness and risky behaviors in the driving, drinking, smoking and sexual domains. Polygenic scores were created for unrelated, Add Health participants of European Ancestry (n=4755). Score metrics were generated using Plink, LDPred or MTAG software using the UK Biobank GWAS study.

PGS_Risk_Tolerance_Codebook_and_Documentation.pdf

Wave V Constructed Age (Constructed Data Files)

The Wave V survey did not ask respondent age though it is possible to calculate age using interview date and date of birth. For deductive disclosure reasons complete interview dates and dates of birth were collected but not released. To provide calculated ages two variables have been constructed based on complete interview dates and date of birth month/year and 15 as a universal assigned day of birth.

Variables: 3

Observations: 12,300

WaveVConstructedAgeCodebook.pdf

Wave V Sample 2B (part of Core Files bundle)

Sample S2B is part of the Wave V survey. S2B was administered 2 June 2017-8 July 2018. While most variables were released in December 2019 as Wave V data, two sections were not. These specifically S2B cases are a representative sample (asked of everyone in the sample). Web survey content for the other samples (1, 2A, 3) were edited mid-data collection and not all respondents in each sample had the opportunity to answer the questions.

Variables: 163

Observations: 1,101

WaveVSample2BCodebook.pdf

Wave V Mover Distance (Constructed Data Files)

The dataset provides the distance, in meters, between each respondent's home residence in each of the previous waves and their Wave V residence.

Variables: 6

Observations: 12,300

WaveVMoverDistanceCodebook.pdf

Ordered Cause of Death File, 2019 (Wave V Surveillance Files)

This file contains entity- and record-axis codes reported by the National Death Index (NDI) for each decedent in the Add Health sample. The file is arranged hierarchically, by axis code; therefore, each decedent may have multiple records depending on the maximum number of entity- and record-axis codes recorded by NDI. The sequence of the decedent’s records reflects the order in which the entity- and record-axis codes were reported in the NDI record.

Variables: 7

Observations: 1,745

Ordered_Cause_of_Death_File__2019.pdf

All Coded Causes of Death File, Including Entity-Axis Codes, 2019 (Wave V Surveillance Files)

This file contains all underlying cause of death and entity-axis codes appearing in the National Death Index (NDI) source file. Functioning as dummy variables, zero represents the absence of a code on the decedent’s death certificate, while one denotes the presence of one.

Variables: 88

Observations: 540

All_Coded_Causes_of_Death_File__Including_Entity-Axis_Codes__2019.pdf

All Coded Causes of Death File, Including Record-Axis Codes, 2019 (Wave V Surveillance Files)

This file contains all underlying cause of death and record-axis codes appearing in the National Death Index (NDI) source file. Functioning as dummy variables, zero represents the absence of a code on the decedent’s death certificate, while one denotes the presence of one.

Variables: 83

Observations=540

All_Coded_Causes_of_Death_File__Including_Record-Axis_Codes__2019.pdf

Wave V Renal Function Addendum (Wave V Biomarker Files)

This file contains constructed measures designed to facilitate analysis and interpretation of renal function based on venous blood collected via phlebotomy at the Wave V home exam. Updated estimations of the glomerular filtration rate (GFR) based on 2021 guidelines are available using either the creatinine concentration or using both the creatinine and cystatin C concentrations. The estimations of GFR have been calculated according to the 2021 NIDDK CKD-EPI guidelines. Classifications of the new estimations according to both clinical and KDIGO guidelines are available as well.

Variables: 7

Observations: 5,381

WaveVRenalFunctionAddendum.zip

Wave V Hepatic Injury (Wave V Biomarker Files)

This file contains constructed measures designed to facilitate analysis and interpretation of hepatic injury based on venous blood collected via phlebotomy at the Wave V home exam. Assay results for aspartate aminotransferase (AST) and alanine aminotransferase (ALT) are available, as well as three semi-quantitative serum index assays – lipemia, hemolysis and icterus – to evaluate the possibility of interference with the AST or ALT assays. Moreover, two constructed measures are available - the AST/ALT ratio and its classification.

Variables: 10

Observations: 5,381

WaveVHepaticInjury.zip

Sexual Orientation/Gender Identity, Socioeconomic Status, and Health across the Life Course (SOGI-SES) - Sensitive (SOGI-SES Data Files)

This file contains the SOGI-SES study's sensitive data variables related to gender identity, in vitro fertilization, and HIV status. N=2,614

SOGI-SES_Sensitive_Codebook.pdf

Wave I In-Home Interview Data

A merged file containing the Wave I In-Home Interview data, Parent Questionnaire data and the Add Health Picture Vocabulary data, collected in 1994–1995. N= 20,745

Wave I School Administrator Questionnaire Data

Information from the Wave I (self-administered) questionnaires answered by administrators at the sampled schools. N=172

School Information Data

Additional information about the individual schools. N=172

Wave I In-School Questionnaire Data

Adolescent responses to the In-School Questionnaire administered September 1994 through April 1995. N= 90,118

Wave I In-Home Weights

Variables needed to correct for design effects and weight the Wave I in-home data. N=20,745

Wave I School Administrator Weights

Variables needed to correct for design effects and weight the Wave I school administrator data. N=164

Wave I Grand Sample Weights

Variables needed to correct for design effects and weight the Wave I in-school data. N=90,118

Wave II In-Home Interview Data

Data collected during the 1996 in-home interview. N= 14,738

Wave II School Administrator Questionnaire Data

Information from the Wave II (phone administered) questionnaires answered by administrators at the sampled schools. N=128

Wave II Grand Sample Weights

Variables needed to correct for design effects and weight the Wave II in-home data. N=14,738

Wave III In-Home Interview Data

Respondent-level data collected during the 2001?2002 in-home interview includes field interviewer characteristics, AHPVT. N=15,197

Wave III Grand Sample Weights

Variables needed to correct for design effects and weight the Wave III in-home data, including longitudinal and cross-sectional weights. N=15,197

Wave IV In-Home Interview Data

Data Collected during the 2008 in-home interview. N=15,701

Wave IV Grand Sample Weights

Variables needed to correct for design effects and weight the Wave IV in-home data, including longitudinal and cross-sectional weights. N=15,701

Wave V Mixed-Mode Survey Data

Respondent data from the Wave V mixed-mode survey. This release includes survey, weights, disposition, survey medications and Section 16b data sets. N=12,300

WaveVMixedModeSurvey.zip

Wave V Sample 2B

Sample S2B is part of the Wave V survey. S2B was administered 2 June 2017-8 July 2018. While most variables were released in December 2019 as Wave V data, two sections were not. These specifically S2B cases are a representative sample (asked of everyone in the sample). Web survey content for the other samples (1, 2A, 3) were edited mid-data collection and not all respondents in each sample had the opportunity to answer the questions.

Variables: 163

Observations: 1,101

WaveVSample2BCodebook.pdf

Wave V Sample 2B Data Add-on

(For Existing Contracts Only) We will update your existing contract to include Wave V Sample 2B and any other data that were released into the Core Files Bundle if you do not have them yet. It could include Wave V Mixed-Mode Survey Data.

Wave IV Consent

This file contains variables indicating the types of consent (archive, no archive, refused, incarcerated) obtained for the Wave IV blood spot and saliva DNA collections. N= 15,701

consent4.pdf

Wave IV Medication File

This file provides the therapeutic classification codes for the medications reported at Wave IV. N=10,711

w4meds.pdf

Wave IV Narcotic Drug Flag

This file contains a flag that identifies Wave IV respondents who report taking a medication that contains a narcotic. N= 15,701

narcotic.pdf

Wave IV Glucose

This file contains two measures of glucose homeostasis based on the assay of the Wave IV dried blood spots. N= 15,701

glu_a1c.pdf

Wave IV CRP-EBV

The results of the assays for CRP (C-reactive protein) and EBV (Epstein-Barr virus) are in this data file. N= 15,701

ebv_crp.pdf

Wave IV Lipids

This file contains constructed measures designed to facilitate analysis and interpretation of lipids results. N= 15,701

lipids.pdf

Wave IV Baroreceptor Sensitivity

This file contains constructed measures for baroreflex sensitivity, heart rate recovery, and systolic blood pressure recovery for the Wave IV respondents. N= 15,701

Baroreflex_Sensitivity.pdf

Wave V Demographics - Home Exam

This file contains various demographic variables in regards to the Wave V home exam, including the date of the home exam, number of days between the Wave V survey and the home exam, the home exam completion status, as well as the respondent’s age, biological sex, pregnancy status, medication use and blood draw status.

Variables: 12

Observations: 5,381

WaveVDemographics-HomeExamCodebook.pdf

Wave V Anthropometrics

This file contains anthropometric variables constructed from the measurements taken at the Wave V home exam. The measurements include arm circumference, height, weight, and waist circumference. The file also contains BMI, as well as classification variables for BMI and waist circumference.

Variables: 13

Observations: 5,381

WaveVAnthropometrics.zip

Wave V Cardiovascular Measures

This file contains cardiovascular measures constructed from the three serial measurements of blood pressure and pulse rate collected at the Wave V home exam. The measures include systolic and diastolic blood pressure, pulse rate, pulse pressure and mean arterial pressure. Other variables included are classifications of blood pressure, flags based on self-reported medical history, an antihypertensive medication use flag and a hypertensive joint classification variable.

Variables: 31

Observations: 5,381

WaveVCardiovascularMeasures.zip

Wave V Medications - Home Exam

This file provides the therapeutic classification codes, as well as numerous medication flag variables, to identify the types of medication (both prescription and over-the-counter) used by respondents as reported at the Wave V home exam.

Variables: 35

Observations: 11,105

WaveVMedications-HomeExam.zip

Wave V Glucose Homeostasis

This file contains assay results of glucose and hemoglobin A1c (HbA1c) based on venous blood collected via phlebotomy at the Wave V home exam. There are classifications for fasting glucose and non-fasting glucose, as well as an HbA1C classification. Other variables included are flags based on a self-reported diabetes medical history, anti-diabetic medication use, and a diabetes joint classification variable.

Variables: 11

Observations: 5,381

WaveVGlucoseHomeostasis.zip

Wave V Lipids

This file contains constructed measures designed to facilitate analysis and interpretation of lipids results based on venous blood collected via phlebotomy at the Wave V home exam. In addition to the lipid assay results, there are classifications according to both the NCEP/ATP III and AHA/ACC guidelines, a flag for antihyperlipidemic medication use, and two hyperlipidemia joint classification variables.

Variables: 26

Observations: 5,381

WaveVLipids.zip

Wave V Renal Function

This file contains constructed measures designed to facilitate analysis and interpretation of renal function based on venous blood collected via phlebotomy at the Wave V home exam. Assay results for creatinine and cystatin-c are available, as well as three different estimations of the glomerular filtration rate (GFR) using either the creatinine concentration, the cystatin C concentration or both concentrations. Classifications according to both clinical and KDIGO guidelines are available as well.

Variables: 14

Observations: 5,381

WaveVRenalFunction.zip

Wave V Inflammation and Immune Function

This file contains constructed measures designed to facilitate analysis and interpretation of inflammation and immune function based on venous blood collected via phlebotomy at the Wave V home exam. In addition to the assay results for high sensitivity C-reactive protein (hsCRP), there is an AHA/CDC classification, counts of subclinical symptoms and common infectious or inflammatory diseases, and various anti-inflammatory medication use flags.

Variables: 26

Observations: 5,381

WaveVInflammation_ImmuneFunction.zip

Wave V Biomarker Weight

This file contains the Wave V biomarker sample weight.

Variables: 2

Observations: 5,381

WaveVBiomarkerSampleWeight.zip

Baroreflex Sensitivity and Hemodynamic Recovery

This file contains constructed measures for baroreflex sensitivity, heart rate recovery, and systolic blood pressure recovery for the Wave V respondents.

Variables: 4

Observations: 5381

baroreflex_sensitivity.7z

Measures of Inflammation and Immune Function

This file contains additional measures of inflammation and immune function based on venous blood collected via phlebotomy at the Wave V home exam and then assayed for several cytokines (IL-1β; IL-6; IL-8; IL-10; TNF-α) and anti-cytomegalovirus (CMV) IgG.

Variables: 16

Observations: 5381

measures_inflammation_immune_function.7z

Neurodegeneration (NF-L and Tau)

This file contains two measures of neurodegeneration based on venous blood collected via phlebotomy at the Wave V home exam and then assayed for neurofilament light (NfL) and tau.

Variables: 5

Observations: 5381

neurodegeneration.7z

Wave V Renal Function Addendum

This file contains constructed measures designed to facilitate analysis and interpretation of renal function based on venous blood collected via phlebotomy at the Wave V home exam. Updated estimations of the glomerular filtration rate (GFR) based on 2021 guidelines are available using either the creatinine concentration or using both the creatinine and cystatin C concentrations. The estimations of GFR have been calculated according to the 2021 NIDDK CKD-EPI guidelines. Classifications of the new estimations according to both clinical and KDIGO guidelines are available as well.

Variables: 7

Observations: 5,381

WaveVRenalFunctionAddendum.zip

Wave V Hepatic Injury

This file contains constructed measures designed to facilitate analysis and interpretation of hepatic injury based on venous blood collected via phlebotomy at the Wave V home exam. Assay results for aspartate aminotransferase (AST) and alanine aminotransferase (ALT) are available, as well as three semi-quantitative serum index assays – lipemia, hemolysis and icterus – to evaluate the possibility of interference with the AST or ALT assays. Moreover, two constructed measures are available - the AST/ALT ratio and its classification.

Variables: 10

Observations: 5,381

WaveVHepaticInjury.zip

Wave I - II: Family Structure Array

Nineteen constructed family structure array variables are available for Add Health respondents (Wave I or II) from birth to age at latest adolescent follow-up, with a maximum age of 18 years old. The family structure array variables can be used to construct measures such as the number of family structure changes experienced from birth to a given age (family instability), or the amount of time spent in different family structures.

The following citation should be used for papers and publications using this data: Gaydosh, L. & Harris, K.M. 2018. Family Instability and Young Adult Health. Journal of Health and Social Behavior, 59(2).

Variables: 19

Observations: 20,745

Family Structure

This file contains four variables, utilizing five, seven, eight, and fourteen categories, to describe the household parental structure at Wave I.

The following citation should be used for papers and publications using this data: Harris, K. M. 1999. The Health Status and Risk Behavior of Adolescents in Immigrant Families. In Hernandez, D.J. (Ed.), Children of Immigrants: Health, Adjustment, and Public Assistance (pp. 286-347). Washington, D.C.: National Academy Press.

Variables: 4

Observations: 20,745

Race

Included in this file are a single race variable and a multiracial variable constructed from the Wave I race questions.

The following citation should be used for papers and publications using this data: Udry, J.R., Li, R.M., & Hendrickson-Smith, J. 2003. Health and Behavior Risks of Adolescents with Mixed-Race Identity. American Journal of Public Health, 93(11), 1865-1870.

Variables: 1

Observations: 20,246

Wave IV Constructed Variables

This file contains constructed variables on stress, depression, mastery, personality, arrest history, sexual behavior, smoking, and substance abuse created by Wave IV collaborators.

Variables: 49

Observations: 15,701

Constructed SES Variables

These data include variables for social origins measured from information about their families collected from Add Health participants’ parents at the Wave I interview, social attainments measured from their occupations reported on the Wave IV survey, and neighborhood-level socioeconomic disadvantage using Census-tract-level data linked to Add Health participants’ addresses from Wave I and Wave IV.

Variables: 5

Observations: 20,745

Wave IV Constructed Current Relationship Status

This dataset contains variables that describe the current relationship each respondent had by Wave IV.

Variables: 3

Observations: 15,701

Wave V Constructed Age

The Wave V survey did not ask respondent age though it is possible to calculate age using interview date and date of birth. For deductive disclosure reasons complete interview dates and dates of birth were collected but not released. To provide calculated ages two variables have been constructed based on complete interview dates and date of birth month/year and 15 as a universal assigned day of birth.

Variables: 3

Observations: 12,300

WaveVConstructedAgeCodebook.pdf

Wave V Mover Distance

The dataset provides the distance, in meters, between each respondent's home residence in each of the previous waves and their Wave V residence.

Variables: 6

Observations: 12,300

WaveVMoverDistanceCodebook.pdf

Wave IV BMI Genetic Risk Score

This file contains the BMI genetic risk score for Add Health twin and full sibling respondents who provided saliva samples at Wave IV. N= 1,886

GRS_BMI.pdf

Wave IV Polygenic Scores - Release 2

Seventy-four constructed polygenic scores (PGSs) are available for Add Health respondents who provided archival saliva samples for genetic testing at Wave IV. Scores are available for various anthropomorphic, health, and behavioral outcomes. See the documentation for a list of all PGSs. N=9,130.

WaveIVPGSRelease2.zip

Polygenic Index Inventories

Polygenic scores computed by the SSGAC consortium for anthropometric traits, cognition/education, fertility/sexual development, health/health behaviors, and personality/well being. N=5,689.

PolygenicIndexInventories.zip

Polygenic Index Inventories - Release 2

This data file is a 2022 update of the polygenic scores computed by the SSGAC consortium for anthropometric traits, cognition/education, fertility/sexual development, health/health behaviors, and personality/well being. N=5,689

Polygenic_Index_Inventories_-_Release_2_Codebook.pdf

Wave IV Education Genetic Risk Score

An education genetic risk score (GRS_EDU) is available for Add Health twin and full sibling respondents who provided saliva samples at Wave IV. This variable is the weighted sum of risk alleles identified in the Rietveld et al. (2013) genome-wide association study. N= 1,886

GRS_EDU.pdf

Wave III DNA Data

Twin and full siblings interviewed at Wave III were asked to provide saliva samples for DNA analysis. This file contains the genotype values for DAT1 (dopamine transporter), DRD4 (dopamine receptor), and SLC6A4 (serotonin transporter), MAOA_V (monoamine oxidase A-uVNTR), DRD2 (dopamine D2 receptor), and CYP2A6 (cytochrome P450 2A6) from these samples. Also included are values for the following SNPs: rs2304297, rs892413, rs4950, rs13280604. N=2,574

W3DNA.pdf

Wave IV DNA Data

Contains genotyping results for all Wave IV respondents who agreed to provide a saliva sample for DNA testing. This dataset has values for DAT1 (dopamine transporter), DRD4 (dopamine receptor), MAOA (monoamine oxidase A-uVNTR), 5HTTLPR (serotonin transporter), HTTLPR La-Lg-S, triallelic activity bins for the serotonin transporter 5HTTLPR adjusted for rs25531, DRD2, s000005, s000006, DRD5, and MAOCA1. N=15,701

w4dna.pdf

Wave IV Polygenic Scores - Release 1

Thirty constructed polygenic scores (PGS) are available for Add Health respondents who provided archival saliva samples for genetic testing at Wave IV. Scores are available for coronary artery disease, myocardial infarction, plasma cortisol, LDL cholesterol, HDL cholesterol, total cholesterol, triglycerides, type II diabetes (2 measures), BMI, waist circumference, waist-to-hip ratio, height, age at menarche, age at menopause, number of children, age at first birth, ever/current smoker, number of cigarettes per day, extraversion, attention deficit disorder (2 measures), bipolar disorder, major depressive disorder (2 measures), schizophrenia, mental health cross disorder, Alzheimer’s disease, and educational attainment (2 measures). N = 9,129

Wave_IV_Polygenic_Scores_-_Release_1.zip

Wave IV Polygenic Scores – SSGAC

Polygenic scores (PGS) constructed by the Social Science Genetic Association Consortium (SSGAC) are available for Add Health respondents who provided archival saliva samples for genetic testing at Wave IV.  This data file contains educational attainment, cognitive performance, depression, neuroticism, and subjective well-being scores based on standard GWAS summary statistics and multilevel analysis (2 scores for each construct).  Additional multivariate analysis scores for highest level math taken and math ability are also included.  N=5,690

SSGACPGS.pdf

PGS Risk-Tolerance

Contains polygenic scores for general risk tolerance, adventurousness and risky behaviors in the driving, drinking, smoking and sexual domains. Polygenic scores were created for unrelated, Add Health participants of European Ancestry (n=4755). Score metrics were generated using Plink, LDPred or MTAG software using the UK Biobank GWAS study.

PGS_Risk_Tolerance_Codebook_and_Documentation.pdf

Wave I and II Disposition File

This file contains the types of data available for the Wave I respondents along with the outcome of the 16,706 respondents selected for Wave II. N= 20,745

National Death Index File

This file contains the underlying cause of death and days alive after Wave I interview. N=227

Wave III Disposition File

This file contains the final outcome of the 20,058 cases fielded at Wave III. N=20,058

Wave IV Disposition File

This file contains the final outcome of the 19,962 cases fielded at Wave IV. N=19,962

Wave III Academic Courses

These files contain academic status and/or performance indicators for math, science, foreign language, English, history, social sciences, physical education, and a combined overall category. N= 12,237

Wave III Academic Networks

The Network files provide information on social networks based on the respondents’ course-taking patterns. N varies by file

Wave III Context

School level contextual data are from the Common Core of Data (CCD), Private School Survey (PSS), the 1990 and 2000 Census, and the Office of Civil Rights. N varies by file

Wave III Course Level

The data in this file are needed for merging the course-level curriculum data with other Education Files. N=516,146

Wave III Curriculum

These math and science curriculum data are derived from coding the textbooks schools reported using for each course offered in these two subjects. N varies by file

Wave III Linking

This file contains variables designed to link transcript data to academic or school years and to Add Health. N= 12,237

Wave III Primary

The Primary Component contains several types of indicators based on information collected from participating schools and listed directly on student transcripts such as student exit or graduation status and materials gathered from schools during the data collection process. N varies by file

Wave III Transition

This file contains variables explaining the respondents’ movement through the educational system. N=12,217

Wave III Education Weights

Files contain weights for the education data along with the school weights needed for HLM analysis. N varies by file

Wave III Academic Transcript Social Studies and Civic Coursework Data

The Wave III ATRCVC data is a course-by-student-level file that includes academic transcript data related to social studies and civic coursework N = 93,651.

Wave III ASHA Call Data

To receive the results of their STD assays, Wave III respondents called an Add Health dedicated number at the American Social Health Association. This dataset provides information on who called the results hotline and the date and time of the call. N=4,279

Wave III BEM Scores Data

The masculinity and femininity raw and standard scores from the 30 item short form BEM Sex-Role Inventory are available in this file. N=15,197

Wave III Mentor Data

For Wave III respondents who reported having a mentor, the open-ended responses to the question “How did {HE/SHE} help you?” have been coded and are available in this file. N=15,197

Parents (2015-2017)

The parent data files contain social, demographic, behavioral, and health data collected in 2015-2017 on a probability sample of Add Health parents who were originally interviewed in 1995 and coincide with Wave V of Add Health. Data for 2,013 Wave I parents, connected to 2,244 Add Health respondents, are available. Additionally, 988 current spouse/partner interviews are available. These data can be linked with Wave I parent data, and corresponding Add Health respondents at Waves I – V. Includes weight files.

Parents_2015-2017.zip

Parents (2015-2017) – Family Health History

These two files contain data from the Wave I parent and spouse/partner who returned the Parents Phase 2 family health history leave-behind forms. fhhp2: N=2,247 / fhhsp2: N=1,118.

Parents_2015-2017_Family_Health_History.zip

Parents (2015-2017) – Disposition File

This file contains the final disposition codes for Wave I parents selected for the Parents Phase 2 interview. N = 20,745.

dspp2.pdf

Parents (2015-2017) – National Death Index File

This file contains the underlying cause of death and days alive after the Wave I interview for Wave I parents selected for the Parents Phase 2 interview who are coded as deceased in the disposition file. N = 226

ndip2.pdf

Parents (2015-2017) – Medications Files

Therapeutic classifications for medications data collected (2015-17) from Wave I Parent and Spouse or Parent.

Parents_2015-2017_Medication_Files.zip

Wave I In-School Friendship Nominations

Identification numbers of the friends that the respondent nominated during the in-school interview. N=90,118

Wave I In-Home Friendship Nominations

Identification numbers of the friends that the respondent nominated during the Wave I in-home interview. N=20,745

Wave II In-Home Friendship Nominations

Identification numbers of the friends that the respondent nominated during the Wave II in-home interview. N=14,738

Wave III Friend IDs

Add Health respondents who were in the 7th or 8th grade at Wave I were asked at Wave III to identify, from a list of 10 computer-generated names, which ones were current friends or which ones were their friends when they were in school together. This dataset contains the altered identification numbers (AIDs) of the 10 computer-generated names. N=3,572

dbGaP Linking File (GID_link.xpt)

The dbGaP Linking File contains the ID crosswalk to link the Add Health dbGaP genotype data to the restricted-use phenotype data for 9,974 Wave IV respondents.

The following items are required before this request can be approved. Please submit, as necessary, the documents listed below.

  1. An approved Add Health Restricted-Use Data Contract.
  2. The Add Health Linkage File Agreement.
  3. An approved dbGaP Add Health Data Use Certification Agreement.
  4. The NIH-approved dbGaP NIH Data Access Request.

Wave IV dbGaP GWAS Sample Weight

A weight component for the dbGaP GWAS Sample. N=9,975

Wave_IV_dbGaP_GWAS_Sample_Weights.zip

Wave I Climate File

This file contains the climate data for Wave I respondents based on the nearest climate station. Information is available on precipitation, total snowfall, sky cover, temperature, and total hours of sunshine. N=20,745

Wave_I_and_III_Climate_Files.zip

Wave I Street Connectivity File

These files contain road network connectivity measures within 1, 3, 5, and 8.05 km (5 miles) of the Wave I respondent locations. N= 20,745

Wave_I_and_III_Street_Connectivity_Files.zip

Wave I Crime File

The county-level crime data in these files are based on the Wave I respondent locations. N= 20,745

Wave_I_and_III_Crime_Files.zip

Wave I Geocode Source

The data source of the Wave I respondent residential geocodes (latitude and longitude) are provided in these files. N= 20,745

Wave_I_and_III_Geocode_Source.zip

Wave I Land Cover Data

These files contain land cover metrics within 1, 3, 5, and 8.05 km (5 miles) of Wave I respondent locations. N= 20,745

Wave_I_and_III_Land_Cover_Data.zip

Wave I Parks Data

The counts of public parks within a Euclidean distance of 1, 3, 5, and 8.05 kilometers (5 miles) of each respondent at Wave I are in these files. N=20,745

Wave_I_and_III_Parks_Data.zip

Wave I Resources Data

These Add Health files provide data on the presence of various physical activity (PA) resources situated near respondent residences at Wave I. N=20,745

Wave I Urban Distances

Contains Euclidean distances to both 1990 and 2000 U.S. Census Urbanized Areas (UAs) for each Wave I respondent. N=20,745

Wave_I_and_III_Urban_Distances.zip

Wave I Weather Data

This file contains weather data for Wave I respondents based on the nearest weather station reporting data for the corresponding survey month and year. N=20,745

Wave_I_and_III_Weather_Data.zip

Wave I Population Density Data

The Wave I population density file contains the proportion of 1990 U.S. Census block group population and are (in square meters) within 1, 3, 5, and 8.04672 km (5 mi) of each Wave I respondent. N=20,745

Wave_I_and_III_Population_Density_Data.zip

Wave I School Distance Measures

This file contains the distance between the geocoded point locations of each respondent’s Wave I location and that respondent’s school. N=20,745

w1schdis.pdf

Wave I ACCRA Data

These data files report ACCRA Cost of Living Index data for Wave I based on respondent location and the year and quarter of the Add Health interview. N=20,745

Wave_I_and_III_ACCRA_Data.zip

Wave I Employment Files

Certain county-level employment data from the U.S. Bureau of Labor Statistics are attached to Wave I respondent locations. N=20,745

Wave_I_and_III_Employment_Files.zip

Wave I Length of Day

These data files contain the number of hours of daylight at each Wave I respondent location based on that respondent’s latitude and survey date. N=20,745

Wave_I_and_III_Length_of_Day.zip

Wave I Road Type Length Data

Road type length calculations within radii of 1, 3, 5, and 8.05 kilometers (5 miles) of Wave I respondent locations. N=20,745

Wave_I_and_III_Road_Type_Length_Data.zip

Wave I Rural-Urban Commuting Area (RUCA) Codes

These data files define the rural-urban commuting characteristics of Wave I respondent locations at the U.S. Census tract-level using the 1990 and 2000 RUCA codes developed by the U.S. Department of Agriculture’s Economic Research Service. N=20,745

Wave_I_and_III_RUCA_Codes.zip

Wave I Grouping File

This file is for use with the Obesity and Neighborhood Environment (ONE) data only.  It is based on recoded Wave I addresses and contains pseudo state, county, tract, and block group variables in FIPS code format that allow for grouping of Add Health respondents geographically. N: 20,745

w1grpone.pdf

Wave III Street Connectivity Files

These files contain road network connectivity measures within 1, 3, 5, and 8.05 km (5 miles) of the Wave III respondent locations. N= 15,197

Wave_I_and_III_Street_Connectivity_Files.zip

Wave III Crime File

The county-level crime data in these files are based on the Wave III respondent locations. N= 15,197

Wave_I_and_III_Crime_Files.zip

Wave III Geocode Source

The data source of the Wave III respondent residential geocodes (latitude and longitude) are provided in these files. N= 15,197

Wave_I_and_III_Geocode_Source.zip

Wave III Land Cover Data

These files contain land cover metrics within 1, 3, 5, and 8.05 km (5 miles) of Wave III respondent locations. N= 15,197

Wave_I_and_III_Land_Cover_Data.zip

Wave III Parks Data

The counts of public parks within a Euclidean distance of 1, 3, 5, and 8.05 kilometers (5 miles) of each respondent at Wave III are in these files. N= 15,197

Wave_I_and_III_Parks_Data.zip

Wave III Resources Data

These Add Health files provide data on the presence of various physical activity (PA) resources situated near respondent residences at Wave III. N=15,197

Wave III Urban Distances

Contains the Euclidean distance to 2000 U.S. Census Bureau-defined urbanized areas (UAs) for each Wave III respondent. N= 15,197

Wave_I_and_III_Urban_Distances.zip

Wave III Weather Data

This file contains weather data for Wave III respondents based on the nearest weather station reporting data for the corresponding survey month and year. N=15,197

Wave_I_and_III_Weather_Data.zip

Wave III Population Density Data

The Wave III population density file contains the proportion of 2000 U.S. Census block group population and area (in square meters) within 1, 3, 5, and 8.04672 km (5 mi) of each Wave III respondent. N=15,197

Wave_I_and_III_Population_Density_Data.zip

Wave III Mobility Data

Reports the distance between each respondent’s geocoded point location for each survey wave and that respondent’s school location, along with the respondent’s move distance between each survey wave. N=20,745

w3mobind.pdf

Wave III MSA Pseudo Codes

The MSA pseudo code created for each respondent’s Wave III location is in this file. N=15,197

w3MSA.pdf

Wave III ACCRA Data

These data files report ACCRA Cost of Living Index data for Wave III based on respondent location and the year and quarter of the Add Health interview. N=15,197

Wave_I_and_III_ACCRA_Data.zip

Wave III Employment Files

Certain county-level employment data from the U.S. Bureau of Labor Statistics are attached to Wave III respondent locations. N=15,197

Wave_I_and_III_Employment_Files.zip

Wave III Length of Day

These data files contain the number of hours of daylight at each Wave III respondent location based on that respondent’s latitude and survey date. N=15,197

Wave_I_and_III_Length_of_Day.zip

Wave III Road Type Length Data

Road type length calculations within radii of 1, 3, 5, and 8.05 kilometers (5 miles) of Wave III respondent locations. N=15,197

Wave_I_and_III_Road_Type_Length_Data.zip

Wave III Rural-Urban Commuting Area (RUCA) Codes

These data files define the rural-urban commuting characteristics of Wave III respondent locations at the U.S. Census tract-level using the 1990 and 2000 RUCA codes developed by the U.S. Department of Agriculture’s Economic Research Service. N=15,197

Wave_I_and_III_RUCA_Codes.zip

Wave III Climate Files

This file contains the climate data for Wave III respondents based on the nearest climate station. Information is available on precipitation, total snowfall, sky cover, temperature, and total hours of sunshine. N=15,197

Wave_I_and_III_Climate_Files.zip

Wave I In-Home Weight Components

A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave I. N= 20,745

In-School Weight Components

A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights. N= 90,118

Wave II In-Home Weight Components

A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave II. N= 14,738

Wave III In-Home Weight Components

A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave III. N=15,197

Wave IV Weight Components

A weight component for each level of sampling (school and adolescents) has been created for each wave of data collection. This file contains the weight components needed for computing multilevel weights for Wave IV. N= 15,701

Add Health School Weights

The initial weights for the school are in this file. N=132

Adolescent Pairs Data

Information that links and describes the sibling pairs. N=3,139

Wave III Sibling IDs

At Wave III, Add Health respondents were asked questions about their siblings who also participated in the Wave I or II in-home interviews. This dataset contains the AIDs for these siblings. N= 4,368

Waves I and II Romantic/Friend Nominations

These files contain the identification numbers to link the Wave I and II romantic, non-romantic, and friend data from the Wave I and II in-home interview. N varies by file.

Friend_Nominations.zip

Wave III Partner In-home

In Wave III, 1,507 partners of Add Health respondents were interviewed. These files contain the partner data for the Wave III in-home interview, AHPVT, field interviewer characteristics, and links to the Add Health respondents. N= 1,507

W3_In-home_interview-partner.zip

Wave III Partner ASHA Call

To receive the results of their STD assays, partners of Wave III respondents called an Add Health dedicated number at the American Social Health Association. This file provides information on who called the results hotline and the date and time of the call. N=332

callashap.pdf

Wave III Partner Urinalysis

This file contains nitrate, specific gravity, pH level, white blood cells, protein, glucose, ketone, urobilinogen, bilirubin, and blood values from the Wave III partner urine specimens. N=1,507

urinep.pdf

Wave III Cotinine Assay Data

This file contains the cotinine and 3-hydroxycotinine assay values for 963 Wave III respondents. N=963

Cotinine.pdf

Wave III HPV-MGEN Assay Data

Assay results for human papillomavirus and mycoplasma genitalium are available for a subset of the Wave III respondents who provided a urine sample. N=5,126

hpvmgen.pdf

Wave III HPV-MGEN Assay Weights

Sample weights for respondents with HPV and MGEN assay results are in this file. N=14,322

WHPVMGEN.pdf

Wave III Urinalysis Data

This file contains nitrate, specific gravity, pH level, white blood cells, protein, glucose, ketone, urobilinogen, bilirubin, microalbumin, urine creatinine, and blood values from the Wave III urine specimens. N= 15,197

URINE.pdf

Wave I School Network Data

Network variables constructed from the in-school questionnaire data and friendship nominations. N=75,871

HUD-Assisted Housing Supplementary Data

The supplementary datafile identifies Add Health respondents who lived in HUD-assisted housing at any point between 1995 and 2017. For these Add Health respondents, the supplementary datafile provides unique Add Health respondent identifiers (AID) and household-level information about the characteristics of their HUD housing residence. The supplementary datafile is a hierarchical (i.e., long-format data file), with each row representing a unique HUD administrative record that is linked to an AID. In total, the hierarchical file includes a total of 8,587 HUD records on 1,159 unique Add Health respondents identified through the linkage. N=8,587

Due to the sensitive nature of the HUD-Assisted Housing Supplementary data, approved users can only access the data on the UNC Secure Remote Workspace (UNC SRW) server for a limited timeframe (1 year).

The following items are required before this request can be approved. Please submit, as necessary, the documents listed below.

  1. Data Analysis Plan (maximum one page; indicate time for completion)
  2. Completed Affiliate Form (will be provided upon receipt of this order form)
  3. Remote Access Form

HUD-Assisted_Housing_Supplementary_Data.zip

Wave I State Demographic Characteristics, Exclusionary Indices, and Inclusionary Indices

These data provide measures of punishment regime variation in state-based policies, practices, and programs, both in their punitive and non-punitive forms, and some additional state demographic control variables. These data were gathered to use with Add Health for multilevel analyses. N=20,745

WaveIStateCharacteristics_ExclusionaryIndices_andInclusionaryIndices.zip

Contextual Wave IV Database

The data provide an important update to the contextual variables already available in Wave IV by including information at the state- and county-levels. Additionally, there are a few new variables in the present database that are absent from other waves. N=15,701

ContextualWaveIVDatabase.zip

Contextual Wave V Database

This contextual database further expands the extensive contextual data currently available to users of the National Longitudinal Study of Adolescent to Adult Health (Add Health) through the provision of numerous measures reported by the U.S. Census Bureau’s American Community Survey (ACS), rural-urban commuting area codes, U.S. Climate Atlas, and Uniform Crime Report. N=12,300

ContextualWaveVDatabase.zip

Wave V County Health and Mobility Data

The Wave V County Health and Mobility database summarizes the socioeconomic, health, and mobility characteristics of the environments in which Add Health participants were living at the time of their Wave V interview. County-level data describe (1) levels of and trends in chronic disease (hypertension, type-2 diabetes) and health risk behaviors (obesity, smoking, alcohol use); and (2) economic opportunity and inequality. N=12,300

WaveVCountyHealthandMobilityData.zip

Wave V - ACA Medicaid Expansion Data

This data file provides the year in which states expanded Medicaid under the Affordable Care Act. These dates are attached to the location in which Add Health participants were living at Wave V. N=12,300

WaveV-ACAMedicaidExpansionData.zip

Waves I, IV, and V - The Opportunity Atlas: Mapping the Childhood Roots of Social Mobility Data

This file enhances the existing Add Health contextual database through the addition of measures essential to understanding the determinants and sequelae of socioeconomic mobility. Specifically, it aims to characterize the socioeconomic mobility of Add Health participants at Wave I, IV, and V. WI N=20,745 WIV N=15,701 WV N=12,300

WavesI_IV__V-TheOpportunityAtlas.zip

Wave IV City Crime Rate Data

This Crime Rate Data file facilitates research examining the impact of community violence on the health trajectories of Wave IV Add Health participants by providing police department crime data from 13 U.S. cities with high crime rates. N=15,701

WaveIVCityCrimeRateData.zip

Wave V City Crime Rate Data

This Crime Rate Data file facilitates research examining the impact of community violence on the health trajectories of Wave V Add Health participants by providing police department crime data from 13 U.S. cities with high crime rates. N=12,300

WaveVCityCrimeRateData.zip

Historical Neighborhood Redlining

This contextual database allows researchers to identify potential long-term consequences of redlining for contemporary inequities in neighborhood environments, and individual health and socioeconomic attainment over the life course. N=20,706

Historical_Neighborhood_Redlining.zip

Waves III-V Multi-year Air Pollution Exposure Estimates

The air pollution data described here provide longer-term estimates of air pollution exposure that can be used to address a broad range of research questions related to how air pollution exposure over time may relate to a variety of health outcomes. N=20,745

Waves_III-V_Multi-year_Air_Pollution_Exposure_Estimates.zip

Wave I & II School Desegregation Disparities

This file contains data on the levels of school racial segregation experienced by Add Health respondents during their school-age years, related school district characteristics, and measures of tract-level residential segregation present in adulthood (Waves III-V). N=84,166

Wave_I___II_School_Desegregation_Disparities.zip

Wave I & II School District Grouping Data

To facilitate clustering by school district, the school district identifiers comprising this file are based on the Local education agency identification numbers (LEAID) of the school districts in which the Wave I school, Wave I residence, and Wave II residence were situated. The first two characters of this LEAID represent state and reflect state codes assigned by Add Health in other disseminated data similarly intended for clustering at different geographic areas. N=84,166

Wave_I___II_School_District_Grouping_Data_Codebook.pdf

Wave I Contextual Data

Community contextual variables based on state, county, tract, and block group levels derived from the Wave I addresses. N=20,745

Wave I Spatial Analysis Data

Pseudo coordinates that can be used to calculate distances between friends in a school community. N=20,301

Wave I Neighborhood Data

Pseudo state, county, tract, and block group variables that allow grouping of Add Health respondents geographically (based on Wave I addresses). N=20,745

Wave II Contextual Data

Community contextual variables based on state, county, tract, and block group levels derived from the Wave II addresses. N=14,738

Wave II Neighborhood Data

Pseudo state, county, tract, and block group variables that allow grouping of Add Health respondents geographically (based on Wave II addresses). N=14,738

Wave III Contextual Data

Community contextual variables based on state, county, tract, and block group levels derived from the Wave III addresses. N=15,197

Wave III Sex Ratio

In this file are constructed variables at the county-level for sex ratios for males to females ages 18 to 29, 30-34, and 18 to 34, the proportion of males and females ages 18 to 34, and the sex ratio of employed males 16 years and older to the total number of females 16 years and older for the white population and the black or African American population. N= 15,197

Wave III Grouping File Data

Pseudo state, county, tract, and block group variables in FIPS code format that allow grouping of Add Health respondents geographically (based on Wave III addresses).N=15,197

Wave III Region

This file contains the Census region codes for the respondents’ Wave III residential locations. N=15,197

Wave III Supplemental Tract-Level Contextual Data

This file contains supplemental Wave III contextual data that include transportation and commuting measures, climate descriptors, amenities, and state-level tobacco control influences. These variables are available at the census tract-level unless otherwise specified. N= 15,197

Wave IV Region

This file contains the Census region codes for the respondents’ Wave IV residential locations. N=15,197

Wave IV Grouping Data

The pseudo FIPS codes in this file allow you to geographically group respondents by their Wave IV locations. N=15,701

Wave IV Supplemental Tract-Level Contextual Data

This file contains tract-level measures, based on the Wave IV respondent locations, reported by the U.S. Census Bureau's 2009 American Community Survey (ACS), the Climate Atlas of the United States, the USDA Economics Research Service, Esri Data and Maps, ImpacTeen Tobacco Control policy and Prevalence Data, and the Uniform Crime Reports. When tract level measures were not available or appropriate, state and county level variables were used. N=15,701

Wave I Political Context Data

The Add Health Political Context files provide an array of measures that describe the political environments in which Add Health respondents reside. These contextual variables include measures of commuting, election results for gubernatorial, presidential, and senatorial races, and voter registration law. N= 20,745

Wave II Political Context Data

The Add Health Political Context files provide an array of measures that describe the political environments in which Add Health respondents reside. These contextual variables include measures of commuting, election results for gubernatorial, presidential, and senatorial races, and voter registration law. N= 14,738

Wave III Political Context Data

The Add Health Political Context files provide an array of measures that describe the political environments in which Add Health respondents reside. These contextual variables include measures of commuting, election results for gubernatorial, presidential, and senatorial races, and voter registration law. N=15,197

Wave III Alcohol Outlet Density Data

This Add Health data file measures the prevalence of alcohol outlets in respondent communities by reporting the tract-level density of establishments possessing on- and/or off-premise alcohol licenses. N=15,197

Wave IV Modified Retail Food Environment (mRFEI) Data

The Wave IV mRFEI file includes the mRFEI for each respondent based on their Wave IV residential location (n = 15,701).

Wave IV Ambient Air Pollutants Data

These files include 365 daily exposure estimates of the following ambient air pollutants for each Add Health study participant in Wave IV:

  • Particulate Matter:  PM2.5, PM10, SO4, NO3, NH4, organic carbon (OC), elemental carbon (EC), Fe, Al, Si, Ti, Ca, Mg, K, Mn, Na, and Cl
  • Gases:  O3, NO2, HNO3, HONO, H2O2, CO, and SO2; 2007 only: acrolein, acetaldehyde, benzene, butadiene, ethanol, formaldehyde, and naphthalene

Wave III & IV Sexual Minority Policy Data

This database includes indicators of the policy context for sexual minorities (a population typically defined based on co-residence with a same-sex partner or self-identification as gay, lesbian, or bisexual) in Waves III & IV of Add Health. These data will enable researchers to examine how state-level policies in young adulthood are associated with a wide range of indicators of health and well-being among sexual minorities. Wave III - N: 15,197 and Wave IV – N: 15,701

Wave IV College Characteristics Data

At Wave IV of the Add Health survey, respondents were asked if they had received a bachelor’s degree. This is a degree-level file. College-level data were linked to each degree based on the institution from which the respondent reported receiving each degree. N:15,810.

Wave III & IV College Mobility Data

These data were collected from a sample of college students who were born between 1980 and 1982 and who attended a college or university in the early 2000’s. At Wave III of the Add Health survey, respondents were asked if they were currently enrolled in a postsecondary institution.  At Wave IV of the Add Health survey, respondents were asked if they had received a bachelor’s degree. Respondents who answered in the affirmative were then asked to report the institution from which they were currently enrolled or received this degree. Wave III - N: 15,197 and Wave IV - N: 15,810

Wave I & IV County Health, Mobility, and Tobacco Tax Data

The Wave I & IV County Health and Mobility database summarizes the socioeconomic, health, and mobility characteristics of the environments in which Add Health participants were living at the time of their Wave I & IV interview. These variables are available at the county or state level. Wave I - N: 20,745 and Wave IV - N: 15,701

Wave II & III Tobacco Tax Data

Wave II & III Tobacco Tax Data file supplements the County Health and Mobility database available for Waves I & IV. Comprehensively, these data files provide tobacco tax for Add Health respondent state of residence from Wave I to Wave IV. Wave II - N: 14,738 and Wave III - N: 15,197

Wave I, III & IV Sunset Data

These data present average sunset time over the course of the year for the respondent’s Census block group. Users can input latitude and longitude coordinates and time zone information to retrieve the sunset time (along with other solar and lunar information) for any date between 1700 and 2100. . Wave I - N: 20,745, Wave III – N=15,197 and Wave IV - N: 15,701

Wave I, II, III, IV & V Grouping Data

These location identifiers are based on 2010 Census geographic boundaries and are longitudinally consistent across all waves. Previous Add Health waves released location identifiers based on the most recent Census for that wave, and are therefore not comparable through time. These identifiers are pseudo FIPS codes that were randomly generated from the actual Census block group FIPS codes.

Wave V Contextual Despair

This contextual data set focuses on the social, political, and resource environment of Add Health respondents at the tract, county, and state level that are relevant to the prevailing causes of death in midlife - namely alcohol-related diseases, drug overdoses and accidental poisonings, and suicide and self-inflicted harm. Most measures are specific to Wave V residential location, though several measures span multiple waves. Measures include the sociodemographic and segregation context, proximity to firearms distributors and alcohol outlets, opioid dispensing, and policies related to alcohol, drugs, and firearms. N=20745, v=266

Wave_V_Contextual_Despair_Codebook.pdf

Wave_V_Contextual_Despair_User_Guide.pdf

Contextual Heterosexism Database, Phase 1

This contextual database, Contextual Heterosexism Database-Phase 1 (CHD1), further expands the collection of contextual data available to users of The National Longitudinal Study of Adolescent to Adult Health (Add Health) through the provision of state, county, and tract level measures from the Decennial Census of Population and Housing, American Community Survey (ACS), the Movement Advancement Project (MAP), Lax and Phillips (2009), Public Religion Research Institute (PRRI), Cooperative Election Study (CES), U.S. Religion Census, and Massachusetts Institute of Technology (MIT) Election Lab. These data include indicators of social policies, social climate, and confounding factors related to the study/measurement of structural heterosexism that correspond to Waves 3, 4, and 5. Some of these indicators are new to the Add Health contextual database and others were previously not available at all three of these waves.

Contextual_Heterosexism_User_Guide_Phase_1.7z

Contextual_Heterosexism_Database__Phase_1_Codebook.7z

Day-Night Level (DNL) Noise Exposures from 90 Major Airports

This data contains the estimation of aircraft noise measures around ninety major airports and aircraft noise proxies for approximately 900 additional airports. Merged with geopositioned/geocoded Add Health respondent locations over Waves I-VI, it also documents how the aircraft noise source data were acquired, as well as the protocol for quality controlling their assignment across waves. N=83,357

Aircraft_Noise_Measures_User_Guide.7z

Day-Night Level (DNL) Noise Exposures Codebook.7z

Equivalent Sound Level for a 15-Hour Day (LAEQD) Noise Exposures from 90 Major Airports

This data contains the estimation of aircraft noise measures around ninety major airports and aircraft noise proxies for approximately 900 additional airports. Merged with geopositioned/geocoded Add Health respondent locations over Waves I-VI, it also documents how the aircraft noise source data were acquired, as well as the protocol for quality controlling their assignment across waves. N=73,174

Aircraft_Noise_Measures_User_Guide.7z

Equivalent Sound Level for a 15-Hour Day (LAEQD) Codebook.7z

Equivalent Sound Level for a 9-Hour Night (LAEQN) Noise Exposures from 90 Major Airports

This data contains the estimation of aircraft noise measures around ninety major airports and aircraft noise proxies for approximately 900 additional airports. Merged with geopositioned/geocoded Add Health respondent locations over Waves I-VI, it also documents how the aircraft noise source data were acquired, as well as the protocol for quality controlling their assignment across waves. N=57,886

Aircraft_Noise_Measures_User_Guide.7z

Equivalent Sound Level for a 9-Hour Night (LAEQN) Codebook.7z

Proxies for Aircraft Noise from Other Airports: Airport Counts

This data contains the estimation of aircraft noise measures around ninety major airports and aircraft noise proxies for approximately 900 additional airports. Merged with geopositioned/geocoded Add Health respondent locations over Waves I-VI, it also documents how the aircraft noise source data were acquired, as well as the protocol for quality controlling their assignment across waves. N=166,195

Aircraft_Noise_Measures_User_Guide.7z

Proxies for Aircraft Noise from Other Airports- Airport Counts Codebook.7z

Proxies for Aircraft Noise from Other Airports: Mean Distances

This data contains the estimation of aircraft noise measures around ninety major airports and aircraft noise proxies for approximately 900 additional airports. Merged with geopositioned/geocoded Add Health respondent locations over Waves I-VI, it also documents how the aircraft noise source data were acquired, as well as the protocol for quality controlling their assignment across waves. N=256,318

Aircraft_Noise_Measures_User_Guide.7z

Proxies for Aircraft Noise from Other Airports- Mean Distances Codebook.7z

Proxies for Aircraft Noise from Other Airports: Mean Total Enplanements

This data contains the estimation of aircraft noise measures around ninety major airports and aircraft noise proxies for approximately 900 additional airports. Merged with geopositioned/geocoded Add Health respondent locations over Waves I-VI, it also documents how the aircraft noise source data were acquired, as well as the protocol for quality controlling their assignment across waves. N=250,740

Aircraft_Noise_Measures_User_Guide.7z

Proxies for Aircraft Noise from Other Airports- Mean Total Enplanements Codebook.7z

Rural-Urban Commuting Area (RUCA) Codes

Rural-urban commuting area (RUCA) codes classify U.S. census tracts using measures of population density, urbanization, and daily commuting. The data file including them is based on RUCA codes for census years 1990, 2000, and 2010. The rationale for and utility of acquiring RUCA codes, assigning them to census geographies in which Add Health respondents have resided over three decades. N=97,700

RUCA_Codes_User_Guide.7z

RUCA_Codes_Codebook.7z

Birth Records

Birth record data was collected from participating states for AHSM birth years, 1974-83. When these states provided birth data for all recorded births occurring during that time interval, an AHSM-specific subset was created using Link Plus, a statistical linkage software developed by the U.S. Centers for Disease Control and Prevention (CDC), Cancer Division. One participating state performed its own AHSM linkages and provided Add Health with the linked subset of births. Add Health then performed transformations on all of the original data from the participating states to create the categorical variables present in this release. N=2,750, 15 variables

Birth_Records_Database_Codebook.7z

Birth_Records_Database_User_Guide.7z

Sexual Orientation/Gender Identity, Socioeconomic Status, and Health across the Life Course (SOGI-SES)

This file contains new survey data to support exploration of the relationships among sexual orientation, gender identity, gender expression, romantic and sexual behaviors, socioeconomic status, and health. It contains social, demographic, behavioral, and health data collected in 2020-2021 on a sample of Add Health Wave V participants. N=2,614

SOGI-SES_Codebook.pdf

Sexual Orientation/Gender Identity, Socioeconomic Status, and Health across the Life Course (SOGI-SES) - Sensitive

This file contains the SOGI-SES study's sensitive data variables related to gender identity, in vitro fertilization, and HIV status. N=2,614

SOGI-SES_Sensitive_Codebook.pdf

Individual Vital Status and Underlying Cause of Death File, 2019

This file contains one record for each of the 20,745 Add Health sample members from Wave I. It provides the vital status of each sample member as well as the National Death Index-provided underlying cause of death code in ICD-10 format for each decedent. The month and year of the most recent Add Health interview are provided for living sample members, while the month and year of death are provided for decedents.

Variables: 12

Observations: 20,745

Individual_Vital_Status_and_Underlying_Cause_of_Death_File__2019.pdf

Ordered Cause of Death File, 2019

This file contains entity- and record-axis codes reported by the National Death Index (NDI) for each decedent in the Add Health sample. The file is arranged hierarchically, by axis code; therefore, each decedent may have multiple records depending on the maximum number of entity- and record-axis codes recorded by NDI. The sequence of the decedent’s records reflects the order in which the entity- and record-axis codes were reported in the NDI record.

Variables: 7

Observations: 1,745

Ordered_Cause_of_Death_File__2019.pdf

All Coded Causes of Death File, Including Entity-Axis Codes, 2019

This file contains all underlying cause of death and entity-axis codes appearing in the National Death Index (NDI) source file. Functioning as dummy variables, zero represents the absence of a code on the decedent’s death certificate, while one denotes the presence of one.

Variables: 88

Observations: 540

All_Coded_Causes_of_Death_File__Including_Entity-Axis_Codes__2019.pdf

All Coded Causes of Death File, Including Record-Axis Codes, 2019

This file contains all underlying cause of death and record-axis codes appearing in the National Death Index (NDI) source file. Functioning as dummy variables, zero represents the absence of a code on the decedent’s death certificate, while one denotes the presence of one.

Variables: 83

Observations=540

All_Coded_Causes_of_Death_File__Including_Record-Axis_Codes__2019.pdf

Individual Vital Status and Underlying Cause of Death File, 2021

This file contains one record for each of the 20,745 Add Health sample members from Wave I. It provides the vital status of each sample member through 2021 as well as the National Death Index-provided underlying cause of death code in ICD-10 format for each decedent. The month and year of the most recent Add Health interview are provided for living sample members, while the month and year of death are provided for decedents. N=20,745

Individual_Vital_Status_and_Underlying_Cause_of Death_File_2021_Codebook.pdf

Ordered Cause of Death File, 2021

This file contains entity- and record-axis codes reported by the National Death Index (NDI) for each decedent in the Add Health sample through 2021. The file is arranged hierarchically, by axis code; therefore, each decedent may have multiple records depending on the maximum number of entity- and record-axis codes recorded by NDI. The sequence of the decedent’s records reflects the order in which the entity- and record-axis codes were reported in the NDI record. N=2,123

Ordered_Cause_of_Death_File_2021_Codebook.pdf

All Coded Causes of Death File, Including Entity-Axis Codes, 2021

This file contains all underlying cause of death and entity-axis codes appearing in the National Death Index (NDI) source file through 2021. Functioning as dummy variables, zero represents the absence of a code on the decedent’s death certificate, while one denotes the presence of one. N=647

All_Coded_Causes_of_Death_File_Including_Entity-Axis_Codes_2021_Codebook.pdf

All Coded Causes of Death File, Including Record-Axis Codes, 2021

This file contains all underlying cause of death and record-axis codes appearing in the National Death Index (NDI) source file through 2021. Functioning as dummy variables, zero represents the absence of a code on the decedent’s death certificate, while one denotes the presence of one. N=647

All_Coded_Causes_of_Death_File_Including_Record-Axis_Codes_2021_Codebook.pdf

Mortality_Outcomes_Surveillance_Part_I_Ascertaining_Decedents_User_Guide_(2021_Update).pdf

Individual Vital Status and Underlying Cause of Death File, 2022

This file contains one record for each of the 20,745 Add Health sample members from Wave I. It provides the vital status of each sample member through 2022 as well as the National Death Index-provided underlying cause of death code in ICD-10 format for each decedent. The month and year of the most recent Add Health interview are provided for living sample members, while the month and year of death are provided for decedents. N=20,745

Mortality_Outcomes_Surveillance_Part_I_Ascertaining_Decedents_User_Guide__2022_Update_.7z

Underlying_Cause_of_Death_File__2022.7z

Ordered Cause of Death File, 2022

This file contains entity- and record-axis codes reported by the National Death Index (NDI) for each decedent in the Add Health sample through 2022. The file is arranged hierarchically, by axis code; therefore, each decedent may have multiple records depending on the maximum number of entity- and record-axis codes recorded by NDI. The sequence of the decedent’s records reflects the order in which the entity- and record-axis codes were reported in the NDI record. N=2,377

Mortality_Outcomes_Surveillance_Part_I_Ascertaining_Decedents_User_Guide__2022_Update_.7z

Ordered_Cause_of_Death_File__2022.7z

All Coded Causes of Death File, Including Entity-Axis Codes, 2022

This file contains all underlying cause of death and entity-axis codes appearing in the National Death Index (NDI) source file through 2022. Functioning as dummy variables, zero represents the absence of a code on the decedent’s death certificate, while one denotes the presence of one. N=706

Mortality_Outcomes_Surveillance_Part_I_Ascertaining_Decedents_User_Guide__2022_Update_.7z

All_Coded_Causes_of_Death_File__Including_Entity-Axis_Codes__2022.7z

All Coded Causes of Death File, Including Record-Axis Codes, 2022

This file contains all underlying cause of death and record-axis codes appearing in the National Death Index (NDI) source file through 2022. Functioning as dummy variables, zero represents the absence of a code on the decedent’s death certificate, while one denotes the presence of one. N=706

Mortality_Outcomes_Surveillance_Part_I_Ascertaining_Decedents_User_Guide__2022_Update_.7z

All_Coded_Causes_of_Death_File__Including_Record-Axis_Codes__2022.7z