DEFINICIÓN NOMINAL
II. MARCO TEÓRICO
2.3 MARCO TEÓRICO REFERENCIAL
The accuracy of survey estimates can be affected by nonresponse, coding errors, computer processing errors, errors in the sampling frame, reporting errors, and other errors not due to sampling. These types of "nonsampling errors" and their impact are reduced through data editing, statistical adjustments for nonresponse, close monitoring and periodic retraining of interviewers, and improvement in quality control procedures.
Although these types of errors often can be much larger than sampling errors,
measurement of most of these errors is difficult. However, some indication of the effects of some types of these errors can be obtained through proxy measures, such as response rates, and from other research studies.
B.3.1 Screening and Interview Response Rate Patterns
In 2013, respondents continued to receive a $30 incentive in an effort to maximize response rates. The weighted screening response rate (SRR) is defined as the weighted number of successfully screened households13 divided by the weighted number of eligible households (as defined in Table B.3), or
,
where is the inverse of the unconditional probability of selection for the household and excludes all adjustments for nonresponse and poststratification defined in Section A.3.3 of Appendix A. Of the 190,067 eligible households sampled for the 2013 NSDUH, 160,325 were screened successfully, for a weighted screening response rate of 83.9 percent (Table B.3). At the person level, the weighted interview response rate (IRR) is defined as the weighted number of respondents divided by the weighted number of selected persons (see Table B.4), or
,
where is the inverse of the probability of selection for the person and includes household- level nonresponse and poststratification adjustments (adjustments 1, 2, and 3 in Section A.3.3 of Appendix A). To be considered a completed interview, a respondent must provide enough data to pass the usable case rule.14 In the 160,325 screened households, a total of 88,742 sample persons were selected, and completed interviews were obtained from 67,838 of these sample persons, for a weighted IRR of 71.7 percent (Table B.4). A total of 15,717 sample persons (20.9 percent) were classified as refusals or parental refusals, 2,622 (3.0 percent) were not available or never at 13 hh hh hh hh w complete SRR w eligible
hh w i i i i w complete IRR w selected
i whome, and 2,565 (4.4 percent) did not participate for various other reasons, such as physical or mental incompetence or language barrier (see Table B.4, which also shows the distribution of the selected sample by interview code and age group). Among demographic subgroups, the weighted IRR was higher among 12 to 17 year olds (82.0 percent), females (73.3 percent), blacks
(78.8 percent), persons in the South (73.3 percent), and residents of small metropolitan areas (73.4 percent) than among other related groups (Table B.5).
The overall weighted response rate, defined as the product of the weighted screening response rate and weighted interview response rate or
was 60.2 percent in 2013. Nonresponse bias can be expressed as the product of the nonresponse rate and the difference between the characteristic of interest between respondents and nonrespondents in the population . By maximizing NSDUH response rates, it is hoped that the bias due to the difference between the estimates from respondents and nonrespondents is minimized. Drug use surveys are particularly vulnerable to nonresponse because of the difficult nature of accessing heavy drug users. However, in a study that matched 1990 census data to 1990 NHSDA nonrespondents,15 it was found that populations with low response rates did not always have high drug use rates. For example, although some populations were found to have low response rates and high drug use rates (e.g., residents of large metropolitan areas and males), other populations had low response rates and low drug use rates (e.g., older adults and high- income populations). Therefore, many of the potential sources of bias tend to cancel each other in estimates of overall prevalence (Gfroerer, Lessler, & Parsley, 1997a).
B.3.2 Inconsistent Responses and Item Nonresponse
Among survey participants, item response rates were generally very high for most drug use items. However, respondents could give inconclusive or inconsistent information about whether they ever used a given drug (i.e., "yes" or "no") and, if they had used a drug, when they last used it; the latter information is needed to identify those lifetime users of a drug who used it in the past year or past month. In addition, respondents could give inconsistent responses to items such as when they first used a drug compared with their most recent use of a drug. These missing or inconsistent responses first are resolved where possible through a logical editing process. Additionally, missing or inconsistent responses are imputed using statistical methodology. These imputation procedures in NSDUH are based on responses to multiple questions, so that the maximum amount of information is used in determining whether a
respondent is classified as a user or nonuser, and if the respondent is classified as a user, whether the respondent is classified as having used in the past year or the past month. For example, ambiguous data on the most recent use of cocaine are statistically imputed based on a
respondent's data for use (or most recent use) of tobacco products, alcohol, inhalants, marijuana, hallucinogens, and nonmedical use of prescription psychotherapeutic drugs. Nevertheless, editing and imputation of missing responses are potential sources of measurement error.
ORR SRR IRR
(1R)
Appendix A. Details of the editing and imputation procedures for 2013 also will appear in the
2013 NSDUH Methodological Resource Book, which is in process. Until that volume becomes
available, refer to the 2012 NSDUH Methodological Resource Book (CBHSQ, 2014a).
B.3.3 Data Reliability
A reliability study was conducted as part of the 2006 NSDUH to assess the reliability of responses to the NSDUH questionnaire. An interview/reinterview method was employed in which 3,136 individuals were interviewed on two occasions during 2006 generally 5 to 15 days apart; the initial interviews in the reliability study were a subset of the main study interviews. The reliability of the responses was assessed by comparing the responses of the first interview with the responses from the reinterview. Responses from the first interview and reinterview that were analyzed for response consistency were raw data that had been only minimally edited for ease of analysis and had not been imputed (see Sections A.3.1 and A.3.2 in this report).
This section summarizes the results for the reliability of selected variables related to substance use and demographic characteristics. Reliability is expressed by estimates of Cohen's kappa (κ) (Cohen, 1960), which can be interpreted according to benchmarks proposed by Landis and Koch (1977, p. 165): (a) poor agreement for kappas less than 0.00, (b) slight agreement for
kappas of 0.00 to 0.20, (c) fair agreement for kappas of 0.21 to 0.40, (d) moderate agreement for
kappas of 0.41 to 0.60, (e) substantial agreement for kappas of 0.61 to 0.80, and (f) almost perfect agreement for kappas of 0.81 to 1.00.
The kappa values for the lifetime and past year substance use variables (marijuana use, alcohol use, and cigarette use) all showed almost perfect response consistency, ranging from 0.82 for past year marijuana use to 0.93 for lifetime marijuana use and past year cigarette use. The value obtained for the substance dependence or abuse measure in the past year showed
substantial agreement (0.67), while the substance abuse treatment variable showed almost perfect consistency in both the lifetime (0.89) and past year (0.87). The variables for age at first use of marijuana and perceived great risk of smoking marijuana once a month showed substantial agreement (0.74 and 0.68, respectively). The demographic variables showed almost perfect agreement, ranging from 0.95 for current enrollment in school to 1.00 for gender. For further information on the reliability of a wide range of measures contained in NSDUH, see the complete methodology report (Chromy et al., 2010).
B.3.4 Validity of Self-Reported Substance Use
Most substance use prevalence estimates, including those produced for NSDUH, are based on self-reports of use. Although studies generally have supported the validity of self-report data, it is well documented that these data may be biased (underreported or overreported). The bias varies by several factors, including the mode of administration, the setting, the population under investigation, and the type of drug (Aquilino, 1994; Brener et al., 2006; Harrison & Hughes, 1997; Tourangeau & Smith, 1996; Turner, Lessler, & Gfroerer, 1992). NSDUH utilizes widely accepted methodological practices for increasing the accuracy of self-reports, such as
2002). Various procedures have been used to validate self-report data, such as biological specimens (e.g., urine, hair, saliva), proxy reports (e.g., family member, peer), and repeated measures (e.g., recanting) (Fendrich, Johnson, Sudman, Wislar, & Spiehler, 1999). However, these procedures often are impractical or too costly for general population epidemiological studies (SRNT Subcommittee on Biochemical Verification, 2002).
A study cosponsored by the Substance Abuse and Mental Health Services Administration (SAMHSA) and the National Institute on Drug Abuse (NIDA) examined the validity of NSDUH self-report data on drug use among persons aged 12 to 25. The study found that it is possible to collect urine and hair specimens with a relatively high response rate in a general population survey, and that most youths and young adults reported their recent drug use accurately in self- reports (Harrison, Martin, Enev, & Harrington, 2007). However, there were some reporting differences in either direction, with some respondents not reporting use but testing positive, and some reporting use but testing negative. Technical and statistical problems related to the hair tests precluded presenting comparisons of self-reports and hair test results, while small sample sizes for self-reports and positive urine test results for opiates and stimulants precluded drawing conclusions about the validity of self-reports of these drugs. Further, inexactness in the window of detection for drugs in biological specimens and biological factors affecting the window of detection could account for some inconsistency between self-reports and urine test results. B.3.5 Revised Estimates for 2006 to 2010
During regular data collection and processing checks for the 2011 NSDUH, data errors were identified. These errors resulted from fraudulent cases submitted by field interviewers and affected the data for Pennsylvania (2006 to 2010) and Maryland (2008 and 2009). Although all fraudulent interview cases were removed from the data files, the affected screening cases were not removed because they were part of the assigned sample. Instead, these screening cases were assigned a final screening code of 39 ("Fraudulent Case") and treated as incomplete with
unknown eligibility. The screening eligibility status for these cases then was imputed. Those cases that were imputed to be eligible were treated as unit nonrespondents for weighting
purposes; however, these cases were not treated differently from other unit nonrespondents in the weighting process in 2006 to 2010 (see Section A.3.3 in Appendix A).
Table B.3 in Appendix B of the 2011 national findings report (CBHSQ, 2012b) presents screening results for 2010, the last year that was affected by these errors. Cases that were
imputed to be eligible are classified with a final code of 39 ("Fraudulent Case"; see Table B.3 in this report). The cases that were imputed to be ineligible did not contribute to the weights and were reported as "Other, Ineligible" in the affected years. Because any cases with falsified data were treated either as ineligible or as unit nonrespondents at the screening level, they were excluded from the interview data (see Table B.4). However, some estimates for 2006 to 2010 in the 2013 national findings report and the 2013 detailed tables, as well as other new reports, may differ from corresponding estimates found in some previous reports or tables.
the only estimates appreciably affected are estimates for Pennsylvania, Maryland, the mid- Atlantic division, and the Northeast region.
The 2013 national findings report and detailed tables do not include State-level or model- based estimates. However, they do include estimates for the mid-Atlantic division and the Northeast region. Single-year estimates based on 2006 to 2010 data and estimates based on pooled data including any of these years may differ from previously published estimates. Tables and estimates based only on data since 2011 are unaffected by these data errors.
Caution is advised when comparing data from older reports with data from more recent reports that are based on corrected data files. As discussed previously, comparisons of estimates for Pennsylvania, Maryland, the mid-Atlantic division, and the Northeast region are of most concern, while comparisons of national data or data for other States and regions are essentially still valid. CBHSQ within SAMHSA has produced a selected set of corrected versions of reports and tables. In particular, CBHSQ has released a set of modified detailed tables that include revised 2006 to 2010 estimates for the mid-Atlantic division and the Northeast region for certain key measures. CBHSQ does not recommend making comparisons between unrevised 2006 to 2010 estimates and estimates based on data for 2011 and subsequent years for the geographic areas of greatest concern.