"Don't know" answers concerning somatic disease status should not be regarded as "no" responses

„Weiß nicht“-Antworten in Bezug auf Fragen nach dem Vorliegen einer körperlichen Erkrankung sollten nicht als „Nein“-Antworten betrachtet werden

Research Article

  • corresponding author Harald Baumeister - Department of Rehabilitation Psychology and Psychotherapy, Institute of Psychology, University of Freiburg, Germany

GMS Ger Med Sci 2008;6:Doc03

Received: 24 January 2008
Revised: 26 May 2008
Published: 2 June 2008

Background: When analysing patients' self-reported somatic diseases, some researchers have recoded “don't know” (DK) responses as “no” responses. The present study examines whether this procedure is appropriate.

Methods: Analyses were based on the nationally representative German National Health Interview and Examination Survey (GHS), which assessed both self-reported and physician-diagnosed diseases (N = 7124). The prevalence of participants’ DK responses and the corresponding prevalence of physicians’ diagnoses were calculated for hypertension, coronary heart disease (CHD), heart failure, asthma, chronic bronchitis, thyroid disease, diabetes, cancer, gout, arthrosis, arthritis and osteoporosis. Correlates of a physician's diagnosis among DK cases are reported.

Results: Between 1.6% and 9.8% of the participants responded with DK to the question of whether they have the disease. In 3.7% to 29.5% of DK cases, the physicians regarded the respective disease as present. Among persons who responded with DK, the probability of a physician's diagnosis increased with age and with the number of somatic comorbidities.

Conclusion: Recoding DK responses as “no” answers does not appear advisable.

Keywords: somatic diseases, validity, self-report


Background: In various studies, patients' “don't know” answers to questions about the presence of a somatic disease are recoded as “no” answers. The present study examines the validity of this procedure.

Methods: The study is based on data from the German National Health Interview and Examination Survey 1998 (Bundesgesundheitssurvey 1998, BGS98), which provides both participant-reported and physician-diagnosed disease information (N=7124). Participant-reported “don't know” answers were compared with the physicians' diagnoses (present: yes/no) for hypertension, coronary heart disease (CHD), heart failure, asthma, chronic bronchitis, thyroid disease, diabetes, cancer, gout, arthrosis, arthritis and osteoporosis.

Results: Between 1.6% and 9.8% of the participants answered “don't know” to the question of whether the respective disease had ever been present. In 3.7% to 29.5% of these cases, the physician diagnosed the disease as present. Among persons with DK answers, the probability of a physician's diagnosis was higher in older patients and in patients with a higher number of somatic diseases.

Conclusion: Recoding “don't know” answers as “no” answers does not appear advisable.

Keywords: somatic diseases, validity, self-report


Epidemiological surveys often assess the somatic health status of a population through self-report questionnaires [1], [2], [3], [4], [5]. Some of these questionnaires include a “don't know” (DK) answer when asking about the presence of a somatic disease [2], [4], [5]. How to deal with these DK responses has rarely been examined. There are two competing hypotheses [6]. The first regards DK responses as equivalent to the most conservative response, so the DK response is treated and analyzed like the conservative response option. The second assumes DK answers to be a middle response; following this hypothesis, DK responses should be analyzed separately or omitted from dichotomized analyses (e.g. yes/no). With regard to participants’ disease status (yes/no), one can assume that participants with the disease would know about it. On this basis, some researchers treat a DK response as a “no” [2], [4]. However, there is as yet hardly any evidence as to whether this recoding strategy is justified.
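The two hypotheses translate into two recoding rules, which can be sketched as follows. This is an illustrative sketch only: the function name `recode_dk`, the strategy labels and the toy answers are assumptions made for the example and are not taken from the GHS or from the cited studies.

```python
from typing import Optional


def recode_dk(response: str, strategy: str) -> Optional[bool]:
    """Map a self-reported answer to a dichotomous disease status.

    strategy="conservative": DK is treated like the conservative answer ("no").
    strategy="middle": DK is treated as a middle response and dropped (None).
    Both labels are hypothetical names chosen for this sketch.
    """
    if response == "yes":
        return True
    if response == "no":
        return False
    if response == "dk":
        if strategy == "conservative":
            return False
        if strategy == "middle":
            return None
        raise ValueError(f"unknown strategy: {strategy!r}")
    raise ValueError(f"unknown response: {response!r}")


# Toy answers, invented for illustration.
answers = ["yes", "no", "dk", "no", "dk"]

# Hypothesis 1: every DK becomes a "no" and stays in the analysis.
conservative = [recode_dk(a, "conservative") for a in answers]

# Hypothesis 2: DK cases are omitted from the dichotomized analysis.
middle = [recode_dk(a, "middle") for a in answers]
analysed_middle = [status for status in middle if status is not None]
```

Under the first rule all five toy participants remain in the analysis, two of them as recoded “no” cases; under the second rule only the three participants with a substantive answer remain.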

The present study aims to examine this issue using data from the German National Health Interview and Examination Survey (GHS), which assessed the presence of somatic diseases from both patients’ self-report and physicians’ interview. The following questions will be answered:

What is the physician-rated disease status of patients who use the DK category?
What are the sociodemographic and medical correlates of a physician-diagnosed somatic disease among patients who use the DK category?


Study design and samples

Data were drawn from the German National Health Interview and Examination Survey (GHS) [7]. The GHS was based on a stratified, multistage, cross-sectional, nationally representative sample of subjects aged 18 to 79 years from the non-institutionalized population of Germany. Aims, design and methods have been described in greater detail in a separate publication [5]. Therefore, design and sample characteristics are discussed only briefly here.

The GHS consisted of a stratified random sample from 113 communities throughout Germany with 130 sampling units. A representative gross sample of 13,222 persons was eligible according to the age, sex, and community-type criteria. All participants of the GHS filled out a questionnaire regarding sociodemographic variables, chronic diseases and health-related questions. All participants underwent a thorough physical examination and laboratory data were collected. The response rate (completing the total assessment) was 61.4% (N=7124 [5]).


Assessment of somatic diseases

The somatic examination took place in special centers at the study sites and started with a self-report questionnaire on subjects’ current and past somatic symptoms and complaints, health care utilization, impairments and disabilities, as well as characteristics of the participants. Within the questionnaire, participants were asked: “Which of the following diseases have you had?”, followed by a list of 42 disease groups (Table 1 [Tab. 1]) and 2 questions regarding other diseases not mentioned in the list. Each disease question could be answered with “yes”, “no” or “don’t know”. Upon completion of the questionnaire, a study physician conducted a structured interview in order to reexamine and refine the medical data from the self-report items. This interview was computer-assisted for standardization and integrity purposes. Diagnoses were then supplemented and, depending on the medical condition, revised on the basis of laboratory test data. For each self-reported disease, the study physicians re-diagnosed whether the disease had been present during the last 4 weeks, the last 12 months or at any earlier time (yes/no). For reasons of conciseness, the results of the present study are restricted to frequent chronic somatic cardiovascular, musculoskeletal, respiratory tract, cancer and endocrinological diseases. Results for all 42 diseases are available from the author on request.

Assessment of sociodemographic, medical and psychosocial correlates

Data such as sex, age and socioeconomic status (SES index with a range from 3 (low) to 21 (high) based on education, income and employment status) were collected within the self-report questionnaire of the GHS. The number of somatic diseases was based on the aforementioned physicians' diagnoses. To assess psychosocial disturbances, the SF-36-Mental-Health-Index subscale (MHI) was used [8]. Higher scores indicate better mental health, with a range from 0 to 100 points.

Data analysis

The data analysis was completed using Stata Statistical Software® [9]. Statistical weighting procedures were used for post-stratification adjustment to the German Census total by age, sex and region. Correct variance estimates were obtained via the Stata SVY (survey) commands. Among participants who responded with DK, correlates of the physician-diagnosed somatic disease status (yes/no) were examined by means of logistic regression models. Odds ratios (OR) with 95% confidence intervals (CI) are reported.
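To make the reported statistic concrete, the following sketch computes an odds ratio with a Wald-type 95% CI from a 2x2 table. It is a deliberate simplification and not the survey-weighted Stata analysis described above: it ignores weights and the sampling design, and it handles a single binary covariate only (in that unweighted case the cross-product ratio coincides with the exponentiated logistic regression coefficient). The function name `odds_ratio_ci` and all counts are hypothetical.

```python
import math


def odds_ratio_ci(a: int, b: int, c: int, d: int, z: float = 1.96):
    """Odds ratio and Wald CI from a 2x2 table of DK respondents.

    a: covariate present, disease diagnosed by the physician
    b: covariate present, disease not diagnosed
    c: covariate absent,  disease diagnosed
    d: covariate absent,  disease not diagnosed
    z: normal quantile (1.96 gives an approximate 95% interval)
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive")
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) (Woolf's method).
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, invented for illustration: DK respondents with at
# least one somatic comorbidity (first row) versus none (second row).
or_value, ci_low, ci_high = odds_ratio_ci(a=30, b=70, c=15, d=85)
print(f"OR = {or_value:.2f}, 95% CI {ci_low:.2f} to {ci_high:.2f}")
```

An interval that excludes 1 corresponds to a significant association at roughly the 5% level; with survey data, design-based standard errors would typically be wider than this unweighted approximation.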


Across all diseases, 5.4% of the participants responded with DK to the question of whether they had ever had the disease. In 11.4% of the DK cases, the physicians regarded the disease as present. For the specific diseases, between 1.6% (cancer) and 9.8% (gout) of the participants responded with DK (Table 2 [Tab. 2]). DK answers were given approximately ¼ (hypertension) to twice (osteoporosis) as often as “yes” answers. In 3.7% (osteoporosis) to 29.5% (arthrosis) of DK cases, the physicians regarded the disease as present. Among persons who responded with DK, a physician's diagnosis of the disease was significantly associated (p<0.05) with covariates in some cases (Table 3 [Tab. 3]). Owing to the differing sample sizes, these results should be interpreted with caution. However, there seems to be a trend (p<0.2) for older participants (6 of 12 comparisons) and somatically comorbid participants (9 of 12 comparisons) who responded with DK to be diagnosed as having had the disease.
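The three quantities reported per disease (share of DK answers, share of DK cases with a positive physician diagnosis, and the ratio of DK to “yes” answers) can be derived from paired records as in the sketch below. The record layout, the function name `dk_summary` and the toy data are assumptions for illustration; the sketch is unweighted, whereas the survey results were weighted.

```python
def dk_summary(records):
    """Summarise DK use from (self_report, physician_positive) pairs.

    self_report is "yes", "no" or "dk"; physician_positive is a bool
    stating whether the physician regarded the disease as present.
    """
    records = list(records)
    n_total = len(records)
    n_yes = sum(1 for answer, _ in records if answer == "yes")
    n_dk = sum(1 for answer, _ in records if answer == "dk")
    n_dk_positive = sum(
        1 for answer, positive in records if answer == "dk" and positive
    )
    if n_total == 0 or n_dk == 0 or n_yes == 0:
        raise ValueError("need at least one record, one DK and one yes answer")
    return {
        "dk_share": n_dk / n_total,                 # DK among all participants
        "dk_positive_share": n_dk_positive / n_dk,  # diagnosed among DK cases
        "dk_to_yes_ratio": n_dk / n_yes,            # DK relative to "yes"
    }


# Toy data for one disease, invented for illustration (100 participants).
toy_records = (
    [("yes", True)] * 10
    + [("no", False)] * 80
    + [("no", True)] * 2
    + [("dk", False)] * 7
    + [("dk", True)] * 1
)
summary = dk_summary(toy_records)
```

In the toy data, 8 of 100 participants answer DK, and 1 of those 8 is diagnosed by the physician; that one participant would be misclassified if DK were recoded as “no”.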


The present study examined for the first time the relationship between patients’ DK responses to questions concerning their disease status and physicians’ diagnoses of the respective disease. When interpreting the results, four limitations of this study should be considered. First, the physicians' diagnoses were not tested for interrater reliability, nor did the physicians have access to patients' medical records. Hence, there is also a risk of false diagnoses by the physicians. Second, the chronology of the assessment of self-reported and physician-diagnosed diseases may have restricted the independence of the measures. However, study physicians were trained to assess a variety of medical conditions in order to increase the reliability and validity of the diagnoses. Third, participants and physicians were asked for lifetime diagnoses, which may have biased the results owing to participants' recall difficulties and the lack of past medical history data. For this reason, we restricted our results to chronic conditions. Fourth, asking for the same information with differently worded questions may yield different responses. Thus, the present results can only be regarded as representative of the question wordings used in the present survey.

In prior studies on patient self-reported diseases, some researchers transformed DK responses into “no” responses [2], [4]. The rationale for this method is the assumption that participants with a specific disease would know about it. As shown, this procedure causes misclassification, which proved to be considerable at least for some of the diseases examined as well as for older and multimorbid participants, while no such trend was found for sex, SES and mental health status. For hypertension, chronic bronchitis, thyroid disease and arthrosis, between 16.7% and 29.5% of the patients giving DK responses were diagnosed with the disease by the physician. Moreover, there is a risk of misclassification when a disease is infrequent and DK responses are frequent, as with osteoporosis (4.5% self-reported “yes”, 8.3% self-reported “DK”). After recoding the positively diagnosed DK responses, the osteoporosis sample increased from 309 to 330 cases. The same applies to CHD, heart failure, gout and arthritis. Therefore, transforming DK responses into “no” answers does not appear advisable; if there are reasons for keeping the DK responses as “no” cases within the sample (e.g. small sample size), the procedure should at least be discussed as a limitation.
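The size of the osteoporosis effect can be checked with simple arithmetic. The sketch below uses only figures stated in the text; the final cross-check is an assumption-laden approximation, because it applies the (presumably weighted) percentages to the full sample of N = 7124 and therefore need not reproduce the reported count exactly.

```python
# Figures reported in the text for osteoporosis.
cases_before = 309        # cases before adding positively diagnosed DK cases
cases_after = 330         # cases after adding them
n_sample = 7124           # total sample size
dk_share = 0.083          # share of participants answering DK
dk_positive_share = 0.037 # share of DK cases diagnosed by the physician

added = cases_after - cases_before
relative_increase = added / cases_before
print(f"{added} additional cases, {relative_increase:.1%} relative increase")

# Rough consistency check (assumption: percentages apply to the full N).
approx_added = n_sample * dk_share * dk_positive_share
print(f"approximately {approx_added:.0f} positively diagnosed DK cases expected")
```

The 21 additional cases amount to a relative increase of about 6.8% in the osteoporosis sample, and the rough cross-check gives a figure of similar size.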

With regard to the question of whether a DK option should be offered in surveys on disease status, there are at least two competing arguments to consider. On the one hand, a DK option increases the validity of “yes” and “no” answers and reduces missing data, since only those participants who are certain will answer with yes or no, and most others will choose the DK category instead of leaving the question unanswered. In this context, it would be of interest to examine the response pattern of DK respondents in the case of forced choice questions (yes/no). On the other hand, omitting the DK option may lead to a higher percentage of substantive responses relative to surveys that offer a DK option [6]. This benefit of forced choice questions, however, has to be balanced against the risk of false-positive and false-negative self-reported disease status. The risk may increase in the case of diseases that often remain undetected, such as hypertension, while for diseases that are not well known, forced choice questions may simply turn DK responses into non-responses. Thus, the assessment strategy should take into account the detection rates and the awareness level of a disease as well as sample characteristics such as age and number of comorbidities. Overall, however, including a DK option and excluding cases that responded with DK seems to be the most conservative strategy, keeping misclassification bias to a minimum.


Conflicts of interest

None declared.


We would like to thank the Robert Koch Institute for providing the GHS survey data [7].


