Systematic investigation of gastrointestinal diseases in China (SILC): validation of survey methodology
BMC Gastroenterology volume 9, Article number: 86 (2009)
Symptom-based surveys suggest that the prevalence of gastrointestinal diseases is lower in China than in Western countries. The aim of this study was to validate a methodology for the epidemiological investigation of gastrointestinal symptoms and endoscopic findings in China.
A randomized, stratified, multi-stage sampling methodology was used to select 18 000 adults aged 18-80 years from Shanghai, Beijing, Xi'an, Wuhan and Guangzhou. Participants from Shanghai were invited to provide blood samples and undergo upper gastrointestinal endoscopy. All participants completed Chinese versions of the Reflux Disease Questionnaire (RDQ) and the modified Rome II questionnaire; 20% were also invited to complete the 36-item Short Form Health Survey (SF-36) and Epworth Sleepiness Scale (ESS). The psychometric properties of the questionnaires were evaluated statistically.
The study was completed by 16 091 individuals (response rate: 89.4%), with 3219 (89.4% of those invited) completing the SF-36 and ESS. All 3153 participants in Shanghai provided blood samples and 1030 (32.7%) underwent endoscopy. Cronbach's alpha coefficients were 0.89, 0.89, 0.80 and 0.91, respectively, for the RDQ, modified Rome II questionnaire, ESS and SF-36, supporting internal consistency. Factor analysis supported construct validity of all questionnaire dimensions except SF-36 psychosocial dimensions.
This population-based study has great potential to characterize the relationship between gastrointestinal symptoms and endoscopic findings in China.
The epidemiology of common gastrointestinal diseases differs between populations in Asian and in Western countries, and in Asia particularly, the pattern of these diseases seems to be changing . The prevalence of peptic ulcer disease has been declining at the same time as the prevalence of gastroesophageal reflux disease (GERD) and its complications have been increasing . Gastric cancer remains a common cancer in Asia, but its prevalence has also been declining in recent years . These changes may reflect the experience of Western countries several decades ago, and comparisons of the epidemiology of gastrointestinal diseases between Western and Asian countries continue to be of interest.
GERD is a chronic disease in which reflux of gastric contents into the esophagus causes a broad range of troublesome symptoms and esophageal complications. The recent Montreal consensus states that a diagnosis of GERD can be made based on symptoms alone and, in population-based surveys, a defined symptom threshold is required for making a symptom-based diagnosis . The historical lack of a single definition of GERD makes comparisons between studies difficult. A systematic review of studies that defined GERD as symptoms of heartburn and/or acid regurgitation experienced at least once a week concluded that GERD is relatively uncommon in Asia, with a prevalence of approximately 5%; this compared with 10-20% in Western countries . However, the recommendations of the Montreal consensus  have not yet been applied to epidemiological studies of GERD. Furthermore, the existence of essentially asymptomatic reflux esophagitis - seen in studies of Asian individuals undergoing upper gastrointestinal endoscopy as part of a routine health check [5, 6] and population-based endoscopic studies of GERD conducted in Europe [7, 8] - means that population-based endoscopic studies are needed to understand the epidemiology of GERD in Asia.
Irritable bowel syndrome (IBS) and dyspepsia are both common gastrointestinal disorders in the West, affecting 10-20%  and 20-40%  of the population, respectively. They are frequently diagnosed based on symptoms, and often coexist in the same patient . Data on the epidemiology of IBS and dyspepsia are limited in Asian countries, and prevalence estimates vary widely [12, 13]. However, the prevalence of IBS appears to be lower in Asia than in the West , with a prevalence of 3.8-6.6% in China according to Rome II criteria [15–18]. The prevalence of dyspepsia in populations from Korea, Taiwan and China is 11.7-18.4%, according to Rome II criteria [13, 19]. Further studies of IBS, dyspepsia and their relationship to each other, and to GERD, are needed in Asia.
The variation in prevalence of Helicobacter pylori infection is one potential explanation for the differences in the epidemiology of upper gastrointestinal diseases between Asian and Western populations and the changing patterns seen in Asia . H. pylori infection remains more prevalent in Asia than in the West [20, 21], where infection rates have declined in recent decades alongside improving socioeconomic conditions and the availability of eradication therapy [21, 22]. Eradication of H. pylori has an important role in the treatment or prevention of peptic ulcer disease, dyspepsia and gastric cancer [21, 23]. However, the role of H. pylori in GERD remains controversial ; H. pylori infection has been negatively associated with GERD in population studies from Asia, but the results of eradication studies in patients with peptic ulcer disease are inconsistent [25, 26].
Asian population-based studies, including endoscopic data and testing for H. pylori, are required to clarify the complex relationships between symptom-based GERD, reflux esophagitis, dyspepsia, IBS, peptic ulcer disease and H. pylori infection. We have previously conducted a pilot study in Shanghai to validate the methodology for the epidemiological study of GERD in China . This article reports the validation of a refined methodology in the Systematic Investigation of Gastrointestinal Diseases in China (SILC) study, which set out to investigate the epidemiology of upper and lower gastrointestinal diseases in five regions across China using symptom questionnaires that have undergone appropriate validation. The study aimed to collect reliable information on symptom patterns, comorbidities, health-related quality of life and findings from endoscopy, esophageal biopsy and laboratory investigations, including tests for H. pylori infection.
This study was conducted in five diverse regions of China, all of which are major population centres (Figure 1).
In total, 18 000 adults aged 18-80 years were randomly selected from Shanghai, Beijing, Xi'an, Wuhan and Guangzhou, using a stratified, multi-stage sampling methodology. Given that urban and rural populations are widely divergent in terms of environment and socioeconomic status, and because approximately half of the residents of these centres live in rural areas (Shanghai: 54.2%; Beijing: 42.5%; Xi'an: 50.4%; Wuhan: 41.4%; Guangzhou: 55.7% [28–33]), urban and rural populations were treated as two separate strata and sampled in a ratio of 1:1 (n = 1800 for each stratum in each region). At the first sampling stage, one or more districts from the urban stratum and one or more counties from the rural stratum were randomly selected from each region. At the second stage, one or more blocks were randomly selected from the urban districts, and one or more townships from the rural counties. At the third stage, one or more residential areas were randomly sampled from the urban blocks, and one or more villages from the rural townships.
All residents of the selected residential areas or villages were stratified according to their age and sex, and individuals were randomly selected from these strata in proportion to the overall age and sex distribution of the population in that region, using data from recent government population surveys (The Fifth Population Census In China, 2000, or the 1% sample survey, 2005) [34–39]. Residents who were younger than 18 years, older than 80 years, illiterate, or who had major psychiatric illness or severe visual, hearing or learning disabilities, were excluded from the study.
All selected individuals were asked to complete a general information questionnaire, and Chinese versions of the Reflux Disease Questionnaire (RDQ)  and modified Rome II questionnaire (Figure 2) . In addition, a random sub-sample of the total sample, 20% in each region, was asked to complete Chinese versions of the 36-item Short-Form Health Survey (SF-36)  and the Epworth Sleepiness Scale (ESS)  and to undergo a physical examination that included measurement of weight (kg), height (cm), and waist and hip circumference (cm). Measurements were made with individuals wearing indoor clothing without shoes. Weight was measured on an electronic scale to the nearest 0.1 kg, and height and circumference measurements were recorded to the nearest centimetre.
All participants from the Shanghai region were invited to undergo upper gastrointestinal endoscopy with esophageal biopsy. Endoscopic results were classified using the Los Angeles (LA) classification of reflux esophagitis  and the Prague C and M classification of endoscopically suspected esophageal metaplasia . Markers of microscopic esophagitis from biopsies were classified using criteria developed by an international panel of expert pathologists . Individuals in Shanghai were also asked to provide blood samples for H. pylori antibody serology using an immunoglobulin G enzyme-linked immunosorbent assay (ELISA) (Biohit, Helsinki, Finland).
The fieldwork period was from April 2007 to January 2008. Questionnaires were self-administered, with trained and supervised facilitators available to explain any questions that respondents found unclear. (Most queries were about the questionnaire formats, such as the skip rules in the modified Rome II questionnaire.) Participants completed the questionnaires at home or in local residential committee offices. Three attempts were made to contact a resident before he or she was considered to be a non-responder.
To encourage high response rates, local residential committee staff publicized the survey in the selected residential areas and villages using leaflets and posters. They also supported participation by setting up early morning and weekend sessions in local residential committee offices, where individuals could come to complete the questionnaires without this interfering with their daily routines. In addition, a small gift (such as shampoo or washing powder) was offered to reward participation. In the Shanghai region, the diagnostic benefits of a free blood test and endoscopy with biopsy were explained, along with the need to fast overnight. Breakfast was provided for those who underwent blood tests and/or endoscopy, and participants were transported to and from the hospital where endoscopy was performed. Individuals were subsequently provided with all test results.
The study was staffed by graduates from the Department of Health Statistics, Second Military Medical University in Shanghai, who received training centrally from gastrointestinal specialists and epidemiologists in Shanghai. Most had also taken part in the pilot survey in Shanghai and were familiar with the survey process. These study staff provided standardized training for facilitators, who were local university graduates or social workers at the sampled sites, before fieldwork commenced. After each questionnaire had been completed, it was checked for completeness and signed by the responsible member of the study staff before the participant received their gift.
Informed consent was obtained from study participants, who were free to discontinue their involvement in the study at any time. The study was approved by the Ethics Committee of the Second Military Medical University in Shanghai, China.
The general information questionnaire collected data on age, height, weight, sex, marital status, education, income, occupation, lifestyle, self-reported health status, family history of gastrointestinal diseases, and medical history (current and previous medical problems and related treatment). Current smoking was defined as smoking at least one cigarette daily, and drinking alcohol was classified as alcohol consumption on four or more occasions per month. Current occupation was classified as white-collar worker (including government employees, professionals and technicians), manufacturing industry worker, agricultural or fisheries worker, and other (including service sector and students). Participants indicated their health status, total monthly family income and weekly level of recreational exercise by selecting an option from appropriate categories.
The RDQ is a 12-item, patient-reported questionnaire that assesses the frequency and severity of upper gastrointestinal symptoms such as heartburn, regurgitation and epigastric pain. In this study, a 1-month recall period was used. Each item was scored on a 6-point Likert scale (Table 1). Traditionally, the items are combined into the dimensions of heartburn (burning behind the breastbone and pain behind the breastbone), regurgitation (acid taste in the mouth and movement of materials upwards from the stomach) and epigastric pain (epigastric pain and epigastric burning) . A GERD dimension can be obtained by combining the heartburn and regurgitation dimensions, and the epigastric pain dimension is also known as the dyspepsia dimension. There is strong evidence supporting the validity and reliability of the RDQ as a diagnostic tool in primary care  and as a measure of treatment response in clinical trials [48, 49]. Previous investigators found that a Chinese version of the RDQ tested in 10 hospitals in China was able to identify the presence of reflux symptoms experienced during the previous month . The Chinese version of the RDQ used in the present study underwent extensive linguistic validation, including forward and backward translation, cognitive debriefing of patients with GERD, and expert input from gastroenterologists. The pilot study in Shanghai showed that a version with a 1-week recall period had credible reliability and construct validity . In the current study, an item was added immediately after the RDQ that asked participants to rate how troublesome they found the symptoms that they had scored in the RDQ overall, using a 5-point scale (where the lowest score was 'not at all' and the highest score was 'extremely').
The Rome II questionnaire is designed to assess symptoms of upper and lower functional gastrointestinal disorders, including esophageal disorders, gastroduodenal disorders, bowel disorders, functional abdominal pain, biliary disorders and anorectal disorders, in a 3-month recall period . The Rome II questionnaire is widely used in research and clinical practice, and there is evidence supporting its validity across various cultures in Asia and the West, including a Chinese version in China [51–53]. The version of the Rome II questionnaire used in this study was translated into Chinese through a process of forward- and back-translation and reconciliation, and then tested for linguistic validity through a process of cognitive debriefing with literate volunteers. It was also modified by removal of reflux items covered by the RDQ, to shorten the survey for participants and minimize repetition and possible confusion. Individuals answered yes or no to eight mandatory items, and a further 16 items covering gastroduodenal, bowel and biliary symptoms were completed only if relevant.
The SF-36 is a generic questionnaire that is widely used to assess health status and well-being during the previous 4 weeks. It contains 36 items, clustered into eight dimensions (physical functioning, role-physical, bodily pain, general health, vitality, social functioning, role-emotional and mental health), plus one item on health change during the previous year . The score for each dimension is the sum of the scores of each item it contains, transformed to a value on a scale of 0 to 100, with high scores indicating good health-related quality of life. The reliability and validity of the SF-36 are well documented in a range of language versions, including Chinese [54–56]. The psychometric properties of the Chinese version of the SF-36 used in this study were assessed during the pilot study in Shanghai . The questionnaire was found to have credible reliability and construct validity.
The ESS is an eight-item, self-administered questionnaire that is widely used to measure daytime sleepiness in adults . The likelihood of dozing in various everyday situations is scored on a 4-point Likert scale, where a score of 3 indicates a high risk of dozing during the daytime and a score of 0 indicates no risk of dozing. Item scores are summed to produce a final score in the range 0-24. ESS scores above 12 suggest problems with excessive sleepiness, scores of 10-12 are borderline, and scores below 10 are considered normal. The reliability of the ESS has been demonstrated in English in Australia  and in Chinese in Hong Kong . For use in the present study, it was translated into Chinese through a process of forward- and back-translation and reconciliation, and then tested for linguistic validity through a process of cognitive debriefing with literate volunteers.
Data collection and analysis
Coding and double entry of questionnaire responses were carried out by two independent professional data-entry staff from the Department of Health Statistics. EpiData software (EpiData Foundation, Odense, Denmark)  was used to check for consistency between the two sets of data entries to ensure data quality. SAS 9.1.3 software (SAS Institute, Cary, NC, USA) was used for the data analyses. The mean value of the completed items was used to impute missing values where at least 50% of the items in an RDQ or SF-36 dimension or in the overall ESS had been completed. When values were missing for more than 50% of items, the dimension score (or questionnaire score for the ESS) was excluded from the analysis [60–62]. For the modified Rome II questionnaire, imputation was not performed if an item score was missing.
The internal consistency of survey instruments was evaluated using Cronbach's alpha coefficient to determine the extent to which items within each questionnaire were interrelated . Cronbach's alpha coefficients for each questionnaire were calculated by correlating all individual item scores with dimension scores and/or the overall score. An alpha coefficient above 0.70 suggested good internal consistency. Correlation and/or factor analyses were used to evaluate the construct validity of the RDQ, Rome II questionnaire and the SF-36 - in other words, whether each questionnaire actually measures the phenomena that it theoretically predicts. Factor analysis using principal component analysis and quartimax rotation was employed to explore the factor structure of each questionnaire. Factor loadings larger than 0.50 within one dimension were considered to support the factor construct, provided that the factor loadings were low across the other dimensions; cumulative rates were used to show the contributions of combinations of principal components . Correlation analysis was used to measure the strength of association between dimension scores and the total score for the SF-36 questionnaire, and between item scores and dimension scores for the RDQ. A correlation coefficient of more than 0.6 was considered to indicate a strong correlation, 0.3-0.6 a moderate correlation, and less than 0.3 a weak correlation .
Body mass index (BMI) was calculated from self-reported height and weight measurements from the general information questionnaire and, in the 20% sub-sample, from measurements taken by study staff as part of the physical examination. Respondents were categorized on the basis of BMI as having a low risk of cardiovascular disease and type 2 diabetes (< 18.5 kg/m2, underweight), an increasing but acceptable risk (18.5-22.9 kg/m2, normal weight), an increased risk (23.0-27.4 kg/m2, overweight) or a high risk (≥ 27.5 kg/m2, obese) . Agreement between participant-reported height and weight and the measurements made by study staff was assessed using the intraclass correlation coefficient (ICC).
Based on the Montreal definition of GERD for population-based studies , the RDQ was used to define symptom-defined GERD as mild symptoms of heartburn ('burning behind the breastbone' and/or 'pain behind the breastbone') and/or regurgitation ('acid taste in the mouth' and/or 'unpleasant movement of materials upwards from the stomach') occurring on at least 2 days a week (RDQ item frequency score ≥ 3 for a severity score of ≥ 2), or moderate/severe symptoms of heartburn and/or regurgitation occurring on at least 1 day a week (RDQ item frequency score ≥ 2 for a severity score ≥ 3).
As described above, reflux esophagitis was graded using the LA classification system  and endoscopically suspected esophageal metaplasia (suspected Barrett's esophagus) was assessed using the Prague C and M criteria . Hiatal hernia was defined as gastric folds extending at least 2 cm above the diaphragmatic hiatus during quiet respiration.
Peptic ulcer disease was defined as a defect in the gastric or duodenal wall that extended through the muscularis mucosae into the deeper layers of the wall (submucosa or the muscularis propria). H. pylori test results were classified as positive (≥ 38 enzyme immunoassay units [EIU]) or negative (< 38 EIU) .
Atrophic gastritis was defined on the basis of the more severely affected of the serum pepsinogen (PG) I concentration and the PGI/PGII ratio. Results were classified according to the manufacturer's recommendations for China. PGI levels < 30 μg/L and/or a PGI/PGII ratio < 3 indicated severe atrophy (achlorhydric or very hypochlorhydric conditions). PGI levels from 30 μg/L to < 50 μg/L and/or a PGI/PGII ratio from 3 to < 5 indicated moderate atrophy. PGI levels from 50 μg/L to < 70 μg/L and/or a PGI/PGII ratio from 5 to < 7 indicated mild atrophy (hypochlorhydric conditions). Levels above the cut-offs but in the presence of H. pylori antibodies indicated non-atrophic H. pylori gastritis.
IBS was defined according to the Rome II criteria as abdominal discomfort or pain that has at least two of the following three features: (1) relieved with defecation; (2) onset associated with a change in frequency of stool; and/or (3) onset associated with a change in form (appearance) of stool . IBS was described as diarrhoea-predominant (IBS-D) if patients had one or more of the following: more than three bowel movements a day, loose (mushy) or watery stools, and/or urgency (having to rush to have a bowel movement); and none of the following: fewer than three bowel movements a week, hard or lumpy stools, and/or straining during a bowel movement. IBS was described as constipation-predominant (IBS-C) if patients had one or more of the following: fewer than three bowel movements a week, hard or lumpy stools, and/or straining during a bowel movement; and none of the following: more than three bowel movements a day, loose (mushy) or watery stools, and/or urgency (having to rush to have a bowel movement).
Dyspepsia was defined according to the Rome II criteria as persistent or recurrent pain or discomfort centred in the upper abdomen, with no evidence that symptoms are exclusively relieved by defecation or associated with the onset of a change in stool frequency or stool form . Dyspepsia was described as ulcer-like if the predominant (most bothersome) symptom was pain centred in the upper abdomen. Dyspepsia was described as dysmotility-like if the predominant symptom was an unpleasant or troublesome nonpainful sensation (discomfort) centred in the upper abdomen; this sensation may be characterized by or associated with upper abdominal fullness, early satiety, bloating or nausea.
In total, 16 091 individuals completed the questionnaires within a period of 10 months in 2007 and 2008. The response rate ranged from 87.6% to 91.4% in the different regions, and the overall response rate was 89.4% (Table 2). In all five regions, the response rate was lower in men (78.7-89.9%) than in women (91.5-98.4%), and was generally lowest in the youngest age group (73.1-85.2% in 18-29 year-olds). Nearly half of the non-responders to the survey were in the 18-29-year-old age group; the response rate in this group was lower in men (75.2%) than in women (86.5%) overall and in all regions (data not shown). Thirteen participants were excluded from the analysis because of logical errors or insufficient completion of questionnaires, leaving a total of 16 078 individuals suitable for inclusion in the analysis.
In the 20% sub-sample, 3219 individuals responded, equating to a response rate of 89.4% overall (82.2-93.9% in the different regions), and data from 3214 individuals were suitable for analysis (Table 2). In the Shanghai sample, all 3153 participants (100%) provided blood samples, of which 3151 were suitable for inclusion in the analysis, and 1030 (32.7%) volunteered to undergo endoscopy with biopsy, of whom 1029 were suitable for inclusion in the analysis of endoscopic data. Six participants did not undergo biopsy because they met biopsy exclusion criteria (e.g. presence of esophageal varices or angioma), leaving 1022 individuals with biopsy results suitable for analysis.
The mean age of the participants was 43 years, and 52% were women (Table 3). BMI ranged from 11.8 kg/m2 to 41.0 kg/m2, with a mean BMI of 22.6 kg/m2. According to the BMI ranges appropriate for the population , 34.4% of participants were overweight and 8.1% were obese. When self-reported height and weight data were compared with measurements taken by study staff in the 20% sub-sample, ICCs of 0.97 and 0.98, respectively, were obtained, indicating that the self-reported values were reliable. Most participants were married (78%), did not smoke (72%) and did not drink alcohol (80%). Almost all smokers (95.6%) and drinkers (92.8%) were male. Further socioeconomic characteristics of participants are summarized in Table 4. The age and sex distribution of the participants was broadly representative of the distribution of the general population in each region (Table 5), although individuals in the youngest age groups were slightly under-represented at each study site.
Several demographic variables differed between those who did and those who did not volunteer for endoscopy in the Shanghai region. Greater proportions of participants who underwent endoscopy lived in a rural environment (63.1%, versus 43.8% of those who did not undergo endoscopy) or had a family history of gastrointestinal diseases (23.7% versus 11.5%), whereas the proportions of individuals who were in the youngest age range (18-29 years; 5.3% versus 13.9%), who were unmarried (3.8% versus 11.9%) or who were college graduates (8.9% versus 14.2%) were lower among the population who underwent endoscopy than the population that did not (all p ≤ 0.001). Sex, BMI, occupation, annual income, smoking status and alcohol consumption were similar among those who underwent endoscopy and those who did not.
Internal consistency (indicated by Cronbach's alpha coefficient) was above 0.7 for all questionnaires, demonstrating good reliability. Cronbach's alpha coefficient was 0.89 for the RDQ, 0.89 for the modified Rome II questionnaire, 0.80 for the ESS, and 0.91 for the overall SF-36. For seven of the eight individual SF-36 dimensions, Cronbach's alpha coefficient was above 0.7 (0.75-0.92), but for social functioning it was 0.51.
The RDQ, modified Rome II and SF-36 all demonstrated credible construct validity using correlation and/or factor analyses. For the RDQ, each dimension correlated most strongly with the items comprising it (Spearman correlation coefficients 0.63-0.87, p < 0.001; Table 6). Factor analysis of the RDQ indicated that the 12 items distributed to four factors that supported the theoretical construct of the RDQ (one representing the regurgitation dimension, another representing the heartburn dimension and two representing the dyspepsia dimension). Factor loadings ranged from 0.67 to 0.88, and the cumulative rate of the four factors was 80.1%.
Factor analysis of the modified Rome II questionnaire revealed that the 37 items distributed to seven factors with factor loadings that ranged from 0.57 to 0.94. The cumulative rate of the seven factors was 78.9%. These factors, representing dimensions of functional dyspepsia, IBS, an IBS subtype, biliary disorders, functional vomiting, aerophagia and an integrated dimension, provide empirical support for the validity of the Rome II classification system in China.
For the SF-36, all dimensions correlated most strongly with the items comprising it, except the physical functioning dimension. Spearman correlation coefficients for the association between an item score and its dimension score were greater than 0.6 (range 0.61-0.99) for all items except seven out of the ten physical functioning items (range 0.25-0.50). In factor analysis of the SF-36, the 36 items distributed to eight factors that generally supported the theoretical construct of the questionnaire (cumulative rate of 65.5%). Most items were distributed as expected from the theoretical construct (factor loadings 0.48-0.86), except for all mental health and vitality items, which were distributed to two factors (factor loadings 0.61-0.78); the social functioning items, which did not resolve into a distinct factor, but instead were distributed most strongly to role-emotional and mental health/vitality factors (factor loadings 0.45 and 0.43, respectively); and one general health item, which was also not clearly distributed to the expected factor.
The purpose of this study was to evaluate the validity of the methodology used to conduct the largest multicentre population-based survey of gastrointestinal symptoms in China to date, and the first to include upper gastrointestinal endoscopy with biopsy. The methodology was based on that applied in a pilot study in Shanghai , which verified the feasibility of conducting this larger and more extensive multicentre study.
High response rates were achieved to the simple, validated questionnaires used and, although the youngest age group was slightly under-represented, the survey sample was broadly representative of the general population in the survey regions. However, response rates are likely to have been low among migrant workers, who are registered at their place of origin rather than where they have moved in search of work, and who make up more than 10% of the Chinese population . Response rates among individuals invited for physical examination and blood sampling were high, demonstrating that the traditional reluctance of Chinese individuals to provide blood samples can be overcome. As anticipated, the response rate among those invited to undergo endoscopy was lower, and led to some response bias, but the number of participants undergoing endoscopy was still substantial (n = 1030).
The reliability and validity of the survey instruments in the current study were demonstrated for the modified Rome II questionnaire and ESS, and confirmed for the SF-36 and RDQ. Each questionnaire had a high Cronbach's alpha coefficient (≥ 0.8), suggesting good reliability. The internal consistency of the RDQ dimensions replicated findings from the original RDQ, as well as other language versions [48, 71]. Although test-retest reliability, known-groups validity and responsiveness to change were not assessed, the test-retest reliability of the Chinese versions of the RDQ and SF-36 was previously established in the Shanghai pilot study . The construct validity of the questionnaires was also credible. As found in the Shanghai pilot study and previous studies in China, the social functioning dimension of the SF-36 tends to perform less well than other dimensions [27, 54], and vitality is more strongly associated with mental health among Chinese-speaking people [54, 72–75]. It seems likely that these variations reflect underlying cultural perceptions and social standards regarding the relationship between health, energy (or 'qi') and social functioning in China . This limits the reliability of conclusions reached on the basis of data from this SF-36 dimension in China, and highlights the importance of assessing the psychometric validity of survey instruments in different cultures and settings.
In summary, the methodology used to conduct this large, multicentre epidemiological study of gastrointestinal diseases in five regions across China was valid and credible, and the survey questionnaires demonstrated acceptable psychometric properties. The prevalence of gastrointestinal diseases in these diverse regions of China, together with any associations of symptoms with demographic characteristics, health-related quality of life, sleep disturbance, and endoscopic, biopsy and laboratory findings, will be described in subsequent publications. The SILC study has the potential to make a major contribution to the epidemiological understanding of symptoms of gastrointestinal diseases in China, and to provide insight into the relationship between gastrointestinal symptoms and endoscopic findings that is likely to be of global significance.
Goh KL: Changing trends in gastrointestinal disease in the Asia-Pacific region. J Dig Dis. 2007, 8: 179-185. 10.1111/j.1751-2980.2007.00304.x.
Goh KL, Wong HT, Lim CH, Rosaida MS: Time trends in peptic ulcer, erosive reflux oesophagitis, gastric and oesophageal cancers in a multiracial Asian population. Aliment Pharmacol Ther. 2009, 29: 774-780. 10.1111/j.1365-2036.2009.03930.x.
Vakil N, Veldhuyzen van Zanten S, Kahrilas P, Dent J, Jones R: The Montreal definition and classification of gastro-esophageal reflux disease (GERD) - a global evidence-based consensus. Am J Gastroenterol. 2006, 101: 1900-1920. 10.1111/j.1572-0241.2006.00630.x.
Dent J, El-Serag HB, Wallander MA, Johansson S: Epidemiology of gastro-oesophageal reflux disease: a systematic review. Gut. 2005, 54: 710-717. 10.1136/gut.2004.051821.
Goh K-L: Gastroesophageal reflux disease (GERD) in the East - same as the West. J Clin Gastroenterol. 2007, 41 (Suppl 2): S54-58. 10.1097/MCG.0b013e3180322db1.
Wong BCY, Kinoshita Y: Systematic review on epidemiology of gastroesophageal reflux disease in Asia. Clin Gastroenterol Hepatol. 2006, 4: 398-407. 10.1016/j.cgh.2005.10.011.
Ronkainen J, Aro P, Storskrubb T, Johansson SE, Lind T, Bolling-Sternevald E, Graffner H, Vieth M, Stolte M, Engstrand L, Talley NJ, Agreus L: High prevalence of gastroesophageal reflux symptoms and esophagitis with or without symptoms in the general adult Swedish population: a Kalixanda study report. Scand J Gastroenterol. 2005, 40: 275-285. 10.1080/00365520510011579.
Zagari RM, Fuccio L, Wallander MA, Johansson S, Fiocca R, Casanova S, Farahmand BY, Winchester CC, Roda E, Bazzoli F: Gastro-oesophageal reflux symptoms, oesophagitis and Barrett's oesophagus in the general population: Loiano-Monghidoro study. Gut. 2008, 57: 1354-1359. 10.1136/gut.2007.145177.
Longstreth GF, Thompson WG, Chey WD, Houghton LA, Mearin F, Spiller RC: Functional bowel disorders. Gastroenterology. 2006, 130: 1480-1491. 10.1053/j.gastro.2005.11.061.
Stanghellini V, Tosetti C, Barbara G, De Giorgio R, Salvioli B, Corinaldesi R: Review article: the continuing dilemma of dyspepsia. Aliment Pharmacol Ther. 2000, 14 (Suppl 3): 23-30. 10.1046/j.1365-2036.2000.00397.x.
Gwee KA, Chua AS: Functional dyspepsia and irritable bowel syndrome, are they different entities and does it matter?. World J Gastroenterol. 2006, 12: 2708-2712.
Kang JY: Systematic review: the influence of geography and ethnicity in irritable bowel syndrome. Aliment Pharmacol Ther. 2005, 21: 663-676. 10.1111/j.1365-2036.2005.02396.x.
Hu WH, Wong WM, Lam CL, Lam KF, Hui WM, Lai KC, Xia HX, Lam SK, Wong BC: Anxiety but not depression determines health care-seeking behaviour in Chinese patients with dyspepsia and irritable bowel syndrome: a population-based study. Aliment Pharmacol Ther. 2002, 16: 2081-2088. 10.1046/j.1365-2036.2002.01377.x.
Chang FY, Lu CL: Irritable bowel syndrome in the 21st century: perspectives from Asia or South-east Asia. J Gastroenterol Hepatol. 2007, 22: 4-12. 10.1111/j.1440-1746.2006.04672.x.
Lau EM, Chan FK, Ziea ET, Chan CS, Wu JC, Sung JJ: Epidemiology of irritable bowel syndrome in Chinese. Dig Dis Sci. 2002, 47: 2621-2624. 10.1023/A:1020549118299.
Kwan AC, Hu WH, Chan YK, Yeung YW, Lai TS, Yuen H: Prevalence of irritable bowel syndrome in Hong Kong. J Gastroenterol Hepatol. 2002, 17: 1180-1186. 10.1046/j.1440-1746.2002.02871.x.
Dai N, Cong Y, Yuan H: Prevalence of irritable bowel syndrome among undergraduates in Southeast China. Dig Liver Dis. 2008, 40: 418-424. 10.1016/j.dld.2008.01.019.
Xiong LS, Chen MH, Chen HX, Xu AG, Wang WA, Hu PJ: A population-based epidemiologic study of irritable bowel syndrome in South China: stratified randomized study by cluster sampling. Aliment Pharmacol Ther. 2004, 19: 1217-1224. 10.1111/j.1365-2036.2004.01939.x.
Jeong JJ, Choi MG, Cho YS, Lee SG, Oh JH, Park JM, Cho YK, Lee IS, Kim SW, Han SW, Choi KY, Chung IS: Chronic gastrointestinal symptoms and quality of life in the Korean population. World J Gastroenterol. 2008, 14: 6388-6394. 10.3748/wjg.14.6388.
Shi R, Xu S, Zhang H, Ding Y, Sun G, Huang X, Chen X, Li X, Yan Z, Zhang G: Prevalence and risk factors for Helicobacter pylori infection in Chinese populations. Helicobacter. 2008, 13: 157-165. 10.1111/j.1523-5378.2008.00586.x.
Sung JJY, Kuipers EJ, El-Serag HB: Systematic review: the global incidence and prevalence of peptic ulcer disease. Aliment Pharmacol Ther. 2009, 29: 938-946. 10.1111/j.1365-2036.2009.03960.x.
Kusters JG, van Vliet AH, Kuipers EJ: Pathogenesis of Helicobacter pylori infection. Clin Microbiol Rev. 2006, 19: 449-490. 10.1128/CMR.00054-05.
Fuccio L, Laterza L, Zagari RM, Cennamo V, Grilli D, Bazzoli F: Treatment of Helicobacter pylori infection. BMJ. 2008, 337: a1454-10.1136/bmj.a1454.
Wu JC: Gastroesophageal reflux disease: an Asian perspective. J Gastroenterol Hepatol. 2008, 23: 1785-1793. 10.1111/j.1440-1746.2008.05684.x.
Kim N, Lim SH, Lee KH: No protective role of Helicobacter pylori in the pathogenesis of reflux esophagitis in patients with duodenal or benign gastric ulcer in Korea. Dig Dis Sci. 2001, 46: 2724-2732. 10.1023/A:1012783630913.
Yamamori K, Fujiwara Y, Shiba M, Watanabe T, Tominaga K, Oshitani N, Matsumoto T, Higuchi K, Arakawa T: Prevalence of symptomatic gastro-oesophageal reflux disease in Japanese patients with peptic ulcer disease after eradication of Helicobacter pylori infection. Aliment Pharmacol Ther. 2004, 20 (Suppl 1): 107-111. 10.1111/j.1365-2036.2004.01973.x.
Cao Y, Yan X, Ma X, Wang R, Fu Z, Johansson S, Wallander M-A, He J: Validation of a survey methodology for gastroesophageal reflux disease in China. BMC Gastroenterol. 2008, 8: 37-10.1186/1471-230X-8-37.
[Administration division of Shanghai]. [http://www.xzqh.org/quhua/31sh/]
The second national agricultural census of Beijing net. [http://www.bjstats.gov.cn/nypc/pcdt/pcxw/200804/t20080416_110349.htm]
[Administration Division of Xi'an]. [http://www.xzqh.org/quhua/61sx/01xi'an.htm]
[Administration division of Wuhan]. [http://www.xzqh.org/quhua/42hb/01wuhan.htm]
Zhenyu H, Ying H, Min C: [Implement the point of view of scientific development to promote the development of the modernization of Wuhan]. J Wuhan Commer Serv Coll. 2006, 20: 32-37.
[Administration division of Guangzhou]. [http://www.xzqh.org/quhua/44gd/01guangzhou.htm]
The fifth population census in China. [http://www.stats.gov.cn/tjsj/ndsj/2006/indexch.htm]
1% sample survey data held in 2005 by government in Shanghai. [http://www.stats-sh.gov.cn/english/index.htm]
1% sample survey data held in 2005 by government in Wuhan. [http://www.whtj.gov.cn/documents/tjnj2006/2/2-5.htm]
1% sample survey data held in 2005 by government in Beijing. [http://www.bjstats.gov.cn/tjnj/2006-tjnj/content/mV11_03-11.htm]
NBSC: 1% sample survey data held in 2005 by government in Guangdong. 2007, Beijing: National Bureau of Statistics of China
Xi'an census 2000. [http://www.stats.gov.cn/tjsj/ndsj/renkoupucha/2000fenxian/htm/table2.htm]
Shaw MJ, Talley NJ, Beebe TJ, Rockwood T, Carlsson R, Adlis S, Fendrick AM, Jones R, Dent J, Bytzer P: Initial validation of a diagnostic questionnaire for gastroesophageal reflux disease. Am J Gastroenterol. 2001, 96: 52-57. 10.1111/j.1572-0241.2001.03451.x.
Drossman DA: The functional gastrointestinal disorders and the Rome II process. Gut. 1999, 45 (Suppl 2): 1-5.
Ware JE, Sherbourne CD: The MOS 36-item short-form health survey (SF-36). I. Conceptual framework and item selection. Med Care. 1992, 30: 473-483. 10.1097/00005650-199206000-00002.
Johns MW: A new method for measuring daytime sleepiness: the Epworth sleepiness scale. Sleep. 1991, 14: 540-545.
Lundell LR, Dent J, Bennett JR, Blum AL, Armstrong D, Galmiche JP, Johnson F, Hongo M, Richter JE, Spechler SJ, Tytgat GN, Wallin L: Endoscopic assessment of oesophagitis: clinical and functional correlates and further validation of the Los Angeles classification. Gut. 1999, 45: 172-180.
Sharma P, Dent J, Armstrong D, Bergman JJ, Gossner L, Hoshihara Y, Jankowski JA, Junghard O, Lundell L, Tytgat GN, Vieth M: The development and validation of an endoscopic grading system for Barrett's esophagus: the Prague C & M criteria. Gastroenterology. 2006, 131: 1392-1399. 10.1053/j.gastro.2006.08.032.
Fiocca R, Mastracci L, Riddell R, Takubo K, Vieth M, Yerian L, Sharma P, Fernström P, Ruth M: Development and validation of criteria for the recognition of histological lesions in patients with GERD. Gut. 2008, 57 (Suppl 2): A37-
Dent J, Vakil N, Jones R, Reimitz P, Schöning U, Halling K, Junghard O, Lind T: Validation of the reflux disease questionnaire for the diagnosis of gastroesophageal reflux disease in primary care. Gut. 2007, 56 (Suppl 3): A75-
Veldhuyzen van Zanten S, Armstrong D, Barkun A, Junghard O, White RJ, Wiklund IK: Symptom overlap in patients with upper gastrointestinal complaints in the Canadian confirmatory acid suppression test (CAST) study: further psychometric validation of the reflux disease questionnaire. Aliment Pharmacol Ther. 2007, 25: 1087-1097.
Johnsson F, Hatlebakk J, Klintenberg AC, Román J: The symptom relieving effect of esomeprazole 40 mg daily in patients with heartburn. Gastroenterology. 2001, 120 (Suppl): A437-
Chinese GERD Study Group: Value of reflux diagnostic questionnaire in the diagnosis of gastroesophageal reflux disease. Chin J Dig Dis. 2004, 5: 51-55. 10.1111/j.1443-9573.2004.00155.x.
Kwan AC, Bao TN, Chakkaphak S, Chang FY, Ke MY, Law NM, Leelakusolvong S, Luo JY, Manan C, Park HJ, Piyaniran W, Qureshi A, Long T, Xu GM, Xu L, Yuen H: Validation of Rome II criteria for functional gastrointestinal disorders by factor analysis of symptoms in Asian patient sample. J Gastroenterol Hepatol. 2003, 18: 796-802. 10.1046/j.1440-1746.2003.03081.x.
Shinozaki M, Kanazawa M, Sagami Y, Endo Y, Hongo M, Drossman DA, Whitehead WE, Fukudo S: Validation of the Japanese version of the Rome II modular questionnaire and irritable bowel syndrome severity index. J Gastroenterol. 2006, 41: 491-494. 10.1007/s00535-006-1799-9.
Whitehead WE, Bassotti G, Palsson O, Taub E, Cook EC, Drossman DA: Factor analysis of bowel symptoms in US and Italian populations. Dig Liver Dis. 2003, 35: 774-783. 10.1016/S1590-8658(03)00456-0.
Li L, Wang HM, Shen Y: Chinese SF-36 health survey: translation, cultural adaptation, validation, and normalisation. J Epidemiol Community Health. 2003, 57: 259-263. 10.1136/jech.57.4.259.
Ware JE, Kosinski M, Gandek B, Aaronson NK, Apolone G, Bech P, Brazier J, Bullinger M, Kaasa S, Leplege A, Prieto L, Sullivan M: The factor structure of the SF-36 Health Survey in 10 countries: results from the IQOLA Project. International Quality of Life Assessment. J Clin Epidemiol. 1998, 51: 1159-1165. 10.1016/S0895-4356(98)00107-3.
Ware JE, Gandek B, Kosinski M, Aaronson NK, Apolone G, Brazier J, Bullinger M, Kaasa S, Leplege A, Prieto L, Sullivan M, Thunedborg K: The equivalence of SF-36 summary health scores estimated using standard and country-specific algorithms in 10 countries: results from the IQOLA Project. International Quality of Life Assessment. J Clin Epidemiol. 1998, 51: 1167-1170. 10.1016/S0895-4356(98)00108-5.
Johns MW: Reliability and factor analysis of the Epworth sleepiness scale. Sleep. 1992, 15: 376-381.
Chung KF: Use of the Epworth Sleepiness Scale in Chinese patients with obstructive sleep apnea and normal hospital employees. J Psychosom Res. 2000, 49: 367-372. 10.1016/S0022-3999(00)00186-0.
Data entry, data management and basic statistical analysis system. [http://www.epidata.dk]
Ware JE, Snow KK, Kosinski M, Gandek B: SF-36 health survey manual and interpretation guide. 1993, Boston: New England Medical Center, The Health Institute
McHorney CA, Ware JE, Lu JF, Sherbourne CD: The MOS 36-item short-form health survey (SF-36): III. Tests of data quality, scaling assumptions, and reliability across diverse patient groups. Med Care. 1994, 32: 40-66. 10.1097/00005650-199401000-00004.
Gandek B, Ware JE, Aaronson NK, Alonso J, Apolone G, Bjorner J, Brazier J, Bullinger M, Fukuhara S, Kaasa S, Leplege A, Sullivan M: Tests of data quality, scaling assumptions, and reliability of the SF-36 in eleven countries: results from the IQOLA Project. International Quality of Life Assessment. J Clin Epidemiol. 1998, 51: 1149-1158. 10.1016/S0895-4356(98)00106-1.
Cronbach LJ: Coefficient alpha and the internal structure of tests. Psychometrika. 1951, 16: 297-334. 10.1007/BF02310555.
Fang J: Medical statistics and computer experiments. 2005, Singapore: Stallion Press
Hinkle DE, Jurs SG, Wiersma W: Applied statistics for the behavioural sciences. 1988, Boston: Houghton Mifflin, 2
World Health Organization: Appropriate body-mass index for Asian populations and its implications for policy and intervention strategies. Lancet. 2004, 363: 157-163. 10.1016/S0140-6736(03)15268-3.
Storskrubb T, Aro P, Ronkainen J, Sipponen P, Nyhlin H, Talley NJ, Engstrand L, Stolte M, Vieth M, Walker M, Agreus L: Serum biomarkers provide an accurate method for diagnosis of atrophic gastritis in a general population: The Kalixanda study. Scand J Gastroenterol. 2008, 43: 1448-1455. 10.1080/00365520802273025.
Thompson WG, Longstreth GF, Drossman DA, Heaton KW, Irvine EJ, Muller-Lissner SA: Functional bowel disorders and functional abdominal pain. Gut. 1999, 45 (Suppl 2): II43-47.
Drossman DA, Corazziari E, Talley N, Thompson WG, Whitehead WE: Rome II: A multinational consensus document on functional gastrointestinal disorders. Gut. 1999, 45 (Suppl 2): II1-II81.
China sees soaring migrant population. [http://www.chinagate.cn/english/news/49142.htm]
Shaw M, Dent J, Beebe T, Junghard O, Wiklund I, Lind T, Johnsson F: The Reflux Disease Questionnaire: a measure for assessment of treatment response in clinical trials. Health Qual Life Outcomes. 2008, 6: 31-36. 10.1186/1477-7525-6-31.
Chang DF, Chun CA, Takeuchi DT, Shen H: SF-36 health survey: tests of data quality, scaling assumptions, and reliability in a community sample of Chinese Americans. Med Care. 2000, 38: 542-548. 10.1097/00005650-200005000-00010.
Ren XS, Amick B, Zhou L, Gandek B: Translation and psychometric evaluation of a Chinese version of the SF-36 Health Survey in the United States. J Clin Epidemiol. 1998, 51: 1129-1138. 10.1016/S0895-4356(98)00104-8.
Tseng HM, Lu JF, Gandek B: Cultural issues in using the SF-36 Health Survey in Asia: results from Taiwan. Health Qual Life Outcomes. 2003, 1: 72-10.1186/1477-7525-1-72.
Lam CL, Gandek B, Ren XS, Chan MS: Tests of scaling assumptions and construct validity of the Chinese (HK) version of the SF-36 Health Survey. J Clin Epidemiol. 1998, 51: 1139-1147. 10.1016/S0895-4356(98)00105-X.
China statistical yearbook. [http://www.stats.gov.cn/tjsj/ndsj/2008/indexeh.htm]
The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-230X/9/86/prepub
This study was supported by AstraZeneca R&D, Mölndal, Sweden. We thank the participants for their collaboration. We thank Dr Christopher Winchester from Oxford PharmaGenesis™ Limited, who provided writing services on behalf of AstraZeneca. We also thank Dr Benjamin Wong and Dr Xiaohua Jin for translating the questionnaires used in this study.
X. Yan, R. Wang, Y. Zhao, X. Ma, J. Fang, H. Yan, X. Kang, P. Yin, Y. Hao, Q. Li and D. Zou declare that they have no competing interests. J. He has served as the Director of the Department of Health Statistics, Second Military Medical University and has received research funding from AstraZeneca. J. Dent has served as a speaker, a consultant and an advisory board member for AstraZeneca, and has received research funding from AstraZeneca. J. Sung has served as a speaker, a consultant and an advisory board member for AstraZeneca, and has received research funding from AstraZeneca. S. Johansson is an employee of AstraZeneca. K. Halling and W. Liu were employees of AstraZeneca at the time the study was conducted; K. Halling is currently employed by PRO Consulting and W. Liu by Genzyme. The study was funded by AstraZeneca R&D, Mölndal, Sweden. AstraZeneca had no role to play in the content and conduct of the study. Writing support was provided by C. Winchester of Oxford PharmaGenesis™ Ltd and funded by AstraZeneca.
XY, RW, YZ, SJ, KH, WL, JH, JD and JS made substantial contributions to the conception and design of the study, XY, RW, YZ, XM, JF, HY, XK, PY, QL, YH, DZ, WL and JH participated in data collection, and XY, RW, YZ, SJ, KH, JH, JD and JS analysed and interpreted the data. All authors have been involved in critically revising the manuscript for intellectual content, and have given final approval of the version to be published.
Xiaoyan Yan, Rui Wang, Yanfang Zhao contributed equally to this work.
About this article
Cite this article
Yan, X., Wang, R., Zhao, Y. et al. Systematic investigation of gastrointestinal diseases in China (SILC): validation of survey methodology. BMC Gastroenterol 9, 86 (2009). https://doi.org/10.1186/1471-230X-9-86