Research in Autism Spectrum Disorders 7 (2013) 579–590
Contents lists available at SciVerse ScienceDirect
Research in Autism Spectrum Disorders
Jo urnal homepage: http://ees.elsevier.com/RASD/default.asp
What is available for case identification in autism research in mainland China?
a,b, b b c b
Xiang Sun *, Carrie Allison , Bonnie Auyeung , Fiona E. Matthews , Simon Baron-Cohen , a
Carol Brayne
a
Cambridge Institute of Public Health, Department of Public Health and Primary Care, University of Cambridge, Forvie Site, Robinson Way, CB2 0SR, UK
b
Autism Research Centre, Department of Psychiatry, University of Cambridge, Douglas House, 18b Trumpington Road, CB2 2AH, UK
c
MRC Biostatistics Unit, Cambridge Institute of Public Health, Forvie Site, Robinson Way, CB2 0SR, UK
A R T I C L E I N F O A B S T R A C T
Little is known about research on Autism Spectrum Conditions (ASC) in mainland China.
Article history:
Received 22 August 2012 The few available studies in mainland China have shown the screening and diagnostic
Received in revised form 26 November 2012 instruments for ASC used in mainland China were different from the West. Literature on
Accepted 27 November 2012 screening and diagnostic instruments and criteria were reviewed and current available
instruments were identified and evaluated. Eight screening instruments and two
Keywords: diagnostic instruments were identified. The Clancy Autism Behaviour Scale (CABS), the
Autism
Autism Behaviour Checklist (ABC) and the Childhood Autism Rating Scale (CARS) were the
Screening instrument
most frequently used instruments in mainland China. They were adopted from the West
Diagnostic instrument
more than two decades ago for detecting individuals with Childhood Autism but not the
Mainland China
whole autism spectrum. Standardised instruments need to be validated and adopted into
autism research in mainland China.
ß 2012 Elsevier Ltd. All rights reserved.
1. Introduction
The term Autism Spectrum Conditions (ASC) is used to describe a group of childhood-onset neurodevelopmental
disorders characterised by impairments in social interaction and communication, and the presence of repetitive and
stereotyped behaviours, interests and activities (World Health Organisation, 1993). Recent epidemiological studies reported
that the prevalence estimate of ASC was around 1 percent of the general population, which is much higher than earlier
studies indicated (Fombonne, 2009). As the prevalence estimates increase, the detection of ASC has drawn more attention
worldwide than before (Baird et al., 2001; Barbaro & Dissanayake, 2010; Rice et al., 2007).
Recent epidemiological research has adopted a two-stage process for case identification in the general population
(Baron-Cohen et al., 2009; Kim et al., 2011). The first stage is screening and the second stage is diagnostic assessment. The
aim of screening is to identify and distinguish individuals who are more likely to have the condition of interest from those
who are less likely to have it (Webb, 2005). Screening instruments are used to detect the condition before the usual time
of diagnosis. In terms of ASC, the screening instruments are usually checklists with questions related to possible autistic
features. Individuals with positive screening results are then referred for more standard diagnosis (Porta, 2008).
Screening for autism has been proposed by many researchers (Baird et al., 2001; Duby & Johnson, 2009). Different
* Corresponding author at: Cambridge Institute of Public Health, Department of Public Health and Primary Care, University of Cambridge, Forvie Site,
Robinson Way, University of Cambridge, CB2 0SR, UK. Tel.: +44 01223 763833; fax: +44 01223 330300.
E-mail address: [email protected] (X. Sun).
1750-9467/$ – see front matter ß 2012 Elsevier Ltd. All rights reserved. http://dx.doi.org/10.1016/j.rasd.2012.11.003
580 X. Sun et al. / Research in Autism Spectrum Disorders 7 (2013) 579–590
screening instruments have been developed to detect potential cases in specific age ranges and/or for specific subtypes
on the spectrum (Norris & Lecavalier, 2010; Vostanis, Smith, Chung, & Corbett, 1994).
In terms of diagnostic assessment, as there has been no absolute test for ASC, because no clear biomarkers are available,
diagnostic instruments for ASC largely depend on the information from parents and direct observation of the affected child.
A number of diagnostic instruments have been developed, such as the Autism Diagnostic Observational Schedule (Lord,
Rutter, DiLavore, & Risi, 2001), the Diagnostic Interview for Social and Communication Disorder (Wing, Leekam, Libby,
Gould, & Larcombe, 2002), the Autism Diagnostic Interview Revised (Rutter, LeCouteur, & Lord, 2003) and the
developmental, dimensional and diagnostic interview (Skuse et al., 2004). The combination of the ADOS and the ADI-R has
been used more frequently in recent large-scale epidemiological research for case confirmation of ASC (Levy, Mandell, &
Schultz, 2009).
There has been limited research on the current situation of ASC in mainland China, which refers to 23 provinces and 5
autonomous regions, excluding Hong Kong and Macao. A previous review reported that the prevalence of Childhood Autism
in mainland China was 10.3 per 10,000, which was much lower than Western estimates (Sun & Allison, 2009). In most
previous Chinese studies, the Clancy Autism Behaviours Scale (CABS) and the Autism Behaviour Checklist (ABC) were used as
screening instruments, and the Childhood Autism Rating Scale (CARS) was used as diagnostic instrument. These instruments
are not frequently used in prevalence studies in the West, which suggested that the screening and diagnostic instruments
used in mainland China are different from those used in Western studies (Sun & Allison, 2009). This paper sets out to
investigate the utility of instruments for ASC that have been applied to the Chinese population in mainland China. Learning
from what is available can help to decide whether it is necessary to introduce and adopt more recent and advanced
instruments for autism research to mainland China.
This paper has three objectives: to identify the instruments and criteria that have been used for case identification for ASC
in mainland China; to report the validity and reliability of these instruments; to summarise the current methodology and
propose directions for future research in ASC in mainland China.
2. Methods
2.1. Literature searches
Systematic literature searches were conducted using four databases from the establishment year of each database to
2011 for publications related to screening and diagnostic instruments for ASC in mainland China (Figs. 1 and 2). The two
English databases were PubMed and Web of Knowledge. The other two were Chinese databases, including China Web of
Knowledge and Weipu. The search methodology used in the English and Chinese databases was not identical. In the English
databases systematic searching was conducted using three steps: Step 1 used the terminology of conditions such as ‘autism’
Fig. 1. Search strategy for identifying validation studies in English databases.
X. Sun et al. / Research in Autism Spectrum Disorders 7 (2013) 579–590 581
Search 1:
th
Weipu database (searched on 27 December 2011)
Yea rs (1989-2011)
Step 1: “Gu Du Zheng (Autism)”/ [Key words] OR “Zi Bi Zheng (Autism)/ [Key
words] OR “Gudu Zheng Pu Xi Zhang Ai (Autism Spectrum)”/ [Key words] OR
“Gu Fan Xing Fa Yu Zhang Ai (Pervasive developmental disorder)”/all [Key
words] OR “Ai Si Be Ge (Asperger)”/ [Key words]
Step 2: “Validation”/ [Key words] OR “Screen”/ [Key words] OR “Screening test”
OR “Validity”/ [Key words] or “Reliability”/ [Key words] AND results in Step 1
th
China Web of Knowledge (searched on 27 December 2011)
Yea r (1989-2011)
Step 1: “Gu Du Zheng (Autism)”/ [Key words] OR “Zi Bi Zheng (Autism)/ [Key
words] OR “Gudu Zheng Pu Xi Zhang Ai (Autism Spectrum)”/ [Key words] OR
“Gu Fan Xing Fa Yu Zhang Ai (Pervasive developmental disorder)”/all [Key
words] OR “Ai Si Be Ge (Asperger)”/ [Key words]
Step 2: “Validation”/ [Key words] OR “Screen”/ [Key words] OR “Screening test”
OR “Validity”/ [Key words] or “Reliability”/ [Key words] AND results in Step 1
Search 2:
th
Weipu database/China Web of Knowledge (searched on 27 December 2011)
Yea r (1989-2011)
“Gu Du Zheng (Autism)”/ [Key words] OR “Zi Bi Zheng (Autism)”/ [Key words]
Fig. 2. Search strategy for identifying validation studies in Chinese databases.
or ‘autism spectrum’; Step 2 defined the study region as ‘China’; and Step 3 was specific to screening or validation studies. In
the Chinese databases no study region was defined for searching in Search 1; in Search 2, only the Chinese terminology of
‘autism’ was used for a broader search in order to capture the studies that might have been missed by Search 1.
2.2. Study selection
The papers identified from the systematic searches were examined against the following inclusion criteria. (1) The study
should focus on a screening tool, diagnostic tool or diagnostic criteria. (2) The sample should be population based or clinically
based with a clear sample size. (3) The reference diagnostic method should be clearly indicated in terms of diagnostic
instruments, criteria or clinical judgement. (4) The validity or reliability of the tool should be provided. When it was not clear
whether or not the paper should be included, the paper itself was examined when possible.
2.3. Data extraction
After excluding the duplications using Reference Manager, the following variables were extracted from the identified
papers: screening instrument, cut-off of instrument, sample characteristics, diagnostic instrument, diagnostic criteria,
reliability, and validity of the instruments.
3. Results
3.1. Studies identified
The results of the first search in each database were as follows: three papers from PubMed; 12 papers from Web of
Knowledge; 44 papers from Weipu; and 14 papers from China Web of Knowledge. After removing duplicates (n = 18),
abstracts were reviewed. The papers (n = 38) identified in the four databases were excluded from this review for the
following reasons: (1) the study sample was not from the population in mainland China (n = 6); (2) the study did not focus on
either screening or diagnostic instrument or diagnostic criteria (n = 29); (3) the study did not provide validity or reliability of
the instrument (n = 3). One paper from the English databases and 18 papers from the Chinese databases met the inclusion
criteria. After the second search of the Chinese databases, another three papers were identified, which had not been
identified in the first search either in the English or the Chinese databases. In total 22 studies on 9 screening instruments and
2 diagnostic instruments were included for review (Fig. 3). Fifteen studies focused on screening instruments, six studies were
on diagnostic instruments, and four studies were on diagnostic criteria. Three studies investigated more than one topic. A
summary of the studies and their characteristics is provided in Appendix A.
582 X. Sun et al. / Research in Autism Spectrum Disorders 7 (2013) 579–590
Fig. 3. Search results of studies on screening and diagnostic instruments.
3.2. Screening instruments
The screening instruments that have been applied to the Chinese population in mainland China include: Autism
Behaviour Checklist (ABC); Clancy Autism Behaviour Scale (CABS); Checklist for Autism in Toddlers-23 (CHAT-23); Modified
Checklist of Autism in Toddlers (M-CHAT); Autism Spectrum Screening Questionnaire (ASSQ); Flinders Observational
Schedule of Pre-verbal Autistic Characteristics-Revised (FOSPAC-R); Waterville Autistic Behaviour Scales (WABS); Early
Autism Screening Items (EASI), and Pictorial Autism Screening Scale for Infant and Toddler (PASS-IT).
3.2.1. Autism Behaviour Checklist
The ABC was developed by Krug, Arick, and Almond (1980). It was designed to be applied to individuals aged between 18
months and 35 years (Krug et al., 1980; Miranda-Linne & Melin, 2002). The ABC has 57 items, with each item rating between
1 and 4. The cut-off of the ABC is 53 (53). It can be administrated by the clinicians and can also be filled in by the parents
without prior training (Williams, 2003).
X. Sun et al. / Research in Autism Spectrum Disorders 7 (2013) 579–590 583
The ABC was first introduced into China for autism screening and diagnosis in 1989 (Tao, 1999; Wang, Wang, & Shen,
2003). The Chinese version of the ABC contains 57 items, including 10 items for tapping sensory behaviours (total score 30),
11 items for relationships (total score 35), 12 items for the use of physical development (total score 28), 13 items for
language (total score 31), and 11 items for daily living skills (total score 25).
Seven studies examined the utility of the ABC in Chinese samples. All seven studies were conducted within clinical
settings. During clinical referral, the children were first given a diagnosis of Childhood Autism, and the parents then
filled in the ABC. Four of them used the ABC in both children with Childhood Autism and children without autism (Li,
Zhong, Cai, Chen, & Zhou, 2005a; Lu & Yang, 2004; Yang, Huang, Jia, & Chen, 1993), while the other three only examined
the ABC in children with Childhood Autism (Wang, Wang, & Wang, 2003; Wang, Wang, & Shen, 2003; Yin, Chen, Luo, &
Li, 2011). The age of studied samples ranged from 1.5 to 16 years. Three diagnostic criteria were used for case
identification, including the International Statistical Classification of Disease and Related Health Problems, 10th edition,
(World Health Organisation, 1993) the Diagnostic and Statistical Manual of Mental Disorders, 3rd/4th edition (American
Psychiatric Association, 1980, 1994) and the Chinese Classification of Mental Disorders, 3rd edition (Chinese Society of
Psychiatry, 2001). The percentage agreement between the ABC and diagnostic criteria was calculated in six studies to
examine the utility of the ABC. The percentage agreement was the number of children who scored above the cut-off of
the screening instrument divided by the number of cases recruited. Four studies reported a high percentage agreement
(80–86.3%) between the ABC and ICD-10 (Lu & Yang, 2004; Wang, Zai, & Zhen, 2002; Wang, Wang, & Wang, 2003;
Wang, Wang, & Shen, 2003). However, the sensitivity and specificity were not provided in these four studies. Two
validation studies reported the validity of the ABC (Li et al., 2005a; Yang et al., 1993). The first validation study of the
ABC recommended a cut-off of 31 for screening (Yang et al., 1993), which reported a sensitivity of 100 percent and a
specificity of 100 percent. Using the cut-off of 31, the other validation study reported a sensitivity of 97 percent and a
specificity of 89 percent (Li et al., 2005a). The cut-offs adopted in these studies were different. Three studies adopted
53, two adopted 56, and two adopted 31. Studies also recommended different scores (50–72) on the ABC as cut-offs for
the diagnosis of Childhood Autism. One study reported that the ABC had good internal consistency in 178 children with
autism (Cronbach’s a = 0.81), and the percentage agreement between the ABC and the CCMD-3 was 89 percent (Yin
et al., 2011).
3.2.2. Clancy Autism Behaviour Scale
The CABS was first published in 1969 (Clancy, Dugdale, & Rendle-Short, 1969). However, no literature was found focusing
on the utility of the CABS in Western populations. The Chinese version of the CABS was designed to be completed by parents.
It contains 14 items, with each item rated on three frequency levels, including ‘Never (score 0)’, ‘Occasionally (score 1)’ and
‘Often (score 2)’. If the child scores equal to or higher than 14, and less than 3 items are scored as ‘Never’, and more than 6
items as ‘Often’, the child should be considered as screening positive for Childhood Autism.
Six studies were identified, of which four met the inclusion criteria. The other two were excluded because they did not
report either validity or reliability. All four studies were conducted in clinical settings (Li et al., 2005a; Li, Zhong, Cai, Chen, &
Zhou, 2005b; Wang, Wang, & Wang, 2003; Wang, Wang, & Shen, 2003). Two studies recruited both children with autism and
children without autism (Li et al., 2005a; Zhang, 2006). The study samples were aged between 1.5 and 8 years. The ICD-10
and DSM-IV were used as diagnostic criteria. Two studies reported high agreement (95%) between the CABS and ICD-10
(Wang, Wang, & Wang, 2003; Wang, Wang, & Shen, 2003). The agreement between the CABS and DSM-IV was 87 percent (Li
et al., 2005a). One study used clinical judgement for case confirmation without specifying the diagnostic criteria (Zhang,
2006). Two studies adopted 14 as a cut-off for the CABS (Wang, Wang, & Wang, 2003) while one study adopted a cut-off of 6
(Li et al., 2005a). The validity of CABS was reported in two studies (Li et al., 2005a; Zhang, 2006), and both suggested a good
validity of the CABS (sensitivity: 88%, 91%; specificity: 82%).
3.2.3. Checklist for Autism in Toddlers and its modified version
The CHAT includes nine questions for parents and five observations of the child (Baron-Cohen, Allen, & Gillberg, 1992). It
takes between 10 and 15 min to complete. The applicable age for the CHAT as a screening instrument is 18 months (Baron-
Cohen et al., 1992). It has been thoroughly validated and evaluated in Western populations (Baron-Cohen et al., 2000). The
CHAT has been adopted by American researchers and was modified as the M-CHAT (Dumont-Mathieu & Fein, 2005; Yama,
Freeman, Graves, Yuan, & Karen, 2012). Researchers in Hong Kong developed a screening instrument, the CHAT-23, for
screening in toddlers by combining the M-CHAT (23 questions) with graded scores and the observational Section B of the
CHAT (Wong et al., 2004). To be labelled positive on the CHAT-23, the child should meet one of the following criteria: first,
individual scores should be positive on more than 6 out of the 23 items; second, of the 7 core items on the CHAT-23, the
individual scores should be positive on more than 2 items.
So far, only one study has investigated the utility of the CHAT-23 (Wu et al., 2010) and two studies reported the
utility of the M-CHAT in Chinese samples (Gong et al., 2011; Zhang, Tan, Xiao, Zhang, & Zhang, 2011). All three studies
were conducted in children of no older than three years using DSM-IV as the diagnostic criteria. The sensitivity of the
CHAT-23 was 94 percent and specificity was 88 percent when applied to the general population (Wu et al., 2010). This
study indicated that the validity of CHAT-23 in this sample was higher than in the first study in Hong Kong. It
suggested that this might have been due to the relatively small size of the case group (n = 51) compared with the
control group (n = 482) in this study. Two studies on the M-CHAT were conducted in clinical samples. However, using
584 X. Sun et al. / Research in Autism Spectrum Disorders 7 (2013) 579–590
similar methodology, two studies reported quite different results on test validity of the M-CHAT. One study reported
that sensitivity was 96 percent, and specificity was 60 percent (Gong et al., 2011), while the other reported that
sensitivity was 72 percent and specificity was 95 percent (Zhang, 2006).
3.2.4. Autism Spectrum Screening Questionnaire
The Autism Spectrum Screening Questionnaire (ASSQ) was originally developed as a screening tool to detect Asperger’s
syndrome in the general population of school-aged children by Ehlers and Gillberg (Ehlers & Gillberg, 1993; Ehlers, Gillberg,
& Wing, 1999). The applicable age range of ASSQ is between 7 and 16 years (Ehlers et al., 1999). It was developed according to
the diagnostic criteria for ASC in ICD-10 and DSM-IV (Kopp & Gillberg, 2011; Mattila et al., 2009). The ASSQ contains 27 items
and each item was rated on a 3-point scale as follows: 0 indicating normal, 1 indicating maybe abnormal, and 2 indicating
definite abnormality (Mattila et al., 2009). The score of the ASSQ ranges from 0 to 54 with a recommended cut-off at 13. The
ASSQ takes only 10 min to complete. The sensitivity of the parent’s version of the ASSQ was 91 percent and specificity was 77
percent (Ehlers et al., 1999).
One validation study of the ASSQ in the Chinese population was identified. The ASSQ was applied to children with ASC
(n = 94) and typically developing children (n = 120), as well as children with other conditions (n = 71) (Guo et al., 2011).
Using a cut-off of 12 (12), the sensitivity of the ASSQ was 96 percent and specificity was 83 percent. This study
examined the performance of the ASSQ using the Receiver Operating Characteristic (ROC), which reported an area under
curve (AUC) of the ASSQ of 0.957 (Guo et al., 2011). No validation of the ASSQ was conducted in the general Chinese
population.
3.2.5. Flinders Observational Schedule of Pre-verbal Autistic Characteristics-Revised
The Flinders Observational Schedule of Pre-verbal Autistic Characteristics-Revised (FOSPAC-R) was developed by
researchers at Flinders University Australia (Yong, 2001). It was designed for the detection of Childhood Autism as early as 12
months old. The FOSPAC-R contains 17 items and rates on a 3-level scale, including normal behaviours, between normal and
abnormal behaviours, and abnormal behaviours. It takes between 15 and 20 min to complete. Only one study was identified
that validated the FOSPAC-R in clinical settings. The sample contained 27 children with Childhood Autism, 27 typically
developing children, and 27 children with mental retardation. This study first applied the FOSPAC-R to the study sample, and
the test positives were then further examined using the CHAT and diagnosed using DSM-IV by clinicians. Using a cut-off of
14, the validity of the FOSPAC-R for distinguishing children with Childhood Autism from the others was good
(sensitivity = 96%, specificity = 93%).
3.2.6. Waterville Autistic Behaviour Scales
The Waterville Autistic Behaviours Scales (WABS) was developed from the diagnostic criteria of ASC in DSM-IV and ICD-
10 (Song, Fang, & Sun, 2009). It contains four domains, including social behaviours, language and communication,
behaviours, and interests, as well as movement and cognition. One study investigated the validity of the WABS within
children having Childhood Autism, children with mental disorder and typically developing children (Song et al., 2009). This
study recommended a cut-off of 51.5 and reported a sensitivity of 100 percent and specificity of 97 percent.
3.2.7. Early Autism Screening Items
The Early Autism Screening Items (EASI) was developed by an American researcher, Larry Burd, and has not yet been
validated in the West (Zhang et al., 2011). The validation of the Chinese EASI was the first validation study since it was
developed in the West. The Chinese EASI contains 46 items. Each item is scored between 0 and 2 according to the frequency
and degree of autistic behaviours. Never or seldom is indicated by 0, 1 indicates sometimes, and 2 indicates often. A total
score of less than 40 is recommended as the cut-off. The study sample consisted of 25 children with ASC and 400 typically
developing children. The cases were previously diagnosed by clinicians using DSM-IV. The sensitivity of EASI was 80
percent and specificity was 75 percent (Zhang et al., 2011). The percentage agreement between the EASI and DSM-IV was 75
percent.
3.2.8. Pictorial Autism Screening Scale for Infant and Toddler
The PASS-IT was also developed by Larry Burd, and consists of 25 pictures of different facial expressions or gestures. The
pictures were designed to be shown to parents or caregivers to recall whether or not their child exhibited the expressions or
gestures shown in the picture. The parents were asked to give an answer of yes or no with a score of 1 and 0 respectively. A
total score of less than 20 (20) was recommended as a cut-off. In the validation study of the EASI, the PASS-IT was applied to
the same sample. The sensitivity of PASS-IT was 88 percent and specificity was 64 percent (Zhang et al., 2011). The
agreement between the PASS-IT and DSM-IV was 66 percent.
3.3. Diagnostic instruments
Studies on two diagnostic instruments were identified: Childhood Autism Rating Scale (CARS) and Autism Diagnostic
Interview (ADI). The Autism Diagnostic Observation Schedule (ADOS) was only introduced without any validation studies.
X. Sun et al. / Research in Autism Spectrum Disorders 7 (2013) 579–590 585
3.3.1. Childhood Autism Rating Scale
The Childhood Autism Rating Scale (CARS) was developed by Schopler, Reichler, and Renner (1988). It is a behaviour and
observation scale that is considered especially effective for differentiating children with mild, moderate and severe autism,
as well as discriminating between children with autism and those with mental retardation (Eaves & Milner, 1993; Magyar &
Pandolfi, 2007; Pereira, Riesgo, & Wagner, 2008; Stella, Mundy, & Tuchman, 1999). It includes a wide range of diagnostic
criteria and evaluates the child’s behaviours on 15 dimensions (Perry, Condillac, Freeman, Dunn-Geier, & Belair, 2005). Many
studies have reported relatively high reliability of the CARS (Garfin, McCallon, & Cox, 1988; Sevin, Matson, Coe, Fee, & Sevin,
1991).
The Chinese version of the CARS contains 15 items. The rating ranges between 1 and 4 on each item, indicating the
following: ‘appropriate behaviours for the child’s age’, ‘mildly abnormal’, ‘moderate abnormal’, and ‘severe abnormal’. The
total score of the CARS is 60 and the cut-off for Childhood Autism is 30.
Four studies that investigated the utility of the CARS were identified (Li et al., 2005a, 2005b; Lu & Yang, 2004; Yin et al.,
2011). All four studies were conducted within clinical settings, three of which recruited both children with autism and
children without autism. The age of study samples ranged from 1.8 to 16 years. Three diagnostic criteria were used,
including DSM-IV, ICD-10, and CCMD-3. All four studies adopted a cut-off of 30 on the CARS for diagnosis of Childhood
Autism. The agreement between the CARS and diagnostic criteria was high (97–100 percent). The validity of the CARS as a
diagnostic instrument for Childhood Autism was reported as excellent (sensitivity = specificity = 100%) by two studies (Li
et al., 2005a, 2005b). One study (Yin et al., 2011) reported that the internal consistency of the CARS was acceptable
(Cronbach’s a = 0.78).
3.3.2. Autism Diagnostic Interview and Autism Diagnostic Observational Schedule
The ADI was developed by Rutter, Le-Couteur and Lord (Le-Couteur et al., 1989). It is a standardised, face-to-face,
semi-structured diagnostic protocol for interviewing parents or caregivers of individuals referred for a possible ASC. In
the West, the current version, the Autism Diagnostic Interview-Revised (ADI-R), has been widely used in autism
research (Rutter et al., 2003). The ADI-R is an 85-page booklet containing 111 items in 3 core domains of ASC (Gray,
Tonge, & Sweeney, 2008), including social interaction, communication and repetitive and stereotyped behaviours,
interests and activities (RBIA). Each item is on a 4-scale rating from 0 to 3: ‘0’ indicates no evidence for abnormality; ‘2–
3’ indicates obvious evidence for abnormality. The ADI was introduced into mainland China in 1998. Two studies were
conducted in clinical settings to investigate the validity of the ADI in Chinese populations (Guo & Liu, 2002; Liu, Guo,
Guo, Yang, & Jia, 2004). Both studies applied the ADI in children with Childhood Autism and children with other mental
conditions. One study reported that 15 out of 16 items in the social interaction domain could statistically distinguish
children with autism from children without autism (p < 0.01). In the communication domain, 11 out of 13 items could
differentiate children with autism from children without autism (p < 0.01). In the RBIA domain, 4 out of 8 items
significantly differentiated children with autism from children without autism (p < 0.01) (Guo & Liu, 2002). The inter-
rater reliability of the ADI in the social interaction domain was good in 15 out of 16 items (Kappa 0.68). The inter-
rater reliability was good in 10 out of 13 items (Kappa 0.68) in the communication domain. The reliability was good
(Kappa 0.68) in 3 out of 8 items in the RBIA domain (Guo & Liu, 2002). It suggested that the relatively lower validity
and reliability of the RBIA domain of the Chinese ADI was due to the low specificity of the items in this domain,
according to previous studies (Fombonne, 1992; Yirmiya, Sigman, & Freeman, 1994), but not due to potential cultural
influence (Guo & Liu, 2002).
The other study investigated the reliability and validity of parts of the items on the ADI that were identified by the first
study. The second study examined 17 items on the ADI: seven items from the social interaction domain, six from the
communication domain and four from the RBIA domain (Liu et al., 2004). It reported that the inter-rater reliability of 11 out
of 17 items was moderate (Kappa: 0.43–0.58), and the reliability of another five items was good (Kappa: 0.6–0.64). Only one
item had fair inter-rater reliability (Kappa = 0.29). The test–retest reliability of nine items was good (Kappa: 0.61–0.73), and
the remaining eight items had moderate reliability (Kappa: 0.4–0.58). This revealed that all items could significantly
(p < 0.001) distinguish children with Childhood Autism from children with other mental conditions or typically developing
children (Liu et al., 2004).
The Autism Diagnostic Observational Schedule (ADOS) is a semi-structured, standardised, play-based observational
instrument (Lord et al., 2000). The assessment was designed to create a ‘social world’ for the subject to interact with the
examiner (Klein-Tasman, Risi, & Lord, 2007). The ADOS was first mentioned in the introductory report of the ADI (Guo &
Yang, 1998). However, the ADOS has not been applied to the Chinese population in the reviewed literature.
3.4. Comparison of diagnostic criteria
Four diagnostic criteria were used by Chinese clinicians: ICD-10 (World Health Organisation, 1993), DSM-III-R (American
Psychiatric Association, 1987), DSM-IV (American Psychiatric Association, 1994) and the Chinese Children Mental Diagnosis
(CCMD-2/2-R/3) (Chinese Society of Psychiatry, 1984, 1993, 2001). The first three are the same criteria used in Western
countries while the last one has been used only in mainland China.
Four studies were identified, which focused on the agreement between different diagnostic criteria of autism in
mainland China (Guo, Wan, & Shan, 2002; Liu & Jia, 2006; Tan, Ke, & Lu, 1998; Zhang, 2006). One study compared the
586 X. Sun et al. / Research in Autism Spectrum Disorders 7 (2013) 579–590
diagnostic results between ICD-10, DSM-IV and CCMD-2-R within a sample of 95 children with Childhood Autism (Tan
et al., 1998). The clinical judgement was the reference criteria. It reported that the percentage agreement between
CCMD-2-R and clinical judgement was 90 percent; agreement between DSM-IV and clinical judgement was 97 percent;
and agreement between ICD-10 and clinical judgement was 96 percent. This study also reported that the percentage
agreement between CCMD-2-R and the other two international criteria – DSM-IV and ICD-10 – was 48 and 59 percent
respectively. One study investigated the percentage agreement between CCMD-2-R, CCMD-3 and ICD-10. The sample
consisted of 117 children with Childhood Autism previously diagnosed by clinical judgement (Guo et al., 2002). This
study reported a percentage agreement of 100 percent between CCMD-3 and ICD-10 (Kappa = 1.00, p < 0.001); and an
agreement of 95.8 percent between CCMD-2-R and ICD-10 (Kappa = 0.92, p < 0.001). One study investigated the
agreement between CCMD-3 and DSM-IV within 255 clinically diagnosed autistic cases, which reported an agreement of
96.1 percent (Liu & Jia, 2006). Another study compared the diagnostic results between DSM-IV and clinical judgement
within a sample of 65 children with autism and 65 children without autism. It reported that the percentage agreement
between DSM-IV and clinical judgement was 100 percent; and the sensitivity of DSM-IV was 96 percent and specificity
was 92 percent (Zhang, 2006).
4. Discussion
4.1. Current screening and diagnostic instruments in mainland China
This study examined the available screening, diagnostic instruments and criteria for autism in mainland China. The most
frequently used instruments for screening in mainland China were the CABS and ABC. The most frequently used diagnostic
instrument was the CARS. The ADI was only validated without adoption in epidemiological research, while the ADOS has not
been applied to the Chinese population (Tang, Guo, Rice, Wang, & Cubells, 2010). It has been recommended that the
sensitivity and specificity of a screening instrument should be between 70 and 80 percent in order to be considered
psychometrically sound (Glascoe, 2005). The validity of the CABS, ABC, CARS, CHAT-23 and ASSQ was reported as good in
previous studies reviewed in this paper (sensitivity > 80%, specificity > 80%). These instruments were well respected and
widely used, but they were used mainly to capture cases of Childhood Autism. Considering the life-long course and
complexity of the condition, they might be weak in detecting cases of Asperger’s syndrome or PDD-NOS. In addition, the
CARS was also used as a second-stage screening instrument in previous prevalence studies in mainland China (Liu et al.,
2004; Zhang, Xu, & Zhang, 2005).
The most frequently used screening instruments were introduced to China in the late 1980s, and there have been very few
updates regarding the contents since they were first translated and validated. In particular, the CABS was developed in 1969
and widely used in autism research in mainland China (Clancy et al., 1969); however, the data on its validity and reliability in
the West was lacking. The CARS and ABC were also adopted early and they are still in use for ASC research in the West
(Fernandes & Miilher, 2008; Perry et al., 2005). The CHAT and the M-CHAT were developed relatively recently and have been
validated and adopted widely in research on ASC in the West (Baron-Cohen et al., 1992, 2000; Yama et al., 2012). The WABS,
FOSPAC-R, EASI, and PASS-IT were developed most recently and have not been widely adopted or studied in the West (Zhang
et al., 2011).
4.2. Limitations of review
There are several limitations of this review. First, the literature search was conducted using four databases. It is possible
that there are studies that were not identified because they were not published in major journals. However, the search
methodology was systematic, with double-checking using a second search in all databases; it was unlikely that publication
and selection bias could influence the results. Second, as all the papers were in Chinese, and the data was extracted and
translated into English by the first author only, and there could be language implications. Thus, it would be valuable to have
a second reader for data extraction and translation. Third, the results presented for the validity and reliability of
instruments were summarised according to the analytical methods stated in each paper. However, since the detailed
description of analytical approaches was generally lacking in reviewed papers, the analytical methods were assumed from
the presentation of results in each paper. Thus, there might be misinterpretation of the analytical approaches of the
reviewed studies.
4.3. Critique of research methodology in reviewed studies
In the reviewed studies, the information related to subject selection and diagnostic procedures was generally missing.
The lack of useful research techniques would make it difficult to replicate the research or compare the results with other
studies.
In terms of sampling method, the samples in reviewed studies were drawn mostly from clinical patients who already had
an autism diagnosis before the validation took place (17, 89.5%). Two studies selected children with other subtypes of ASC as
cases, while other studies only selected children with Childhood Autism (Guo et al., 2011; Liu et al., 2007). Only two studies
identified cases prospectively within the general population (Wu et al., 2010; Yang et al., 2010).
X. Sun et al. / Research in Autism Spectrum Disorders 7 (2013) 579–590 587
After the diagnosis, cases were examined by clinicians using screening instruments. Then, the performance of the
screening instruments was compared with the previous diagnosis. The clinicians in the research were generally not blind to
the diagnostic status of the participants. This could bias the results of screening validity and reliability as well as the
comparison of the diagnosis.
Six studies investigated the utility of instruments in children with Childhood Autism without comparable controls. Those
studies that had a control group did not provide information on whether the controls were drawn from the same population
as the cases. Therefore, it is possible that the results generated from such studies may be confounded by variables other than
the autistic condition.
The screening instruments used in mainland China often had two cut-offs. The lower cut-off was used for screening and
the higher cut-off for diagnosis. The individuals who scored above the lower cut-off were considered a potential case of
autism, and those who scored above the higher cut-off were given an autism diagnosis. The cut-off for the same screening
instrument varied among reviewed studies, which made it difficult to compare their results.
The analytical methods on the validity and reliability of instruments varied among reviewed studies. There were
four approaches: (1) simple percentage: examine the performance of an instrument by calculating the percentage of
cases identified by the instrument among the number of already known cases recruited in the study to generate the
agreement between screening instruments and the reference diagnostic criteria; (2) Cohen’s Kappa agreement test: a
statistical measure of inter-rater agreement for qualitative items. It investigates the extent to which there is an
agreement other than that expected by chance (Cohen, 1968); (3) validity: using sensitivity and specificity; (4)
statistic test to investigate whether the instruments can distinguish children with autism from children without
autism, such as the chi-square test.
Some studies reported particularly high sensitivity and specificity (sensitivity = 100%, specificity = 100%), such as the first
validation study of the ABC (Yang et al., 1993). However, the samples of these studies consisted of very severe autistic cases
and typically developing children. Those samples were not representative of the children on the whole autism spectrum in
the general population. The details of analytical methodology were generally lacking in reviewed papers. The type of analysis
could only be learned from the presentation of results. The usage of different statistical methods led to difficulty in justifying
these results and comparing them among studies.
4.4. Case identification and further direction of ASC in mainland China
As the reported prevalence of ASC has increased considerably in Western epidemiological studies (Fombonne, 2009),
the public health issues regarding individuals with ASC have attracted great attention (Brownson, Newschaffer, & Ali-
Abarghoui, 1997; Rice, Schendel, Cunniff, & Doernberg, 2004). China has a population of more than 1.37 billion. If
Western prevalence estimates for ASC (around 1%) are accurate and cross-culturally stable, then, when applied to China,
13.7 million individuals are expected to have ASC. However, research in this area is limited and the available studies
suggested a much lower prevalence in the Chinese population. This may be due in part to the lack of recognition of this
condition. Thus, improvement in case identification could help an understanding of the current situation of ASC in
mainland China. The screening and diagnostic instruments for ASC should have the ability to capture not only the most
obvious autistic features but also the more subtle ones, since individuals on the borderline of the spectrum also need to
be identified and helped. The lack of screening and diagnostic instruments for the whole autism spectrum leads to
difficulty in the comparison between East and West. It also leads to difficulty in policy-making and service provision
towards individuals with ASC. Most importantly, it will lead to individuals with milder conditions struggling for help.
Among identified instruments, the CHAT-23 and ASSQ were developed for the whole autism spectrum, and much
evidence for their utility has been provided by Western studies. Although these two instruments have been applied to the
Chinese population only once, they have demonstrated acceptable validity as screening instruments for ASC. However,
the CHAT-23 is only applicable to young children at a mental age of between 18 and 24 months (Wong et al., 2004), and
the utility of ASSQ in the general population still requires investigation. Furthermore, the utility of adopted instruments
needs to be confirmed with the diagnosis made by standardised diagnostic instruments in the general population. In this
way, a large population study of ASC can be achieved in mainland China. Thus, the adoption of advanced and well-
developed screening and diagnostic instruments to China is needed in order to improve the epidemiological and
aetiological research on ASC in mainland China.
4.5. Conclusions
Studies on screening and diagnostic instruments for ASC in mainland China are limited. Two screening instruments, the
ABC and CABS, have been more thoroughly studied in mainland China. Both of them were adopted from the West more than
two decades ago to detect potential cases of Childhood Autism. The CARS has been most frequently used as the diagnostic
instrument in autism research in mainland China. The ADI was introduced but has not been thoroughly adopted. There is a
lack of consistency in study methodology among identified studies, especially the varied adoption of cut-offs. Therefore, it is
difficult to make comparisons among studies. There is an urgent need to introduce more advanced and well-developed
screening and diagnostic instruments to mainland China, as well as more standardised research methodology for the
development of such instruments for ASC in the future. 588 Appendix A
Validity and reliability of screening and diagnostic instruments in mainland China.
Tool Author Case no. Cases age Sample Control no. Control age Cut-off for Cut-off Sensitivity Specificity Agreement Reference (mean age) source suspicion for diagnosis percentage standard
ABC Yang et al. (1993) 60 2–17 (6.37) Clinical 157 (MR) 7–17 (11.7) 31 50 31 cut-off: 100% 31 cut-off: 100% – DSM-III-R 108 (N) 4–8 (6.3) 50 cut-off: 97% 50 cut-off: 100% Wang et al. (2002) 20 2.33–11 (4) Clinical 20 (ADHD) 6–12 (8.5) 53 68 – – 80% ICD-10 Wang, Wang, and 20 2–8 (4.5) Clinical 0 – 56 72 – – 85% ICD-10 Wang (2003) Wang, Wang, and 22 2–8 (4.6) Clinical 0 – 56 72 – – 86% ICD-10 Shen (2003) X.
Lu and Yang (2004) 43 1.83–16 Clinical 31 (N) 2–14 53 67 – – 81% ICD-10 Sun Li et al. (2005a) 28 1.5–3 (26 children) Clinical 34 (N) 1.5–3 (31 children); 31 – 97% 89% 94% DSM-IV
I7 (2 children) I7 (2 children) et
89% – CCMD-3 Yin et al. (2011) 178 – Clinical 0 – – 53 Cronbach’s al.
a = 0.81 /
CABS Wang, Wang, and 20 2–8 (4.5) Clinical 0 – 14 – – – 95% ICD-10 Research Wang (2003) Wang, Wang, and 22 2–8 (4.6) Clinical 0 – 14 – – – 96% ICD-10
Shen (2003) in
Li et al. (2005a) 28 1.5–3 (26 children) Clinical 34 (N) 1.5–3 (31 children); 6 – 91%, 82% 87% DSM-IV Autism I7 (2 children) I7 (2 children)
Zhang (2006) 65 2.5–4 Clinical 65 (N) – – – 88% 82% – Clinical Spectrum judgement CHAT-23 Wu et al. (2010) 535 1.5–2 General 0 – (1) I6 items positive in Either meet Either meet – DSM-IV population 23 items; (2) I2 items (1) or (2): 94% (1) or (2): 88%
positive in 7 key items Disorders M-CHAT Zhang et al. (2011) 25 0.96–2.92 (1.9) Clinical 400 (N) 1.75–2.75 (2) (1) I3 items positive in Either meet Either meet 94% DSM-IV 23 items; (2) I2 items (1) or (2): 72% (1) or (2): 95%
positive in 6 key items 7
Gong et al. (2011) 93 1.2–3 (2.3) Clinical 85 (N) 1–2.9 (2) (1) I3 items positive in 96% 60% – DSM-IV (2013) 23 items; (2) I2 items
positive in 6 key items 579–590 ASSQ Guo et al. (2011) 94 3.9–9.6 (6.75) Clinical 120 (N) 4.7–7.3 (6) 12 – 96% 83% – DSM-IV 71 (MD) 10.8–16.8 (13.8) WABS Song et al. (2009) 30 – Special 30 (MD) – 51.5 – 100% 97% – Clinical school judgement 33 (N) – FOSPAC-R Yang et al. (2010) 27 1–1.42 Clinical 27 (N) 1–1.42 14 – 96% 93% – DSM-IV 27 (MD) 1–1.42 CHAT EASI Zhang et al. (2011) 25 0.96–2.92 (1.9) Clinical 400 (N) 1.75–2.75 (2) 40 – 80% 75% 75% DSM-IV aPASS-ITLu andZhang Yang et (2004) al. (2011)4325 1.83–16 0.96–2.92 (1.9) Clinical Clinical 31 (N) 400 (N) 2–14 1.75–2.75 (2) 30 20 36 – – 88% – 64% 98% 66% ICD-10 DSM-IV CARS Li et al. (2005a) 28 1.5–3 (26 children) Clinical 34 (N) 1.5–3 (31 children); 30 – 100% 100% 100% DSM-IV I7 (2 children) I7 (2 children) Li et al. (2005b) 41 2–10 (3.8) Clinical 34 (MD) 3–11 (4.5) – 30 100% 100% 100% DSM-IV Yin et al. (2011) 178 – Clinical 0 – 30 37 Cronbach’s a = 0.78 97% – CCMD-3 a Diagnostic instrument; CABS: Clancy Autism Behaviour Scale; ABC: Autism Behaviour Checklist; CARS: Childhood Autism Rating Scale; CHAT: Checklist of Autism in Toddlers; M-CHAT: Modified Checklist of Autism in Toddlers; ASSQ: Autism Spectrum Screening Questionnaire; WABS: Waterville Autistic Behaviour Scales; FOSPAC-R: Flinders Observational Schedule of Pre-verbal Autistic Characteristics-Revised; EASI: Early Autism Screening Items; PASS-IT: Pictorial Autism Screening Scale for Infant and Toddler; N: typical developed children; MR: mental retardation; ADHD: attention deficiency and hyperactivity disorder; MD: mental disorders.
X. Sun et al. / Research in Autism Spectrum Disorders 7 (2013) 579–590 589
Acknowledgements
XS was partly funded by the Waterloo Foundation, Cambridge Commonwealth Trust and Clare Hall of the University of
Cambridge. SBC, CA, and BA were funded by the Medical Research Council UK, the Wellcome Trust, and the team were funded
by the NIHR CLAHRC for Cambridgeshire and Peterborough NHS Foundation Trust during the period of this work. FM was
funded by MRC UK.
References
American Psychiatric Association. (1980). Diagnostic and statistical manual of mental disorders (3rd ed. DSM-III). Washington: American Psychiatric Association.
American Psychiatric Association. (1987). Diagnostic and statistical manual of mental disorders (3rd ed., rev. (DSM-III-R)). Washington, DC: American Psychiatric
Association.
American Psychiatric Association. (1994). Diagnostic and statistical manual of mental disorders (DSM-IV) (4th ed.). Washington, DC: APA: American Psychiatric
Association.
Baird, G., Charman, T., Cox, A., Baron-Cohen, S., Swettenham, J., Wheelwright, S., et al. (2001). Current topic: Screening and surveillance for autism and pervasive
developmental disorders. Archives of Disease in Childhood, 84(6), 468–475.
Barbaro, J., & Dissanayake, C. (2010). Prospective identification of autism spectrum disorders in infancy and toddlerhood using developmental surveillance: The
social attention and communication study. Journal of Developmental and Behavioral Pediatrics, 31(5), 376–385.
Baron-Cohen, S., Allen, J., & Gillberg, C. (1992). Can autism be detected at 18 months? The needle, the haystack, and the CHAT. British Journal of Psychiatry, 161,
839–843.
Baron-Cohen, S., Scott, F. J., Allison, C., Williams, J., Bolton, P., Matthews, F. E., et al. (2009). Prevalence of autism-spectrum conditions: UK school-based population
study. The British Journal of Psychiatry, 194(6), 500–509.
Baron-Cohen, S., Wheelwright, S., Cox, A., Baird, G., Charman, T., Swettenham, J., et al. (2000). Early identification of autism by the Checklist for Autism in Toddlers
(CHAT). Journal of Royal Society of Medicine, 93(10), 521–525.
Brownson, R. C., Newschaffer, C. J., & Ali-Abarghoui, F. (1997). Policy research for disease prevention: Challenges and practical recommendations. American Journal
of Public Health87(5).
Chinese Society of Psychiatry. (1984). Chinese Classification of Mental Disorders (CCMD)(2nd ed.). .
Chinese Society of Psychiatry. (1993). Chinese Classification of Mental Disorders(2nd ed. revised (CCMD-2-R)). .
Chinese Society of Psychiatry. (2001). Chinese Classification of Mental Disorders(3rd ed. (CCMD-3)). .
Clancy, H., Dugdale, A., & Rendle-Short, J. (1969). The diagnosis of infantile autism. Developmental Medicine and Child Neurology, 11(4), 432–442.
Cohen, J. (1968). Weighted kappa: Nominal scale agreement with provision for scaled disagreement or partial credit. Psychological Bulletin, 70(4), 213–220.
Duby, J. C., & Johnson, C. P. (2009). Universal screening for autism spectrum disorders: A snapshot within the big picture. Pediatric Annals, 38(1), 36–41.
Dumont-Mathieu, T., & Fein, D. (2005). Screening for autism in young children: The Modified Checklist for Autism in Toddlers (M-CHAT) and other measures.
Mental Retardation and Developmental Disabilities Research Reviews, 11(3), 253–262.
Eaves, R. C., & Milner, B. (1993). The criterion-related validity of the Childhood Autism Rating Scale and the Autism Behavior Checklist. Journal of Abnormal Child
Psychology, 21(5), 481–491.
Ehlers, S., & Gillberg, C. (1993). The epidemiology of Asperger syndrome. A total population study. Journal of Child Psychology and Psychiatry and Allied Disciplines,
34(8), 1327–1350.
Ehlers, S., Gillberg, C., & Wing, L. (1999). A screening questionnaire for Asperger syndrome and other high-functioning autism spectrum disorders in school age
children. Journal of Autism Developmental Disorders, 29(2), 129–141.
Fernandes, F. D., & Miilher, L. P. (2008). Relations between the Autistic Behavior Checklist (ABC) and the functional communicative profile. Pro Fono, 20(2),
111–116.
Fombonne, E. (1992). Diagnostic assessment in a sample of autistic and developmentally impaired adolescents. Journal of Autism and Developmental Disorders,
22(4), 563–581.
Fombonne, E. (2009). Epidemiology of pervasive developmental disorders. Pediatrics in Review, 65(6), 591–598.
Garfin, D. G., McCallon, D., & Cox, R. (1988). Validity and reliability of the Childhood Autism Rating Scale with autistic adolescents. Journal of Autism and
Developmental Disorders, 18(3), 367–378.
Glascoe, F. P. (2005). Screening for developmental and behavioral problems. Mental Retardation and Developmental Disabilities Research Reviews, 11(3), 173–179.
Gong, Y., Liu, J., Li, C., Jia, M., Song, W., & Guo, Y. (2011). Reliability and validity of the Chinese version of the Modified Checklist for Autism in Toddlers. Chinese
Journal of Mental Health, 25(6), 409–414.
Gray, K. M., Tonge, B. J., & Sweeney, D. J. (2008). Using the Autism Diagnostic Interview-Revised and the Autism Diagnostic Observation Schedule with young
children with developmental delay: Evaluating diagnostic validity. Journal of Autism and Developmental Disorders, 38(4), 657–667.
Guo, L., Wan, Y., & Shan, Y. (2002). A clinical study of the 3rd edition of Chinese classification of mental disorders for childhood autism. Chinese Journal of Psychiatry,
35(3), 162–165.
Guo, Y., & Liu, J. (2002). Reliability and diagnostic validity study on Autism Diagnostic Interview-Revised. Chinese Journal of Psychiatry, 35: 42–45.
Guo, Y., & Yang, X. (1998). Introduction and clinical application of Autism Diagnostic Interview. Journal of Foreign Psychiatry, 25(3), 174–177.
Guo, Y. Q., Tang, Y., Rice, C., Lee, L. C., Wang, Y. F., & Cubells, J. F. (2011). Validation of the Autism Spectrum Screening Questionnaire, Mandarin Chinese Version
(CH-ASSQ) in Beijing, China. Autism, 15(6), 713–727.
Kim, Y. S., Leventhal, B. L., Koh, Y. J., Fombonne, E., Laska, E., Lim, E. C., et al. (2011). Prevalence of autism spectrum disorders in a total population sample. American
Journal of Psychiatry, 168(9), 904–912.
Klein-Tasman, B. P., Risi, S., & Lord, C. E. (2007). Effect of language and task demands on the diagnostic effectiveness of the autism diagnostic observation schedule:
The impact of module choice. Journal of Autism and Developmental Disorders, 37(7), 1224–1234.
Kopp, S., & Gillberg, C. (2011). The Autism Spectrum Screening Questionnaire (ASSQ)-Revised Extended Version (ASSQ-REV): An instrument for better capturing
the autism phenotype in girls? A preliminary study involving 191 clinical cases and community controls. Research in Developmental Disabilities, 32(6),
2875–2888.
Krug, D. A., Arick, J., & Almond, P. (1980). Behaviour checklist for identifying severely handicapped individuals with high levels of autistic behaviour. Journal of
Child Psychology and Psychiatry, 21(3), 221–229.
Le-Couteur, A., Rutter, M., Lord, C., Rios, P., Robertson, S., Holdgrafer, M., et al. (1989). Autism diagnostic interview: A standardized investigator-based instrument.
Journal of Autism and Developmental Disorders, 19(3), 363–387.
Levy, S. E., Mandell, D. S., & Schultz, R. T. (2009). Autism. Lancet, 374(9701), 1627–1638.
Li, J., Zhong, J., Cai, L., Chen, Y., & Zhou, M. (2005a). Comparison of clinical application of three autism rating scale. China Journal of Contemporary Paediatrics, 7(1),
59–62.
Li, J., Zhong, J., Cai, L., Chen, Y., & Zhou, M. (2005b). An investigation of clinical application of Childhood Autism Rating Scale. Chinese Journal of Child Health Care,
13(3), 267–268.
Liu, J., Guo, W. Y., Guo, Y., Yang, X., & Jia, M. (2004). The developmental of a screening checklist of childhood autism. Chinese Journal of Mental Health, 18(6),
400–404.
Liu, J., & Jia, M. (2006). A comparative study of Diagnostic Criteria of CCMD-3 and DSM-IV on Childhood Autism. Chinese Journal of Medical Health, 20(9), 568–571.
590 X. Sun et al. / Research in Autism Spectrum Disorders 7 (2013) 579–590
Liu, J., Yang, X., Jia, M., Qu, C., Shi, J., Liu, G., et al. (2007). Epidemiological survey of the pervasive developmental disorder of 2–6 years old children in Beijing.
Chinese Mental Health Journal, 2(3), 290–293.
Lord, C., Risi, S., Lambrecht, L., Cook, E. H., Jr., Leventhal, B. L., DiLavore, P. C., et al. (2000). The autism diagnostic observation schedule-generic: A standard measure
of social and communication deficits associated with the spectrum of autism. Journal of Autism and Developmental Disorders, 30(3), 205–223.
Lord, C., Rutter, M., DiLavore, P., & Risi, S. (2001). Autism Diagnostic Observation Schedule (ADOS). Los Angeles, CA: Western Psychological Services.
Lu, J., & Yang, Z. (2004). Neurodevelopment anomalies assessment in autistic children. Chinese Journal of Modern Medicine, 14(12), 42–48.
Magyar, C. I., & Pandolfi, V. (2007). Factor structure evaluation of the childhood autism rating scale. Journal of Autism and Developmental Disorders, 37(9),
1787–1794.
Mattila, M. L., Jussila, K., Kuusikko, S., Kielinen, M., Linna, S. L., Ebeling, H., et al. (2009). When does the Autism Spectrum Screening Questionnaire (ASSQ) predict
autism spectrum disorders in primary school-aged children? European Child and Adolescent Psychiatry, 18(8), 499–509.
Miranda-Linne, F. M., & Melin, L. (2002). A factor analytic study of the Autism Behavior Checklist. Journal of Autism and Developmental Disorders, 32(3), 181–188.
Norris, M., & Lecavalier, L. (2010). Screening accuracy of Level 2 autism spectrum disorder rating scales. A review of selected instruments. Autism, 14(4), 263–284.
Pereira, A., Riesgo, R. S., & Wagner, M. B. (2008). Childhood autism: Translation and validation of the Childhood Autism Rating Scale for use in Brazil. Jornal de
Pediatria, 84(6), 487–494.
Perry, A., Condillac, R. A., Freeman, N. L., Dunn-Geier, J., & Belair, J. (2005). Multi-site study of the Childhood Autism Rating Scale (CARS) in five clinical groups of
young children. Journal of Autism and Developmental Disorders, 35(5), 625–634.
Porta, M. (2008). A dictionary of epidemiology: A handbook sponsored by the I.E.A.. Oxford: Oxford University Press.
Rice, C., Schendel, D., Cunniff, C., & Doernberg, N. (2004). Public health monitoring of developmental disabilities with a focus on the autism spectrum disorders.
American Journal of Medical Genetics Part C: Seminars in Medical Genetics, 125C(1), 22–27.
Rice, C. E., Baio, J., Van Naarden, B. K., Doernberg, N., Meaney, F. J., & Kirby, R. S. (2007). A public health collaboration for the surveillance of autism spectrum
disorders. Paediatric and Perinatal Epidemiology, 21(2), 179–190.
Rutter, M., LeCouteur, A., & Lord, C. (2003). Autism Diagnostic Interview-Revised Manual. Los Angeles, CA: Western Psychological Services.
Schopler, E., Reichler, J., & Renner, B. (1988). The Childhood Autism Rating Scale (CARS). Los Angeles: Western Psychological Service.
Sevin, J. A., Matson, J. L., Coe, D. A., Fee, V. E., & Sevin, B. M. (1991). A comparison and evaluation of three commonly used autism scales. Journal of Autism and
Developmental Disorders, 21(4), 417–432.
Skuse, Warrington, R., Bishop, D., Chowdhury, U., Lau, J., Mandy, W., et al. (2004). The developmental, dimensional and diagnostic interview (3di): A novel
computerized assessment for autism spectrum disorders. Journal of the American Academy of Child and Adolescent Psychiatry (0890-8567 (Print)).
Song, Y., Fang, J., & Sun, L. (2009). A new, high precision behavior-based questionnaire for screening and pre-diagnosing autism in children: Implication for clinical
applications and large population investigation. Journal of Psychological Science, 32(6), 1504–1507.
Stella, J., Mundy, P., & Tuchman, R. (1999). Social and nonsocial factors in the Childhood Autism Rating Scale. Journal of Autism and Developmental Disorders, 29(4),
307–317.
Sun, X., & Allison, C. (2009). A review of the prevalence of Autism Spectrum Disorder in Asia. Research in Autism Spectrum Disorder, 4(2), 156–167.
Tan, Z., Ke, X., & Lu, R. (1998). A comparison study of three diagnostic criteria on Autism. Chinese Journal of Psychiatry, 31, 175–177.
Tang, Y., Guo, Y., Rice, C. E., Wang, Y., & Cubells, J. F. (2010). Introduction of the ‘‘gold standard’’ diagnostic instrument
International Journal of Psychiatry, 37(1), 38–40.
Tao, K.-t. (1999). Child psychiatry. Nanjing: Jiansu Science and Technology Press.
Vostanis, P., Smith, B., Chung, M. C., & Corbett, J. (1994). Early detection of childhood autism: A review of screening instruments and rating scales. Child Care Health
and Development, 20(3), 165–177.
Wang, W., Zai, L., & Zhen, L. e. a. (2002). An epidemiological investigation on autistic disorder in Changzhou Province. Journal of Clinical Psychiatric Medicine, 12(3),
148–149.
Wang, Y., Wang, G., & Wang, Y. (2003). Analysis of childhood autism by using Clancy Autism Behavior Scale and Autism Behavior Checklist. Journal of Shandong
University (Health Science), 41(2), 213–214.
Wang, Y., Wang, Y., & Shen, Y. (2003). Investigation of intelligence and behavior impairment in autistic children. Chinese Journal of Child Health Care, 11(2),
133–134.
Webb, P. (2005). Essential epidemiology: An introduction for students and health professionals. Cambridge: Cambridge University Press.
Williams, J. (2003). Screening for autism spectrum disorders. University of Cambridge.
Wing, L., Leekam, S. R., Libby, S. J., Gould, J., & Larcombe, M. (2002). The Diagnostic Interview for Social and Communication Disorders: Background, inter-rater
reliability and clinical use. Journal of Child Psychology and Psychiatry and Allied Disciplines, 43(3), 307–325.
Wong, V., Hui, L. H., Lee, W. C., Leung, L. S., Ho, P. K., Lau, W. L., et al. (2004). A modified screening tool for autism (Checklist for Autism in Toddlers [CHAT-23]) for
Chinese children. Pediatrics, 114(2), e166–e176.
World Health Organisation. (1993). The ICD-10 Classification of Mental and Behavioural Disorder: Diagnosis Criteria for Research. Geneva: World Health Organisation.
Wu, X., Lu, Y., Wang, Y., Zheng, Q., Wang, T., Lin, J., et al. (2010). Investigation of childhood autism status in Lianyungang city. Journal of Modern Medicine and
Hygiene, 26(24), 3724–3726.
Yama, B., Freeman, T., Graves, E., Yuan, S., & Karen, C. M. (2012). Examination of the Properties of the Modified Checklist for Autism in Toddlers (M-CHAT) in a
Population Sample. Journal of Autism and Developmental Disorders, 42(1), 23–24.
Yang, W., Shao, Z., Gan, W., Shen, W., Tang, L., & Wang, Y. (2010). Modification and clinical application of Childhood Autism Screening Checklist. Chinese Journal of
Child Health Care, 18(12), 954–956.
Yang, X. L., Huang, L. Q., Jia, M. X., & Chen, S. (1993). Validation study of Autism Behavior Checklist. Chinese Journal of Mental Health, 7(6), 279–280.
Yin, Q., Chen, J., Luo, X., & Li, X. (2011). Reliability and validity of Childhood Autism Rating Scale and Autism Behaviour Checklist. International Medicine and Hygiene
Guideline Newspaper, 17(12), 1470–1475.
Yirmiya, N., Sigman, M., & Freeman, B. J. (1994). Comparison between diagnostic instruments for identifying high-functioning children with autism. Journal of
Autism and Developmental Disorders, 24(3), 281–291.
Yong, R. (2001). Current research in the area of Autism and Savant Syndrome. International Education Journal, 2(4), 329–333.
Zhang, J., Xu, C., & Zhang, J. (2005). Survey of Autism among Preschool Children in Wuhan. China Maternal and Child Health, 20, 2395–2396.
Zhang, N., Tan, H., Xiao, H., Zhang, J., & Zhang, S. (2011). A comparison study of three early screening instruments of autism for preschool children. Chinese Journal
of Clinical Paediatrics, 29(7), 656–660.
Zhang, Q. (2006). The evaluation of diagnosis of Childhood Autism by DSM-IV and Clancy Autism Behavior Scale. Journal of Medical Theory and Practice, 19(5), 586–587.