WITHDRAWN: Improving the Measurement of Alexithymia in Autistic Adults: A Psychometric Investigation and Refinement of the Twenty-item Toronto Alexithymia Scale

doi:10.21203/rs.3.rs-96364/v1

Download PDF

Research

WITHDRAWN: Improving the Measurement of Alexithymia in Autistic Adults: A Psychometric Investigation and Refinement of the Twenty-item Toronto Alexithymia Scale

https://doi.org/10.21203/rs.3.rs-96364/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 02 Mar, 2021

Read the published version in Molecular Autism →

Version 1

posted

You are reading this older preprint version

Read the latest preprint version →

The full text of this preprint has been withdrawn by the authors while they make corrections to the work. Therefore, the authors do not wish this work to be cited as a reference. Questions should be directed to the corresponding author.

Editorial notes are used to provide important context regarding the topic of a preprint or to alert readers to potential issues concerning that preprint or a downstream publication associated with it. For more information on editorial notes, see our Editorial Policies.

Background: Alexithymia, a personality trait characterized by difficulties interpreting emotional states, is commonly elevated in autistic adults, and a growing body of literature suggests that this trait underlies a number of cognitive and emotional differences previously attributed to autism. Although questionnaires such as the twenty-item Toronto Alexithymia Scale (TAS-20) are frequently used to measure alexithymia in the autistic population, there have been few studies attempting to determine the psychometric properties of these questionnaires in autistic adults, including whether differential item functioning (I-DIF) exists between autistic and general population adults.

Methods: We conducted an in-depth psychometric analysis of the TAS-20 in a large sample of 743 cognitively able autistic adults recruited from the Simons Foundation SPARK participant pool and 721 general population controls enrolled in a large international psychological study. The factor structure of the TAS-20 was examined using confirmatory factor analysis, and item response theory was used to further refine the scale based on local model misfit and I-DIF between the groups. Correlations between alexithymia and other clinical outcomes were used to assess the nomological validity of the revised alexithymia scale in the SPARK sample.

Results: The TAS-20 did not exhibit adequate model fit in either the autistic or general population samples. Empirically driven item reduction was undertaken, resulting in an eight-item unidimensional scale (TAS-8) with sound psychometric properties and practically ignorable I-DIF between diagnostic groups. Correlational analyses indicated that TAS-8 scores meaningfully predict autistic trait levels, anxiety and depression symptoms, and quality of life, even after controlling for trait neuroticism.

Limitations: Limitations of the current study include a sample of autistic adults that was overwhelmingly female, later-diagnosed, and well-educated; clinical and control groups drawn from different studies with variable measures; and an inability to test several other important psychometric characteristics of the TAS-8, including sensitivity to change and I-DIF across multiple administrations.

Conclusions: These results indicate the potential of the TAS-8 as a psychometrically robust tool to measure alexithymia in both autistic and non-autistic adults. A free online score calculator has been created to facilitate the use of norm-referenced TAS-8 latent trait scores in research applications (available at http://asdmeasures.shinyapps.io/TAS8_Score).

Psychology

autism

alexithymia

Bayesian statistics

differential item functioning

emotion

item response theory

factor analysis

measurement

psychometric

reliability

validity

Alexithymia is a subclinical condition characterized by difficulties in identifying and describing one’s own emotional state (1,2). Individuals scoring high on measures of alexithymia exhibit difficulties recognizing and labeling their internal emotional states, discriminating between different emotions of the same affective valence, and describing and communicating their emotional states to others. These individuals also tend to exhibit a reduction in imaginal processes and a stimulus-bound, externally-oriented style of thinking (i.e., “concrete thinking”). Alexithymia is not itself considered a psychiatric diagnosis; rather, the condition can better be described as a dimensional personality trait that is expressed to varying degrees in the general population and associated with a host of medical, psychiatric, and psychosomatic conditions (2–14). Although there is taxometric evidence to suggest that alexithymia is a dimensional rather than categorical construct (15–17), researchers frequently categorize a portion of individuals as having “high alexithymia” based on questionnaire scores above a certain threshold, with upwards of 10% of the general population exceeding these thresholds (18–20). Over the last five decades, a large body of research has emerged to suggest that alexithymia is a transdiagnostic predictor of important clinical outcomes, such as suicidal ideation and behavior, non-suicidal self-injury, risky drinking, and reduced response to various medical and psychotherapeutic treatments (21–26).

Alexithymia is a construct of particular interest in research on autism spectrum disorder (hereafter “autism), a condition frequently associated with difficulties in processing, recognizing, communicating, and regulating emotions (27–32). A recent meta-analysis of published studies identified large differences between autistic adolescents/adults and neurotypical controls on self-reported alexithymia as measured by the Toronto Alexithymia Scale (TAS (2,33,34)), with an estimated 49.93% of autistic individuals exceeding cutoffs for “high alexithymia” on the twenty-item TAS (TAS-20), compared to only 4.89% of controls (3). Alexithymia has also been suggested to be part of the “Broader Autism Phenotype” (35–37), the cluster of personality characteristics observed in parents of autistic children and other individuals with high-levels of subclinical autistic traits (38). Along with verbal IQ, self-reported alexithymia is one of the stronger predictors of task-based emotion processing ability in the autistic population (29), and a number of studies measuring both alexithymia and core autism symptoms have concluded that alexithymia accounts for some or all of the emotion-processing differences attributed to the categorical diagnosis of autism (39–51). Within the autistic population, alexithymia is also a meaningful predictor of the severity of co-occurring mental health conditions, showing relationships with symptoms of depression, general anxiety, social anxiety, non-suicidal self-injury, and suicidality (52–58).

Despite the impressive body of literature on alexithymia in autistic individuals and its relationships with other constructs, there has been surprisingly little investigation into the measurement properties of alexithymia measures in the autistic population (59). One small study by Berthoz and Hill (60) addressed the validity of two common alexithymia scales (the TAS-20 and Bermond-Vorst Alexithymia Questionnaire–Form B [BVAQ-B] (61)) in a sample of 27 autistic adults and 35 neurotypical controls. In this small sample, the investigators found that autistic adults adequately comprehended the content of the alexithymia questionnaires, also noting high correlations between the two measures in both diagnostic groups. A subset of the sample also completed the same forms 4–12 months later, and test-retest reliability coefficients for both the TAS-20 and BVAQ-B in autistic adults were deemed adequate (test-retest Pearson r = 0.92 and 0.81 for the TAS-20 and BVAQ-B total scores, respectively, with all subscale rs > 0.62). The internal consistency of the TAS-20 and its three subscales has also been reported in a sample of 27 autistic adults by Samson et al. (62), who reported adequate reliability for the TAS-20 total score (α = 0.84), “difficulty identifying feelings” (DIF) subscale (α = 0.76), and “difficulty describing feelings” (DDF) subscale (α = 0.81) subscales, but subpar reliability for the TAS-20 “externally-oriented thinking” (EOT) subscale (α = 0.65). Additional studies have also replicated the high correlations between TAS-20 and BVAQ scores in autistic adults (42) and demonstrated the TAS-20 total score and combined DIF/DDF subscales to be reliable in samples of cognitively able autistic adolescents (51,56). Nevertheless, we are unaware of any study to date systematically investigating the psychometric properties of the TAS-20 or any other alexithymia measure in autistic individuals using large-sample latent variable modeling techniques.

Given the prominence of the TAS-20 as the primary alexithymia measure employed in autism literature (3,29,59), the remainder of this paper will focus specifically on this scale. Although the TAS-20 is extensively used in research on alexithymia in a number of clinical and non-clinical populations (2), a number of psychometric concerns have been raised about the measure’s factor structure, reliability, utility in specific populations, and confounding by general psychological distress (2,63–69). In particular, the original three-factor structure of the TAS-20 (consisting of DIF, DDF, and EOT) often fails to achieve adequate model fit, although the use of a bifactor structure and/or removal of reverse-coded items may alleviate this issue (2,64,69). Most of the psychometric problems associated with the TAS-20 are driven by the EOT subscale, which often exhibits subpar internal consistency (including in the autistic sample reported by Samson et al. (62)), contains several items that relate poorly to the overall construct, and seems to be particularly problematic when the scale is used in samples of children and adolescents (2,63,65,66,70).

Another issue raised in the literature is the relatively high correlation between TAS-20 scores and trait neuroticism/general psychological distress (2,67,68). Although the creators of the TAS-20 have argued that the relationship between alexithymia and neuroticism is in in line with theoretical predictions (2), interview measures of alexithymia such as the Toronto Structured Interview for Alexithymia (TSIA (71)) do not correlate highly with neuroticism, potentially indicating that the previously observed correlation between TAS-20 scores and neuroticism reflects a response bias on self-report items rather than the a true relationship between neuroticism and the alexithymia construct (72,73). Regardless of the true nature of this relationship, a high correlation between the TAS-20 and neuroticism remains problematic, as a sizable portion of the ability of the TAS-20 score to predict various clinical outcomes may be driven by neuroticism, which is itself a strong predictor of a number of different psychopathologies (74–77). Notably, given the paucity of alexithymia measurement studies in samples of autistic individuals, no study to date has determined whether the TAS-20 continues to exhibit these same measurement issues in the autistic population.

Given the paucity of alexithymia measurement studies in samples of autistic individuals, we lack sufficient published evidence as to whether the TAS-20 exhibits these same measurement issues in the autistic population. Further, the comparability of item responses between autistic and neurotypical respondents is a psychometric issue that has yet to be addressed in the alexithymia literature. Differential item functioning (referred to here as “item DIF” [I-DIF] to avoid confusion with the DIF TAS-20 subscale) is often present when comparing questionnaire scores between autistic and non-autistic individuals (78–80), indicating differences in the ways item responses relate to underlying traits. I-DIF is important to consider when comparing test scores between groups, as it has the potential to obscure the magnitude of existing group differences, either creating artifactual group differences when none exist or masking small but meaningful differences between two groups. Although the large differences between autistic and neurotypical individuals on measures of alexithymia are unlikely to be entirely due to I-DIF, it remains possible that I-DIF may substantially bias between-group effect sizes. Furthermore, previous investigations of measurement invariance of the TAS-20 between general population samples and clinical samples of psychiatric patients have often only found evidence for partial invariance across groups (2), suggesting that I-DIF likely exists between autistic and non-autistic adults on at least some of the TAS-20 items. I-DIF may also exist between specific subgroups of the autistic population (e.g., based on age, sex, education level, or presence of comorbidities), and explicit testing of this psychometric property is necessary to determine whether a given measure can be considered equivalent across multiple sociodemographic categories. Notably, while the I-DIF null hypothesis of complete equivalence of all parameters between groups is always false at the population level (81), the effects of I-DIF may be small enough to be practically ignorable, allowing for reasonably accurate between-group comparisons (82,83). Thus, an important step of I-DIF analysis is the calculation of effect sizes, which help to determine whether the observed I-DIF is large enough to bias item or scales scores to a practically meaningful extent.

Given the importance of the alexithymia construct in the autism literature and the many unanswered questions regarding the adequacy of the TAS-20 in multiple populations, there is a substantial need to determine whether the TAS-20 is an adequate measure of alexithymia in the autistic population. Thus, in the current study, we comprehensively evaluated the psychometric properties of the TAS-20 in a large sample of autistic adults, assessing the measure’s latent structure, reliability, and differential item functioning by diagnosis and across multiple subgroups of the autistic population. Additionally, as a secondary aim, we sought to remove poorly-fitting items and items exhibiting I-DIF by diagnosis, creating a shortened version of the TAS with strong psychometric properties and the ability to accurately reflect true latent trait differences between autistic and non-autistic adults. We further established the nomological validity of the refined TAS by confirming hypothesized relationships with core autism features, co-occurring psychopathology, trait neuroticism, demographic features, and quality of life. Lastly, in order to more fully interrogate the relationships between trait neuroticism and alexithymia in the autistic population, we conducted additional analyses to determine whether our reduced TAS form was able to predict additional variance in autism features, psychopathology, and quality of life once controlling for levels of neuroticism.

The current investigation was a secondary data analysis of TAS-20 responses collected as a part of multiple online survey studies (See “Participants” section for more details on each study). Participants reporting professional diagnoses of autism spectrum disorder were recruited from the Simons Foundation Powering Autism Research for Knowledge (SPARK) cohort, a U.S.-based online community that allows autistic individuals and their families to participate in autism research studies (84). In order to compare TAS scores and item responses between autistic and non-autistic individuals, we combined the SPARK sample with open data from the Human Penguin Project (85,86), a large multinational survey study investigating the relationships between core body temperature, social network structure, and a number of other variables (including alexithymia measured using the TAS) in adults from the general population. The addition of a control group provides a substantial amount of additional information, allowing us to assess I-DIF across diagnostic groups, assess the psychometric properties of any newly-created TAS short forms in the general population, and generate normative scores for these short forms based on the distribution of TAS scores in this sample. Although autism status was not assessed in the control sample, the general population prevalence of approximately 2% autistic adults (87) does not cause enough “diagnostic noise” in an otherwise non-autistic sample to meaningfully bias item parameter estimates or alter tests of differential item functioning (78).

Participants

SPARK (Autism) Sample

Using the SPARK Research Match service, we invited autistic adults between the ages of 18 and 45 years, 11 months to take place in our study via the SPARK research portal. All individuals self-reported a prior professional diagnosis of autism spectrum disorder or equivalent condition (e.g., Asperger syndrome, PDD-NOS). Notably, although these diagnoses are not independently validated by SPARK, the majority of participants are recruited from university autism clinics and thus have a very high likelihood of valid autism diagnosis (84). Furthermore, validation of diagnoses in the Interactive Autism Network, a similar participant pool now incorporated into SPARK, found that 98% of registry participants were able to produce valid clinical documentation of self-reported diagnoses when requested (88). Autistic participants in our study completed a series of surveys via the SPARK platform that included the TAS-20, additionally providing demographics, current and lifetime psychiatric diagnoses, and scores on self-report questionnaires measuring autism severity, quality of life, co-occurring psychiatric symptoms, and a number of other clinical variables (see “Measures” section for descriptions of the questionnaires analyzed in the current study). These data were collected during winter and spring of 2019 as part of a larger study on repetitive thinking in autistic adults (project number RM0030Gotham), and the SPARK participants in the current study are a subset of those described by Williams et al. (78). Participants received a total of $50 in Amazon gift cards for completion of the study. A total of 1,012 individuals enrolled in the study, 743 of whom were included in the current analyses. Participants were excluded if they (a) did not self-report a professional diagnosis of autism on the demographics form, (b) did not complete the TAS-20, (c) indicated careless responding as determined by incorrect answers to two instructed-response items (e.g., Please respond ‘Strongly Agree’ to this question.), or (d) answered “Yes” or “Suspected” to a question regarding being diagnosed with Alzheimer’s disease (which given the age of participants in our study almost certainly indicated random or careless responding). All participants gave informed consent, and all study procedures were approved by the institutional review board at Vanderbilt University Medical Center.

Human Penguin Project (Neurotypical) Sample

Data from a neurotypical control sample was derived from an open dataset generated from the Human Penguin Project (HPP) (85,86), a multinational survey study designed to test the theory of social thermoregulation (89). Because the full details of this sample have been reported elsewhere (85,86), we provide only a brief overview, focusing primarily on the participants whose data were utilized in the current study. The HPP sample was collected in two separate studies in 2015–2016: one online pilot study (N = 232) that recruited participants from Amazon’s Mechanical Turk and the similar crowdsourcing platform Prolific Academic (90,91) and a larger cross-national study (12 countries, total N = 1523) that recruited subjects from 15 separate university-based research groups. In order to eliminate problems due to the non-equivalence of TAS items in different languages, we used only those data where the TAS-16 was administered in English (i.e., all crowdsourced pilot data, as well as cross-national data from the University of Oxford, Virginia Commonwealth University, University of Southampton, Singapore Management University, and University of California, Santa Barbara). Additionally, in order to match the HPP and SPARK samples on mean age, we excluded all HPP participants over the age of 60. Notably, individuals aged 45–60 were included due to the relative excess of individuals aged 20–30 in the HPP sample, which caused the subsample of 18–45-year-old HPP participants to be several years younger on average than the SPARK sample. The final HPP sample thus consisted of a total of 721 English-speaking adults aged 18–60 (MTurk n = 122; Prolific n = 84; Oxford n = 129; Virginia n = 148; Southampton n = 6; Singapore n = 132; Santa Barbara n = 100). As a part of this study, all participants completed a 16-item version of the TAS (TAS-16) that excludes four TAS-20 items (16, 17, 18, and 20) on the basis of poor factor loadings in the psychometric study of Kooiman et al. (63). In addition to item-level data from the TAS-16, we extracted the following variables: age (calculated from birth year), sex, and site of recruitment. The HPP was approved under an “umbrella” ethics proposal at Vrije Universiteit, Amsterdam, and separately at each contributing site. All study procedures complied with the ethics code outlined in the Declaration of Helsinki.

Measures

Toronto Alexithymia Scale (TAS)

The TAS (2,33) is a the most frequently and widely used self-report measure of alexithymia, as well as the most commonly administered alexithymia measure in the autism literature (3). The most popular version of this form, the TAS-20 has been used in medical, psychiatric, and general-population samples as a composite measure of alexithymia for over 25 years (2), and this form has been translated into over 30 languages/dialects. The TAS-20 contains twenty items rated on a five-point Likert scale items from Strongly Disagree to Strongly Agree. The TAS-20 is organized into three subscales, Difficulty Identifying Feelings (DIF; 7 items), Difficulty Describing Feelings (DDF; 5 items), and Externally-oriented Thinking (EOT; 8 items), corresponding to three of the four components of the alexithymia construct defined by Nemiah, Freyberger, and Sifneos (1). Notably, the fourth component, Difficulty Fantasizing (DFAN), was also included in the original 26-item version of the TAS (34), but this subscale showed poor coherency with the other three and was ultimately dropped from the measure (2). The sum of items on the TAS-20 is often used as an overall measure of alexithymia, and scores of 61 or higher are typically used to create binary alexithymia classifications in both general population and clinical samples.

As noted earlier, neurotypical participants in the HPP sample filled out the TAS-16, a version of the TAS-20 in which four problematic items have been removed from the scale (63). However, as we wished to compare total scores from the TAS-20 between HPP and SPARK samples, we conducted single imputation for missing items in both groups using a random-forest algorithm implemented in the R missForest package (92–94). Such item-level imputation allowed for us to approximate the TAS-20 score distribution of the HPP participants, including the proportion of individuals exceeding the “high alexithymia” cutoff of 61. Notably, although the “high alexithymia” cutoff is theoretically questionable given the taxometric evidence for alexithymia as a purely dimensional construct (2), we chose to calculate this measure to facilitate comparisons with prior literature that primarily reported the proportion of autistic adults exceeding this cutoff (3).

Clinical Measures for Validity Testing

In addition to the TAS-20, individuals in the SPARK sample only completed a number of other self-report questionnaires, including measures of autism symptomatology, co-occurring psychopathology, trait neuroticism, and autism-related quality of life. Measures of autistic traits included the Social Responsiveness Scale–Second Edition (SRS-2) total T-score (95) and a self-report version of the Repetitive Behavior Scale–Revised (RBS-R) (96,97), from which we derived measures of “lower-order” and “higher-order” repetitive behaviors (i.e., the Sensory Motor [SM] and Ritualistic/Sameness [RS] subscales reported by McDermott et al. (96)). Depression was measured using autism-specific scores on the Beck Depression Inventory–II (BDI-II) (78,98), and we additionally used BDI-II item 9 (Suicidal Thoughts or Wishes) to quantify current suicidality. We additionally assessed generalized and social anxiety using the Generalized Anxiety Disorder–7 (GAD-7) (99) and Brief Fear of Negative Evaluation Scale–Short Form (BFNE-S) (100,101), respectively. Somatization was quantified using a modified version of the Patient Health Questionnaire–15 (PHQ-15) (102), which extended the symptom recall period to three months and excluded the two symptoms of dyspareunia and menstrual problems. We measured trait neuroticism using ten items from the international personality item pool (103), originally from the Multidimensional Personality Questionnaire’s “Stress Reaction” subscale (104) and referred to here as the IPIP-N10. Lastly, autism-related quality of life was measured using the Autism Spectrum Quality of Life (ASQoL) questionnaire (105). More in-depth descriptions of all measures analyzed in the current study, including reliability estimates in the SPARK sample, can be found in the Supplemental Methods.

Statistical Analyses

Confirmatory Factor Analysis and Model-based Bifactor Coefficients

All statistical analyses were performed in the R statistical computing environment (106).

In order to test the appropriateness of the proposed TAS-20 factor structure in autistic adults, we performed a confirmatory factor analysis (CFA) on TAS-20 item responses in our SPARK sample. The measurement model in our CFA included a bifactor structure with one “general alexithymia” factor onto which all items loaded, as well as four “specific” factors representing the three subscales of the TAS-20 and the common method factor for the reverse-coded items (69). In addition, given the previously identified problems with the EOT subscale and the reverse-coded items (2), we additionally examined a bifactor model fit only to the forward-coded DIF and DDF items, removing both the EOT and reverse-coded items. Although not the focus of the current investigation, we also fit the original and reduced TAS factor models in the HPP sample in order to determine whether any identified model misfit was present only in autistic adults or more generally across both samples. We fit the model using a diagonally weighted least squares estimator (107) with a mean- and variance-corrected test statistic (i.e., “WLSMV” estimation), as implemented in the R package lavaan (108). Very few of the item responses in our dataset contained missing values (0.16% missing item responses in the SPARK sample, no missing TAS-16 data in HPP sample), and missing values were singly imputed using missForest (92–94).

Model fit was evaluated using the Chi square test of exact fit, comparative fit index (CFI; 109), Tucker-Lewis index (TLI; 110), root mean square error of approximation (RMSEA; 111), standardized root mean square residual (SRMR; 112), and weighted root mean square residual (WRMR; 113,114). The categorical maximum likelihood (cML) estimator proposed by Savalei (115) was used to calculate the CFI, TLI, and RMSEA, as these indices better approximate the population values of the maximum likelihood-based fit indices used in linear CFA than analogous measures calculated from the WLSMV test statistic (116). Moreover, the SRMR was calculated using the unbiased estimator (i.e., SRMR_u) proposed by Maydeu-Olivares (117, see also 118) and implemented in lavaan for categorical estimators. CFI_cML/TLI_cML values greater than 0.95, RMSEA_cML values less than 0.06, SRMR_u values less than 0.08, and WRMR values less than 1.0 were defined as indicating adequate global model fit, based on standard rules of thumb employed in the structural equation modeling literature (112–114). In addition to the aforementioned global fit indices, we checked for localized areas of model misfit based on examination of the residual correlations (119), with residuals greater than 0.1 indicating areas of potentially significant misfit and/or violations of local independence (120).

Confirmatory bifactor models were further interrogated with the calculation of several model-based coefficients (121–123) including (a) coefficient omega total (ω_T), a measure of the reliability of the multidimensional TAS-20 total score, (b) coefficient omega hierarchical (ω_H), a measure of general factor saturation (i.e., the proportion of total score variance attributable to the general factor), (c) coefficient omega subscale (ω_S), a measure of the reliability for each individual subscale, (d) coefficient omega hierarchical subscale (ω_HS), a measure of the proportion of subscale variance attributable to the specific factor, (e) the explained common variance (ECV; the ratio of general factor variance to group factor variance) for the total score and each item separately, and (f) the percentage of uncontaminated correlations (PUC), a supplementary index used in tandem with total ECV to determine whether a scale can be considered “essentially unidimensional” (122,124). Omega coefficients calculated in the current study were based on the categorical data estimator proposed by Green and Yang (125). ECV coefficients were also calculated for individual subscales (S-ECV) as an additional measure of subscale general factor saturation.

Item Response Theory and Differential Item Functioning Analyses

After selecting an appropriate factor model, we evaluated the ECV and PUC coefficients to determine whether the model could be reasonably well-approximated by a unidimensional item response theory (IRT) model. We then fit the data from the TAS items included in the best-fitting factor model to a graded response model (126) in our SPARK sample using maximum marginal likelihood estimation (127), as implemented in the mirt R package (128). Model fit was assessed using the limited-information C₂ statistic (129,130), as well as C₂-based approximate fit indices and SRMR. Based on previously-published guidelines (131), we defined values of CFI_C2 > 0.975, RMSEA_C2 < 0.089, and SRMR < 0.05 as indicative of good model fit. Residual correlations were examined to determine areas of local dependence, with values greater than ± 0.1 indicative of potential misfit. Items with multiple large residual correlations were flagged for removal, and the IRT model was then re-fit and iteratively tested until all areas of local misfit were removed.

After refining the unidimensional TAS model in the SPARK sample, we further investigated the same model in the HPP sample. Once a structural model was found to fit in both samples, we fit a multi-group graded response model to the full dataset, using this model to examine I-DIF between groups. DIF was tested using the iterative Wald test procedure proposed by Cao et al. (132) and implemented in R by the first author (133), with Benjamini-Hochberg–corrected (134) p values < 0.05 used to flag items for I-DIF. Significant omnibus Wald tests were followed up with tests of individual item parameters to determine which parameters significantly differed between groups (135). Notably, this I-DIF procedure is quite powerful in large sample sizes, and thus I-DIF effect-size indices were used to determine whether the differential functioning of a given item was small enough to be ignorable in practice. In particular, we used the weighted area between curves (wABC) as a measure of I-DIF magnitude, with values greater than 0.30 indicative of practically significant I-DIF (83). We additionally reported the expected score standardized difference (ESSD), a standardized effect size interpretable on the metric of Cohen’s d (82). Items exhibiting practically significant I-DIF between autistic and non-autistic adults were further flagged for removal, and this process was repeated iteratively until the resulting TAS short form contained no items with practically significant I-DIF by diagnostic group. The total effect of all I-DIF (i.e., differential test functioning [DTF]) was then estimated using the unsigned expected test score difference in the sample (UETSDS), the expected absolute difference in manifest test scores between individuals of different groups possessing the same underlying trait level (83).

After removing items based on between-group I-DIF, we then examined I-DIF of the resulting short form across subsets of the autistic population. Using the same iterative Wald procedure and effect size criteria as the between-group analyses, we tested whether TAS items functioned differently across groups based on sex, gender, age (> 30 vs. ≤30 years), race (non-Hispanic White vs. Other), level of education (any higher education vs. no higher education), age of autism diagnosis (≥ 18 years old vs. <18 years), self-reported co-occurring conditions (current depressive disorder, current anxiety disorder, and lifetime attention deficit hyperactivity disorder [ADHD]). Although many fewer stratification variables were collected in the HPP sample, I-DIF was also examined within that sample according to age (> 30 vs. ≤30 years), sex, and phase of the project (i.e., pilot study vs. multi-site study). These I-DIF results were used to further refine the measure such that the resulting TAS short form exhibited I-DIF across all groups that was small enough to be practically ignorable. All items retained in the TAS form at this stage were incorporated into the final measure.

Once the TAS short form was finalized, we then fit an additional multi-group graded response model on only those final items, constraining item parameters to be equal between groups and setting the scale of the latent variable by constraining the general population sample to have a mean of 0 and standard deviation of 1. Using this model, we then estimated maximum a-posteriori (MAP) TAS latent trait scores for each individual, which were interpretable as Z-scores relative to the general population (i.e., a score of 1 is one full standard deviation above the mean of our non-autistic normative sample). Individual reliability coefficients were also examined, with values greater than 0.7 being deemed sufficiently reliable for interpretation at the individual level.

Validity Testing

To further test the validity of the newly generated TAS latent trait scores in autistic adults, we investigated the relationships between these scores and a number of clinical variables that have previously demonstrated relationships with alexithymia in either autistic adults or the general population. Based on previous literature (58), we hypothesized that alexithymia would show moderate to strong positive correlations with neuroticism (IPIP-N10), autistic traits (SRS-2), repetitive behavior (RBS-R), depression (BDI-II), generalized anxiety (GAD-7), social anxiety (BFNE-S), suicidality (BDI item 9), and somatic symptom burden (PHQ-15), as well as moderate negative correlations with autism-specific QoL (ASQoL). Given the documented relationships between neuroticism and alexithymia, we further examined the magnitude of these correlations after controlling for levels of neuroticism. We additionally examined relationships between alexithymia scores and demographic variables, including age, sex, race/ethnicity, age of autism diagnosis, and level of education. Notably, alexithymia is correlated with older age, male sex, and lower education level in the general population (136–138), and we expected that these relationships would replicate in the current SPARK sample (with the exception of the correlation with age, given the restricted age range in our current sample). We did not, however, expect to find significant associations between alexithymia and race/ethnicity or age of autism diagnosis.

Relationships between alexithymia and external variables were examined using robust Bayesian variants of the Pearson correlation coefficient (for continuous variables, e.g., SRS-2 scores), polyserial correlation coefficient (for ordinal variables, such as the BDI-II suicidality item and education level), partial correlation coefficient (when testing relationships after controlling for neuroticism), and unequal-variances t-test (139–141), as implemented using custom R code (142) and the brms package (143). Additional technical details regarding model estimation procedures and prior distributions can be found in the Supplemental Methods. Standardized effect sizes produced by these methods (i.e., r, r_p, and d) were summarized using the posterior median and 95% highest-density credible interval (CrI).

Readability Analysis

As a supplemental analysis, we evaluated the readability of the TAS-20 and the newly-derived short form using the FORCAST formula (150). This formula is well-suited for questionnaire material, as it ignores the number of sentences, average sentence length, or hard punctuation (standard metrics for text in prose form), instead focusing exclusively on the number of monosyllabic words (151). FORCAST grade level equivalent was calculated for both the TAS-20 (excluding the questionnaire directions) and the short form derived in the current study using Readability Studio version 2019.3 (Oleander Software, Ltd, Vandalia, OH, USA). Although we did not attempt to select items based on readability, this analysis was constructed to ensure that shortening of the TAS questionnaire did not substantially increase the reading level, thereby making the short form measure less accessible to younger or less educated respondents.

Participants and Demographics

In total, our sample included TAS data from 1464 unique individuals across the two data sources (Table 1). Autistic adults in the SPARK sample (n = 743, age = 30.91 ± 7.02 years, 63.5% female sex) were predominantly non-Hispanic White (79.4%) and college-educated (46.4% with a 2- or 4-year college degree, and an additional 26.5% with some college but no degree), similar to the previous sample drawn from this same SPARK project (78). The median age of autism diagnosis was 19.17 years (IQR = [10.33, 28.79]), indicating the majority of individuals in the sample were diagnosed in adulthood. The majority of participants reported a current depressive or anxiety disorder (defined as symptoms in the past three months or an individual currently being treated for one of these disorders), with depression present in 59.2% and anxiety present in 71.7%. TAS-20 scores in the SPARK sample were present across the full range of trait levels (M = 60.55, SD = 13.11), and just over half of the sample (54.5%) was classified as “high alexithymia” based on TAS-20 total scores greater than or equal to 61. Less demographic information was available for the general population adults in the HPP sample (n = 721, age = 30.92 ± 13.01 years, 64.9% female), but the available demographics indicated that these individuals were well-matched to the SPARK sample on age and sex. Partially imputed TAS-20 scores in the HPP sample were slightly higher than other general population samples (M = 50.21, SD = 11.21), and based on these scores, 17.1% of HPP participants were classified as having “high alexithymia.” As anticipated, large differences in TAS-20 total scores were present between groups (d = 0.880, 95% CrI [0.767, 0.995]).

Table 1

Demographics for Autistic and General Population Samples

	SPARK (n = 743)	HPP (n = 721)
Age (Years)	30.91 (7.02)	30.92 (13.01)
Sex
Male	271 (36.5%)	253 (35.1%)
Female	472 (63.5%)	468 (64.9%)
Gender Identity
Cisgender Man	245 (33.0%)	—
Cisgender Woman	400 (53.8%)	—
Transgender Man	15 (2.0%)	—
Transgender Woman	6 (0.8%)	—
Non-binary	76 (10.2%)	—
Non-Hispanic White	590 (79.4%)	—
Education
No High School Diploma	25 (3.4%)	—
High School Diploma/GED	140 (18.8%)	—
Vocational Certificate	36 (4.8%)	—
Some College	197 (26.5%)	—
Associate Degree	74 (10.0%)	—
Bachelor’s Degree	171 (23.0%)	—
Graduate/Professional Degree	100 (13.5%)	—
Age of Autism Diagnosis (Years)	19.67 (11.17)	—
Current Depression	440 (59.2%)	—
Current Anxiety	533 (71.7%)	—
Current Suicidality	292 (39.3%)	—
Lifetime ADHD	342 (46.0%)	—
TAS-20 Total Score	60.55 (13.11)	50.21 (11.21)^a
TAS-8 Latent Trait Score	1.01 (1.17)	0.01 (0.93)
"High Alexithymia" (TAS-20 ≥ 61)	405 (54.5%)	123 (17.1%)^a

Note. Continuous variables are presented as M (SD), and categorical variables are presented as N (%). All data in both samples were gathered by self-report. SPARK = Simons Powering Autism Research Knowledge; HPP = Human Penguin Project; ADHD = attention deficit hyperactivity disorder; TAS = Toronto Alexithymia Scale.

^a Participants in the HPP sample completed a 16-item version of the TAS, which excluded items 16, 17, 18, and 20. For comparison with the TAS-20 scores in the SPARK sample, these four items were imputed for all HPP participants using random forest imputation.

Confirmatory Factor Analysis

Within the SPARK sample, the confirmatory factor model for the full TAS-20 exhibited subpar model fit, with only the SRMR_u meeting a priori fit index cutoff values (Table 2). Additionally, examination of residual correlations revealed five values greater than 0.1, indicating a non-ignorable degree of local model misfit. Model-based bifactor coefficients indicated strong reliability and general factor saturation of the TAS-20 composite (ω_T = 0.912, ω_H = 0.773), though the ECV/PUC indicated that the scale could not be considered “essentially unidimensional” (ECV = 0.635, PUC = 66.8%). Both the DIF and DDF subscales exhibited good composite score reliability (ω_S = 0.906 and 0.854, respectively), although omega hierarchical coefficients indicated that the vast majority of reliable variance in each subscale was due to the “general alexithymia” factor (DIF: ω_HS = 0.162, S-ECV = 0.753; DDF: ω_HS = 0.145, S-ECV = 0.768, respectively). Conversely, the EOT subscale exhibited very poor reliability, with only one fourth of common subscale variance attributable to the general factor (ω_S = 0.451, ω_HS = 0.300, S-ECV = 0.245). Examination of the factor loadings further confirmed the inadequacy of the EOT subscale, as seven of the eight EOT items (5, 8, 10, 15, 16, 18, 19, and 20) loaded poorly onto the “general alexithymia” factor (λ_G = -0.116–0.311; Supplemental Table S1). Notably, these psychometric issues were not limited to autistic adults. The fit of the TAS-20 CFA model in the HPP sample was equally poor, and bifactor coefficients indicating the psychometric inadequacy of the EOT and reverse-scored items were replicated in this sample as well (Table 2).

Table 2

Confirmatory Factor Analysis Fit Indices and Model-based Omega Coefficients

Index	TAS-20 Bifactor: SPARK	TAS-20 Bifactor: HPP	TAS-11 Bifactor: SPARK	TAS-11 Bifactor: HPP
Model Fit Indices
χ2 (df)^a	590.6 (145)	669.9 (145)	151.6 (33)	124.0 (33)
CFI_cML	0.924	0.900	0.970	0.978
TLI_cML	0.900	0.869	0.951	0.963
RMSEA_cML [90% CI]	0.072 [0.066, 0.078]	0.086 [0.081, 0.092]	0.080 [0.069, 0.092]	0.068 [0.056, 0.079]
SRMR_u [90% CI]	0.036 [0.033, 0.004]	0.051 [0.047, 0.056]	0.020 [0.017, 0.024]	0.019 [00.015, 0.023]
WRMR	1.119	1.565	0.768	0.699
\|Residuals\| > 0.1	2.60%	8.90%	0%	0%
Largest Residual	0.149	0.225	0.084	0.055
Bifactor Coefficients
w_T/w_H	0.912/0.773	0.914/0.741	0.929/0.861	0.925/0.952
w_S/w_HS (DIF)	0.906/0.162	0.880/0.224	0.913/0.087	0.892/0.071
w_S/w_HS (DDF)	0.854/0.145	0.803/0.120	0.800/0.163	0.839/0.223
w_S/w_HS (EOT)	0.451/0.300	0.512/0.307	—	—
w_S/w_HS (REV)	0.559/0.441	0.692/0.689	—	—

Note. Fit indices that above the a priori cutoffs for acceptable model fit (CFI/TLI > 0.95, RMSEA < 0.06, SRMR < 0.08, WRMR < 1, all residuals < 0.1) are presented in bold. TAS = Toronto Alexithymia Scale; SPARK = Simons Powering Autism Research Knowledge; HPP = Human Penguin Project; CFI_cML = comparative fit index (categorical maximum likelihood estimation); TLI_cML= Tucker-Lewis Index (categorical maximum likelihood estimation); RMSEA_cML = root mean square error of approximation (categorical maximum likelihood estimation); SRMR_u = population-unbiased standardized root mean square residual; WRMR = weighted root mean square residual; w_T = omega total (composite reliability of total score); w_H= omega hierarchical (proportion of total score variance accounted for by general factor); w_S = omega subscale (composite reliability of subscale score); w_HS = omega hierarchical subscale (proportion of subscale score variance accounted for by specific factor); DIF = difficulty identifying feelings; DDF = difficulty describing feelings; EOT = externally-oriented thinking; REV = reverse-coded item method factor.

^a all p values < 0.001

Following the removal of the EOT and reverse-coded items from the TAS-20, we fit a bifactor model with two specific factors (DIF and DDF) to the remaining 11 items in our SPARK sample. The fit of this model was substantially improved over the TAS-20, with all indices except RMSEA_cML exceeding a priori designated cutoffs (Table 2) and all residuals correlations below 0.1. Moreover, model-based coefficients (ECV = 0.815; PUC = 50.9%) indicated that the 11-item TAS was unidimensional enough to be fit by a standard graded response model with little parameter bias. Notably, the estimated reliability and general factor saturation of the 11-item TAS composite score were higher than those of the 20-item composite (ω_T = 0.925, ω_H = 0.852), suggesting that the inclusion of EOT and reverse-coded items on the scale actually reduces the amount of scale variance attributable to the underlying alexithymia construct. Fit of the 11-item TAS model in the HPP sample was equally strong (Table 2), with an approximately equal ECV (0.793) supporting the essential unidimensionality of this scale in both samples.

Item Response Theory Analyses

A unidimensional graded response model fit to the 11-item TAS short form did not display adequate fit according to a priori fit index guidelines (C₂(44) = 485.7, p < 0.001, CFI_C2 = 0.955, RMSEA_C2 = 0.116, SRMR = 0.068). Examination of residual correlations indicated that item 7 (I am often puzzled by sensations in my body) was particularly problematic, exhibiting a very large residual correlation of 0.259 with item 3 as well as two other residuals greater than 0.1. Removal of this item caused the resulting 10-item graded response model to approximately meet the minimum standards for adequate fit (C₂(35) = 485.7, p < 0.001, CFI_C2 = 0.976, RMSEA_C2 = 0.086, SRMR = 0.051), with all remaining residual correlations below 0.1. The overall fit of this 10-item model was somewhat worse in the HPP sample (C₂(35) = 319.9, p < 0.001, CFI_C2 = 0.960, RMSEA_C2 = 0.106, SRMR = 0.065); however, it is notable that this model contained item 17, which was not contained within the TAS-16 and was thus fully imputed in the HPP sample. Removal of this item resulted in a substantial improvement in fit in the HPP sample (C₂(27) = 169.1, p < 0.001, CFI_C2 = 0.974, RMSEA_C2 = 0.086, SRMR = 0.058), with fit indices approximately reaching the a priori cutoffs. As the 9-item TAS also exhibited good fit in the SPARK sample (C₂(27) = 161.7, p < 0.001, CFI_C2 = 0.980, RMSEA_C2 = 0.082, SRMR = 0.049), we chose this version of the measure to test I-DIF between autistic and general population adults.

For the remaining nine TAS items, I-DIF was evaluated across diagnostic groups using the iterative Wald test procedure. Significant I-DIF was found in eight of the nine items (all except item 6) at the p < 0.05 level (Table 3); however, effect size indices suggested that practically significant I-DIF was only present in item 3 (I have physical sensations that even doctors don’t understand; wABC = 0.433, ESSD = 0.670). The remaining items all exhibited I-DIF with small standardized effect sizes (all wABC < 0.165, all |ESSD| < 0.187), allowing these effects to be ignored in practice (83). After removal of item 3, we re-tested I-DIF the resulting eight-item scale (TAS-8), producing nearly identical results (significant I-DIF for all items except 6; all wABC < 0.167, all |ESSD| < 0.186). The overall DTF of the TAS-8 was also small enough to be ignorable, with the average difference in total scores between autistic and non-autistic adults of the same trait level being less than 0.5 scale points (UETSDS = 0.460, ETSSD = -0.011).

Table 3

Differential Item Functioning Results Comparing Autistic and General Population Adults on 9-item Toronto Alexithymia Scale

TAS-20 Item #	χ²(5)	p-value	wABC	ESSD	Parameters^a
1	35.30	<0.001	0.089	-0.018	a₁, d₁, d₂
2	23.18	<0.001	0.164	0.157	d₂, d₃
3	65.10	<0.001	0.433^b	0.670^b	*d₂, d₃, d₄*
9	26.03	<0.001	0.064	-0.021	d₁
11	30.47	<0.001	0.165	0.001	a₁, d₂, d₃
12	30.19	<0.001	0.149	-0.187	d₁
13	57.66	<0.001	0.064	-0.022	a₁, d₁, d₂, d₃, d₄
14	61.90	<0.001	0.031	-0.022	a₁, d₁, d₂, d₃, d₄

Note. Results indicate omnibus Wald tests of differential item functioning using the iterative anchor-selection method of Cao et al. (2017). P-values are corrected for a 5% false discovery rate using the Benjamini-Hochberg procedure. Parameters that were significantly different between groups when tested alone with follow-up Wald tests (FDR < 0.05) are indicated in the Parameters column. wABC = weighted area between curves; ESSD = expected score standardized difference (in Cohen’s d metric); a₁ = slope parameter; d₁-d₄ = item intercept parameters (i.e., item “difficulty” parameters).

^aParameters in bold are larger (i.e., more discriminating for a parameters and “easier” for d parameters) in the autistic group. Larger values of a indicate that the item is more strongly related to the latent trait in autistic adults, whereas larger values of d indicate that a given item response is endorsed at lower latent trait levels in autistic adults relative to the general population.

^bPractically significant DIF (i.e., wABC > 0.3).

After establishing practical equivalence in item parameters between the two diagnostic groups, we then tested I-DIF for the TAS-8 for a number of subgroups within the HPP and SPARK samples. Within the general population HPP sample, all eight TAS-8 items displayed no significant I-DIF across by sex, age (≥ 30 vs. <30), or phase of the HPP study (all ps > 0.131). Similarly, in the SPARK sample, there was no significant I-DIF by sex, gender, race, education level, current anxiety disorder, history of ADHD, or current suicidality (all ps > 0.105). However, significant I-DIF was found across several demographics, including age (item 6; wABC = 0.0543, ESSD = -0.045), age of autism diagnosis (items 2, 6, and 14; all wABC < 0.267, all |ESSD| < 0.135), and current depressive disorder (item 13; wABC = 0.274, ESSD = 0.361), although wABC values for these items indicated that the degree of I-DIF was ignorable in practice.

As no items of the TAS-8 exhibited practically significant I-DIF across any of the tested contrasts, we retained all eight items for the final TAS short form. A graded response model fit to the full sample exhibited adequate fit (C₂(20) = 240.4, p < 0.001, CFI_C2 = 0.983, RMSEA_C2 = 0.087, SRMR = 0.045) and no residual correlations greater than 0.1. A multi-group model with freely estimated mean/variance for the autistic group was used to calculate the final item parameters (Table 4), as well as individual latent trait scores. Item characteristic curves indicated that all TAS-8 items behaved appropriately, although the middle response option was insufficiently utilized for three of the eight items (Fig. 1). The MAP-estimated latent trait scores for the TAS-8 showed strong marginal reliability (ρ_xx = 0.895, 95% bootstrapped CI: [0.895, 0.916]), and individual reliabilities were greater than the minimally acceptable 0.7 for the full range of possible TAS-8 scores (i.e., latent trait values between − 2.19 and 3.52; Fig. 2A). Item information plots for the eight TAS-8 items (Fig. 2B) indicated that all items contributed meaningful information to the overall test along the full trait distribution of interest. TAS-8 latent trait scores were also highly correlated with total scores on the TAS-20 (r = 0.910, 95% CrI [0.897, 0.922]), indicating that the general alexithymia factor being assessed by this short form is strongly related to the alexithymia construct as operationalized by the TAS-20 total score. Diagnostic group differences in TAS-8 latent trait scores remained large, with autistic individuals demonstrating substantially elevated levels of alexithymia on this measure (d = 1.014 [0.887, 1.139]).

Table 4

TAS-8 Graded Response Model Parameters and Equivalent Factor Loadings for Full Sample

TAS-20 Item #	Item Content	a₁	d₁	d₂	d₃	d₄	l	h²
1	I am often confused about what emotion I am feeling.	2.802	3.092	-0.689	-2.740	-6.336	0.855	0.731
2	It is difficult for me to find the right words for my feelings.	2.190	3.478	0.491	-0.931	-3.841	0.790	0.623
6	When I am upset, I don’t know if I am sad, frightened, or angry.	2.335	2.090	-0.805	-2.413	-5.497	0.808	0.653
9	I have feelings that I can’t quite identify.	2.402	3.137	0.072	-1.434	-5.170	0.816	0.666
11	I find it hard to describe how I feel about people.	1.870	2.745	-0.234	-1.505	-4.340	0.740	0.547
12	People tell me to describe my feelings more.	1.235	1.739	-0.526	-1.636	-3.644	0.587	0.345
13	I don’t know what’s going on inside me.	1.892	2.054	-0.646	-2.231	-4.771	0.743	0.553
14	I often don’t know why I am angry.	1.538	1.285	-1.133	-2.201	-4.361	0.671	0.450

Note. Parameters estimated using maximum marginal likelihood based on Bock-Aitkin EM algorithm. This model contained two groups: general population (q fixed to M = 0, SD = 1 in this group) and autistic group (mean and SD of q free to vary), with all item parameters constrained to equality between groups. TAS = Toronto Alexithymia Scale; a₁ = slope parameter; d₁-d₄ = item intercept parameters (more positive values indicate “easier” items); l = factor loading on single factor; h² = communality (squared factor loading).

Validity Analyses

Overall, the TAS-8 latent trait scored demonstrated a pattern of correlations with other variables that generally resembled the relationships seen in other clinical and non-clinical samples (Table 5). The TAS-8 latent trait score was highly correlated with autistic traits as measured by the SRS-2 (r = 0.642 [0.598, 0.686]), additionally exhibiting moderate correlations with lower-order (r = 0.386 [0.320, 0.450]) and higher-order (r = 0.432 [0.372, 0.494]) repetitive behaviors as measured by the RBS-R. TAS-8 latent trait scores were also correlated with psychopathology measures, exhibiting the hypothesized pattern of correlations with depression, anxiety, somatic symptom burden, social anxiety, and suicidality, as well as lower autism-related quality of life. As with other versions of the TAS, the TAS-8 displayed a moderate-to-large correlation with trait neuroticism (r = 0.475 [0.416, 0.531]), raising the possibility that relationships between TAS-8 scores and internalizing psychopathology are driven by neuroticism rather than alexithymia per se. To investigate this possibility further, we calculated partial correlations between the TAS-8 and other variables after controlling for IPIP-N10 scores, using a Bayes factor to test the interval null hypothesis that r_p falls between − 0.1 and 0.1 (i.e., < 1% of additional variance in the outcome is explained by the TAS-8 score after accounting for neuroticism). Bayes factors provided substantial evidence that the partial correlations between the TAS-8 and SRS-2, RBS-R subscales, BDI-II, and ASQoL exceeded the ROPE. Additionally, while partial correlations with the BFNE-S, PHQ-15, and BDI suicidality item were all greater than zero, Bayes factors suggested that all three of these correlations were more likely to lie within the ROPE than outside of it (all BF_ROPE < 0.258). There was only anecdotal evidence that the partial correlation between the TAS-8 and GAD-7 exceeded the ROPE (BF_ROPE = 2.18). However, there was a 91.3% posterior probability of that correlation exceeding the ROPE, suggesting that there was a strong likelihood of alexithymia explaining a meaningful amount of additional variance in anxiety symptoms beyond that accounted for by neuroticism.

The relationships between TAS-8 scores and demographic variables were also examined in order to determine whether relationships found in the general population apply to autistic adults. As hypothesized, TAS-8 scores showed a small and practically insignificant correlation with age (r = 0.032 [-0.041, 0.104], BF_ROPE= 5.77 ´ 10^-6), likely due to the absence of older adults (i.e., ages 60+) in our sample. Alexithymia also showed a nonzero negative correlation with education level, although the magnitude of this relationship was small enough to not be practically significant (r_poly = -0.089 [-0.163, -0.017], BF_ROPE = 0.045). Unlike in the general population, females in the SPARK sample had slightly higher TAS-8 scores (d = 0.183 [0.022, 0.343]), although this difference was small and not practically significant (BF_ROPE = 0.265). Additionally, there was an absence of practically significant differences in alexithymia by race/ethnicity (d = -0.052 [-0.247, 0.141], BF_ROPE = 0.029). Lastly, age of autism diagnosis was positively correlated with TAS-8 scores (r = 0.133 [0.06, 0.204]), although this correlation was also small enough to not be practically significant (BF_ROPE = 0.014).

Readability Analysis

Using the FORCAST algorithm, we calculated the equivalent grade level of the full TAS-20 (including instructions) to be 10.2 (i.e., appropriate for individuals at the reading level of a high school sophomore after the second month of class). Using this same algorithm, the TAS-8 items had a readability of 8.8, indicating a moderate decrease in word difficulty. Thus, in addition to improving the psychometric properties of the measure, our item reduction procedure seemingly improved the overall readability of the TAS.

While alexithymia is theorized to account for many traits associated with the autism phenotype (39–51), studies to date have not typically assessed the psychometric properties of alexithymia measures in the autistic population, and the suitability of most alexithymia measures for use in autistic individuals remains largely unknown. In the current study, we performed a rigorous examination of the psychometric properties of the TAS-20, the most widely used measure of self-reported alexithymia, in a large and diverse sample of autistic adults. Overall, we found the TAS-20 questionnaire to have a number of psychometric issues, including a poorly-fitting measurement model, several items that are minimally related to the overall alexithymia construct, and items that function differentially when answered by autistic and non-autistic adults. In response to these issues, we performed an empirically-based item reduction of the TAS-20 questionnaire, which resulted in an eight-item unidimensional TAS short form (TAS-8). The TAS-8 was found to be a psychometrically robust instrument in both general population and autistic samples, displaying strong model-data fit to a unidimensional structure, high score reliability, strong nomological validity, and practically ignorable amounts of I-DIF between diagnostic groups and subgroups of autistic and general population adults. Item reduction also significantly reduced the reading level of the TAS-8 compared to the TAS-20, indicating that this form may be more comprehensible by younger, less educated, or less cognitively able respondents. In sum, our findings suggest that the TAS-8 is a reliable and valid measure of alexithymia suitable for use by autistic adults as well as adults in the general population.

While the 20-item TAS possessed adequate composite score reliability in our sample, bifactor confirmatory factor models failed to support the theorized structure of the questionnaire in the autistic population. The TAS-20 items assessing the EOT facet of the alexithymia construct and the form’s reverse-coded items were particularly problematic, both exhibiting poor subscale reliabilities and contributing little common variance to the general alexithymia factor. These psychometric issues were further confirmed in our general population HPP sample, indicating that these problems were not unique to the autistic population. Removal of the EOT and reverse-coded items from the model greatly improved overall fit, but three additional items needed to be removed in order to meet our a priori standards of adequate IRT model fit and negligible I-DIF by diagnostic group. The final TAS-8 short form consisted of five DIF items (1, 6, 9, 13, and 14) and three DDF items (2, 11, and 12) that ostensibly form the core of the “general alexithymia” construct measured by the TAS-20 total score. Using item response theory, we generated norm-referenced TAS-8 scores that are immediately interpretable on the scale of a Z-score (i.e., M = 0, SD = 1) and can similarly be scaled to the familiar T-score metric (M = 50, SD = 10). As scores on the TAS-8 are both norm-referenced and psychometrically robust, we believe they present a viable alternative to TAS-20 total scores in any study protocol that includes the TAS-20 or one of its short forms (notably, these scores can be calculated from any subset of the eight TAS-8 items). To facilitate the calculation and use of the TAS-8 latent trait scores in alexithymia research, we have created an easy-to-use online scoring tool (available at http://asdmeasures.shinyapps.io/TAS8_Score) that converts TAS-8 item responses into general population-normed latent trait scores and corresponding T-scores.

Tests of convergent and divergent validity of the TAS-8 score were largely in line with prior results, indicating that self-reported alexithymia is moderately to strongly correlated with autistic traits, repetitive behaviors, internalizing psychopathology, suicidality, and poorer quality of life. Relationships were also observed between TAS-8 scores and sex, age of autism diagnosis, and education level, although these effects were small enough to be practically insignificant (i.e., |r|s < 0.2 and |d|s < 0.2). Moreover, despite a fairly large correlation between TAS-8 scores and neuroticism, partial correlation analyses demonstrated that alexithymia still explained substantial unique variance in autism symptomatology, depression, generalized anxiety, and quality of life over and above that accounted for by neuroticism. However, partial correlations with somatic symptom burden, social anxiety, and suicidal ideation failed to exceed the pre-specified interval null hypothesis, indicating that alexithymia in the autistic population only predicts these symptom domains insofar as it correlates positively with trait neuroticism. As alternative measures of alexithymia such as the TSIA (71) do not correlate highly with neuroticism (72,73), future research should investigate the degree to which alexithymia measured multimodally continues to predict internalizing psychopathology in the autistic population and other clinical groups of interest.

One particularly surprising finding is the poor correlation between alexithymia and somatic symptom burden, given the theoretical status of alexithymia as a potential driver of somatization and a large literature showing relationships between these constructs (2). One particular reason that this correlation may be substantially attenuated is that our short form removed the psychometrically problematic TAS-20 item 3 (I have physical sensations that even doctors don’t understand.), which in addition to assessing the experience of undifferentiated emotions common in alexithymia also seemingly captures the phenomenon of medically unexplained symptoms. We confirmed that this was in fact the case in our SPARK sample, as the polyserial correlation between this item and PHQ-15 total scores was very high (r_poly = 0.492 [0.435, 0.543]) and very minimally attenuated after controlling for overall alexithymia as measured by the TAS-8 latent trait score (r_p,poly = 0.424 [0.364, 0.485], BF_ROPE = 4.79 ⋅ 10¹⁰). Notably, a recent study has found that item 3 of the TAS-20 is the single most important item when discriminating individuals with a functional somatic condition (fibromyalgia) from healthy controls (152), providing additional evidence to support our suspicion that this particular item drives much of the correlation between the TAS-20 and somatic symptomatology. Additional work in this area should attempt to measure alexithymia in a multimodal manner (e.g., simultaneously administering the TAS-8, a second self-report questionnaire such as the BVAQ (61) or Perth Alexithymia Questionnaire [PAQ] (70), an observer-report measure such as the Observer Alexithymia Scale (153), and an interview measure such as the TSIA), determining which relationships between alexithymia and important covariates of interest (e.g., somatization, autism symptoms, emotion recognition, and psychopathology) are due to the underlying alexithymia construct or measurement artifacts specific to certain alexithymia assessments.

This work has meaningful implications for the study of alexithymia in the autistic population and in general, as it provides strong psychometric support for the TAS-8 questionnaire as a general-purpose measure of alexithymia across multiple clinical and non-clinical populations. These findings are particularly useful for autism research, as they indicate that the TAS-8 can be used to compare levels of alexithymia between autistic and general-population samples without worry that differences in scores are significantly biased by qualitative differences in the ways individuals in each group answer the questionnaire items. Moreover, the between-group difference in TAS-8 scores (d = 1.014) was approximately 15% larger than the same group difference in TAS-20 scores (d = 0.880), indicating that the TAS-8 is better able to discriminate between autistic and non-autistic adults than its parent form. Although the current study did not validate this form for use in other clinical populations where alexithymia is a trait of interest (e.g., individuals with eating disorders, functional somatic syndromes, substance abuse disorders, or general medical conditions), future studies in these populations are warranted to determine whether the improved measurement properties of the TAS-8 are useful in improving inferences about alexithymia in those groups as well.

This study has a number of strengths, including its large and diverse sample of both autistic and non-autistic participants, robust statistical methodology, wide array of clinical measures with which to assess the validity of the TAS-8, and consideration of the role that neuroticism plays in explaining relationships between alexithymia and internalizing psychopathology. However, this investigation is not without limitations. Most notably, the two samples of participants (from SPARK and HPP, respectively), while both recruited online, were drawn from different studies with dissimilar protocols and different versions of the TAS questionnaire. Most notably, the HPP sample completed the TAS-16 questionnaire, which omits four of the more poorly performing items of the original TAS-20. Thus, in order to estimate TAS-20 total scores in this group of individuals, we were required to impute those items for all 721 participants with an unknown degree of error. Interestingly, the HPP sample reported TAS-20 scores that were 1.5–6 points larger on average than previous large-scale general-population studies using the TAS-20 (18,154), and it is thus unclear whether the imputation of four items using data from an autistic sample artificially inflated these scores. However, as the TAS-8 did not include any of the imputed items, we can be reasonably confident that the scores on this measure genuinely reflect the true underlying alexithymia construct levels in the current general population sample.

An additional limitation is that the HPP sample was not screened for autism diagnoses, and there remains a possibility that some of these individuals could have met diagnostic criteria for autism or had a first-degree relative on the autism spectrum. However, previous studies have indicated that a small portion of autistic individuals (i.e., approximately 2% per current prevalence estimates (87)) in an otherwise neurotypical sample is insufficient to substantially bias parameter estimates or attenuate differential item functioning (78), leading us to believe that the current group comparisons remain valid. Nevertheless, the HPP sample was only assessed on a small number of relevant demographic domains, leaving unanswered questions about the relationships between alexithymia as measured by the TAS-8 and many demographic and clinical variables of interest in general-population adults. Fortunately, as the TAS-8 score can be calculated from item-level TAS-20 data, many extant datasets currently exist that can provide answers to these questions, further supporting or refuting the validity of the TAS-8 as a measure of alexithymia in the general population.

In addition to the limitations of the HPP sample, several limitations of the better-characterized SPARK sample were also present. As discussed in our previous work with this sample (78), it is not entirely representative of the autistic population, having a higher proportion of females, a higher average education level, later mean age of autism diagnosis, and a higher prevalence of co-occurring anxiety and depressive disorders than is expected in this population (155). Nevertheless, a strength of the IRT method is the fact that unrepresentative samples are able to still provide unbiased item parameter estimates provided that there is minimal I-DIF between subgroups of the population of interest (156). As we found little meaningful I-DIF within autistic adults across numerous demographic and clinical groupings, we feel very confident that the parameter estimates generated from the current study will generalize well to future samples. In addition, as SPARK does not include data on cognitive functioning, we were unable to determine whether the TAS-8 demonstrated relationships with verbal IQ, as has been previously reported with TAS-20 scores in the autistic population (51). It remains unclear whether this relationship is an artifact of the generally high reading level of the TAS-20 (which would ideally be attenuated using the TAS-8) or a manifestation of some other relationship between alexithymia and verbal intelligence (i.e., higher verbal ability may relate to an increased ability to describe subjective experiences such as emotional states and therefore less alexithymia). Future studies of alexithymia in the autistic population should incorporate measures of cognitive performance, ideally testing whether self-report measures such as the TAS-8 function equivalently in autistic adults with higher and lower verbal abilities.

Another limitation concerns the correspondence of the TAS-8 to the theoretical alexithymia construct itself. As noted previously, alexithymia is made up of four interrelated facets: DIF, DDF, EOT, and difficulty fantasizing (DFAN), the latter two of which are not measured directly by the TAS-8. Because of this, the questionnaire arguably lacks content validity compared to the full TAS-20 or four-dimensional measures such as the TSIA. However, our results indicated that the EOT factor measured by the TAS was not highly correlated with the “general alexithymia” factor (which had its highest loadings on DIF/DDF items) and therefore does not adequately measure this facet of the alexithymia construct. Other measures, such as the PAQ (70), have found that a more restricted EOT factor (primarily reflecting one’s tendency to not focus attention on one’s own emotions) correlates much more highly with other measures of the alexithymia construct, likely representing a better operationalization of the EOT facet of alexithymia. In addition, items reflecting the DFAN dimension of alexithymia have displayed poor psychometric properties in both questionnaire and interview measures, and there is currently debate as to whether these items truly measure part of the alexithymia construct (2,33,157–160). Moreover, studies in the autism population examining the correlates of alexithymia have found the DIF and DDF subscales to be most important in predicting clinically meaningful outcomes such as depression, anxiety, and social communication difficulties (58). Thus, it is our belief that the “core” of alexithymia (consisting of difficulty identifying and describing emotional experiences) is likely sufficient to represent this construct, particularly when options to measure the EOT and DFAN facets are psychometrically inadequate. Future research in alexithymia would greatly benefit from additional psychometric studies that aim to generate optimal instruments to measure all facets of the alexithymia construct, coupled with tests of the incremental validity of the EOT/DFAN trait facets over and above a score composed of solely DIF/DDF items.

A final limitation of our study is the fact that we were unable to test all meaningful psychometric properties of the TAS-8. In particular, our study was cross-sectional, necessarily prohibiting us from assessing test-retest reliability, temporal stability, and I-DIF across repeated test administrations. Additionally, as alexithymia appears to be amenable to change with psychological interventions (161,162), future studies should also investigate whether the TAS-8 latent trait score is sensitive to change, and if so, determine the minimal clinically important difference in this score. Additional psychometric characteristics that could be tested include convergent validity with other alexithymia measures such as the PAQ or TSIA, predictive validity for clinically meaningful outcomes, and I-DIF across language, culture, medium of administration (e.g., pen and paper vs. electronic), age group (e.g., adolescents vs. adults), and other diagnostic contrasts beyond the autism population. As inferences in the psychological science are only as reliable and valid as the measures they utilize (163), we encourage autism researchers and individuals in psychological science more broadly to consider the importance of measurement in their science and to devote more effort to investigating and justifying the ways in which complex psychological constructs such as alexithymia are operationalized.

The TAS-20 is a widely used measure of alexithymia that has more recently become the de facto measure of choice for this construct in the autism literature. However, this measure has so far lacked robust psychometric evidence for its reliability and validity in the population of autistic adults. Leveraging two large datasets of autistic and general-population adults, we performed an in-depth investigation of the TAS-20 and its measurement properties in autistic adults, revealing several psychometric shortcomings of this commonly used questionnaire. By reducing the number of items on the measure, we were able to produce a unidimensional short form, the TAS-8, which exhibited superior psychometric properties to the TAS-20 in samples of both autistic and non-autistic adults. Furthermore, in order to allow others to utilize the population-normed latent trait scores generated by our IRT model, we have created a user-friendly online score calculator for the TAS-8 that is freely available to interested researchers (https://asdmeasures.shinyapps.io/TAS8_Score/). Although the measurement properties of the TAS-8 were strong in this study we stress that this single measure should not be considered the “gold standard” of alexithymia measurement in autism or any other population. In agreement with the original authors of the TAS (2), we recommend that researchers interested in robustly measuring the alexithymia construct use multiple measures that include both self- and proxy-report questionnaires, ideally supplemented by observational or interview measures. Additional studies are still needed to fully explore the psychometric properties of the TAS-8, but in light of the current study, we believe that this revised questionnaire has potential to greatly improve the measurement of alexithymia both within and outside the field of autism research.

Attention Deficit Hyperactivity Disorder (ADHD)

Autism Spectrum Quality of Life (ASQoL)

Beck Depression Inventory–II (BDI-II)

Bermond-Vorst Alexithymia Questionnaire (BVAQ)

Brief Fear of Negative Evaluation–Short (BFNE-S)

Categorical Maximum Likelihood (cML)

Confirmatory Factor Analysis (CFA)

Comparative Fit Index (CFI)

Credible Interval (CrI)

Differential Item Functioning (I-DIF)

Difficulty Describing Feelings (DDF)

Difficulty Fantasizing (DFAN)

Difficulty Identifying Feelings (DIF)

Expected Score Standardized Difference (ESSD)

Expected Test Score Standardized Difference (ETSSD)

Explained Common Variance (ECV)

Externally Oriented Thinking (EOT)

Generalized Anxiety Disorder–7 (GAD-7)

Human Penguin Project (HPP)

Item Explained Common Variance (I-ECV)

Item Response Theory (IRT)

Maximum A-Posteriori (MAP)

Patient Health Questionnaire–15 (PHQ-15)

Percentage of Uncontaminated Correlations (PUC)

Perth Alexithymia Questionnaire (PAQ)

Repetitive Behavior Scale–Revised (RBS-R)

Ritualistic/Sameness (RS)

Root Mean Square Error of Approximation (RMSEA)

Sensory Motor (SM)

Simons Powering Autism Research Knowledge cohort (SPARK)

Social Responsiveness Scale–Second Edition (SRS-2)

Standardized Root Mean Square Residual (SRMR)

Toronto Alexithymia Scale (TAS)

Toronto Structured Interview for Alexithymia (TSIA)

Tucker-Lewis Index (TLI)

Unsigned Expected Test Score Difference in the Sample (UETSDS)

Weighted Area Between Curves (wABC)

Ethics approval and consent to participate

All participants gave informed consent for participation in the study. All procedures in the SPARK sample were approved by the institutional review board at Vanderbilt University Medical Center, and the Human Penguin Project was approved under an “umbrella” ethics proposal at Vrije Universiteit, Amsterdam, and separately at each contributing site. All study procedures complied with the ethics code outlined in the Declaration of Helsinki.

Consent for publication

Not applicable.

Competing interests

ZJW serves on the family advisory committee of the Autism Speaks Autism Treatment Network Vanderbilt site and the autistic researcher review board of the Autism Intervention Network for Physical Health (AIR-P). ZJW also serves as a consultant to Roche. KOG has no competing interests.

Funding

This work was supported by grants from the National Institute of General Medical Sciences T32-GM007347 (ZJW); National Institute of Mental Health R01-MH113576 (KG) Nancy Lurie Marks Family Foundation (ZJW). Content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH. No funding body or source of support had a role in the study design, data collection, analysis, or interpretation, decision to publish, or preparation of this manuscript.

Authors’ contributions

ZJW conceptualized and designed the study, cleaned and processed the data, performed all statistical analyses, created all figures and tables, drafted the initial manuscript, and approved the final manuscript as submitted. KOG designed the larger survey study, coordinated and supervised data collection through SPARK, critically reviewed the manuscript, and approved the final manuscript as submitted.

Acknowledgements

The authors are grateful to all of the individuals and families enrolled in SPARK, the SPARK clinical sites and SPARK staff. They appreciate obtaining access to demographic and phenotypic data on SFARI Base. Approved researchers can obtain the SPARK population dataset described in this study by applying at https://base.sfari.org. The authors would also like to acknowledge the investigators of the Human Penguin Project, whose open data contributed meaningfully to the current investigation. Lastly, the authors would like to thank the authors of the TAS-20 (R. Michael Bagby, James D. A. Parker, and Graeme J. Taylor) for their invaluable contributions to the literature on alexithymia and its measurement.

Nemiah JC, Freyburger H, Sifneos PE. Alexithymia: A view of the psychosomatic process. In: Hill OW, editor. Modern trends in psychosomatic medicine. Vol. 3. London: Butterworths; 1976. pp. 430–9. Modern trends series.
Bagby RM, Parker JDA, Taylor GJ. Twenty-five years with the 20-item Toronto Alexithymia Scale. J Psychosom Res. 2020 Apr 1;131:109940.
Kinnaird E, Stewart C, Tchanturia K. Investigating alexithymia in autism: A systematic review and meta-analysis. Eur Psychiatry. 2019 Jan;55:80–9.
Westwood H, Kerr-Gaffney J, Stahl D, Tchanturia K. Alexithymia in eating disorders: Systematic review and meta-analyses of studies using the Toronto Alexithymia Scale. J Psychosom Res. 2017 Aug 1;99:66–81.
Morie KP, Yip SW, Nich C, Hunkele K, Carroll KM, Potenza MN. Alexithymia and Addiction: A Review and Preliminary Data Suggesting Neurobiological Links to Reward/Loss Processing. Curr Addict Rep. 2016 Jun;3(2)(1):239–48.
Kajanoja J, Scheinin NM, Karlsson L, Karlsson H, Karukivi M. Illuminating the clinical significance of alexithymia subtypes: A cluster analysis of alexithymic traits and psychiatric symptoms. J Psychosom Res. 2017 Jun 1;97:111–7.
Berardis DD, Campanella D, Nicola S, Gianna S, Alessandro C, Chiara C, et al. The Impact of Alexithymia on Anxiety Disorders: a Review of the Literature. Curr Psychiatry Rev. 2008 May 1;4(2):80–6.
Aaron RV, Fisher EA, de la Vega R, Lumley MA, Palermo TM. Alexithymia in individuals with chronic pain and its relation to pain intensity, physical interference, depression, and anxiety: A systematic review and meta-analysis. Pain. 2019 May;160(5):994–1006.
Fogley R, Warman D, Lysaker PH. Alexithymia in schizophrenia: associations with neurocognition and emotional distress. Psychiatry Res. 2014 Aug 15;218(1–2):1–6.
Ricciardi L, Demartini B, Fotopoulou A, Edwards MJ. Alexithymia in Neurological Disease: A Review. J Neuropsychiatry Clin Neurosci. 2015 Feb;6(3):179–87. 27(.
Kojima M. Alexithymia as a prognostic risk factor for health problems: a brief review of epidemiological studies. Biopsychosoc Med. 2012 Dec;17(1):21. 6(.
Cruise KE, Becerra R. Alexithymia and problematic alcohol use: A critical update. Addict Behav. 2018 Feb;1:77:232–46.
De Gucht V, Heiser W. Alexithymia and somatisation: A quantitative review of the literature. J Psychosom Res. 2003 May 1;54(5):425–34.
Hadji-Michael M, McAllister E, Reilly C, Heyman I, Bennett S. Alexithymia in children with medically unexplained symptoms: a systematic review. J Psychosom Res. 2019 Aug 1;123:109736.
Parker JDA, Keefer KV, Taylor GJ, Bagby RM. Latent structure of the alexithymia construct: A taxometric investigation. Psychol Assess. 2008;20(4):385–96.
Mattila AK, Keefer KV, Taylor GJ, Joukamaa M, Jula A, Parker JDA, et al. Taxometric analysis of alexithymia in a general population sample from Finland. Personal Individ Differ. 2010 Aug 1;49(3):216–21.
Keefer KV, Taylor GJ, Parker JDA, Bagby RM. Taxometric Analysis of the Toronto Structured Interview for Alexithymia: Further Evidence That Alexithymia Is a Dimensional Construct. Assessment. 2019 Apr 1;26(3):364–74.
Franz M, Popp K, Schaefer R, Sitte W, Schneider C, Hardt J, et al. Alexithymia in the German general population. Soc Psychiatry Psychiatr Epidemiol. 2008 Jan;43(1):54–62.
Mattila AK, Kronholm E, Jula A, Salminen JK, Koivisto A-M, Mielonen R-L, et al. Alexithymia and somatization in general population. Psychosom Med. 2008 Jul;70(6):716–22.
Moriguchi Y, Maeda M, Igarashi T, Ishikawa T, Shoji M, Kubo C, et al. Age and gender effect on alexithymia in large, Japanese community and clinical samples: a cross-validation study of the Toronto Alexithymia Scale (TAS-20). Biopsychosoc Med. 2007 Mar;6(1):7. 1(.
Greene D, Boyes M, Hasking P. The associations between alexithymia and both non-suicidal self-injury and risky drinking: A systematic review and meta-analysis. J Affect Disord. 2020 Jan 1;260:140–66.
De Berardis D, Fornaro M, Orsolini L, Valchera A, Carano A, Vellante F, et al. Alexithymia and Suicide Risk in Psychiatric Disorders: A Mini-Review. Front Psychiatry [Internet]. 2017 [cited 2020 Sep 21];8. Available from: https://www.frontiersin.org/articles/10.3389/fpsyt.2017.00148/full.
Hemming L, Taylor P, Haddock G, Shaw J, Pratt D. A systematic review and meta-analysis of the association between alexithymia and suicide ideation and behaviour. J Affect Disord. 2019 Jul 1;254:34–48.
Pinna F, Manchia M, Paribello P, Carpiniello B. The Impact of Alexithymia on Treatment Response in Psychiatric Disorders: A Systematic Review. Front Psychiatry [Internet]. 2020 [cited 2020 Sep 21];11. Available from: https://www.frontiersin.org/articles/10.3389/fpsyt.2020.00311/full?report=reader#h5.
Porcelli P, Michael Bagby R, Taylor GJ, De Carne M, Leandro G, Todarello O. Alexithymia as Predictor of Treatment Outcome in Patients with Functional Gastrointestinal Disorders. Psychosom Med. 2003 Oct;65(5):911–8.
Lumley MA, Neely LC, Burger AJ. The assessment of alexithymia in medical settings: Implications for understanding and treating health problems. J Pers Assess. 2007 Nov 14;89(3):230–46.
Nuske HJ, Vivanti G, Dissanayake C. Are emotion impairments unique to, universal, or specific in autism spectrum disorder? A comprehensive review. Cogn Emot. 2013 Sep 1;27(6):1042–61.
Velikonja T, Fett A-K, Velthorst E. Patterns of Nonsocial and Social Cognitive Functioning in Adults With Autism Spectrum Disorder: A Systematic Review and Meta-analysis. JAMA Psychiatry. 2019 Feb;76(2)(1):135–51.
Sivathasan S, Fernandes TP, Burack JA, Quintin E-M. Emotion processing and autism spectrum disorder: A review of the relative contributions of alexithymia and verbal IQ. Res Autism Spectr Disord. 2020 Sep;1:77:101608.
Beck KB, Conner CM, Breitenfeldt KE, Northrup JB, White SW, Mazefsky CA. Assessment and treatment of emotion regulation impairment in autism spectrum disorder across the life span: Current state of the science and future directions. Child Adolesc Psychiatr Clin N Am. 2020 Jul 1;29(3):527–42.
Peñuelas-Calvo I, Sareen A, Sevilla-Llewellyn-Jones J, Fernández-Berrocal P. The “Reading the Mind in the Eyes” Test in Autism-Spectrum Disorders Comparison with Healthy Controls: A Systematic Review and Meta-analysis. J Autism Dev Disord. 2019 Mar 1;49(3):1048–61.
Uljarevic M, Hamilton A. Recognition of Emotions in Autism: A Formal Meta-Analysis. J Autism Dev Disord. 2013 Jul;1(7):1517–26. 43(.
Bagby RM, Parker JDA, Taylor GJ. The twenty-item Toronto Alexithymia scale—I. Item selection and cross-validation of the factor structure. J Psychosom Res. 1994 Jan;38(1):23–32.
Taylor GJ, Ryan D, Bagby M. Toward the development of a new self-report alexithymia scale. Psychother Psychosom. 1985;44(4):191–9.
Berthoz S, Lalanne C, Crane L, Hill EL. Investigating emotional impairments in adults with autism spectrum disorders and the broader autism phenotype. Psychiatry Res. 2013;208(3):257–64.
Leonardi E, Cerasa A, Famà FI, Carrozza C, Spadaro L, Scifo R, et al. Alexithymia Profile in Relation to Negative Affect in Parents of Autistic and Typically Developing Young Children. Brain Sci. 2020 Aug;10(8):496.
Szatmari P, Georgiades S, Duku E, Zwaigenbaum L, Goldberg J, Bennett T. Alexithymia in parents of children with autism spectrum disorder. J Autism Dev Disord. 2008 Nov;38(10):1859–65.
Sucksmith E, Roth I, Hoekstra RA. Autistic traits below the clinical threshold: re-examining the broader autism phenotype in the 21st century. Neuropsychol Rev. 2011;21(4):360–89.
Bird G, Cook R. Mixed emotions: The contribution of alexithymia to the emotional symptoms of autism. Transl Psychiatry. 2013 Jul;3(7):e285–5.
Cook R, Brewer R, Shah P, Bird G. Alexithymia, not autism, predicts poor recognition of emotional facial expressions. Psychol Sci. 2013;24(5):723–32.
Bird G, Press C, Richardson DC. The role of alexithymia in reduced eye-fixation in autism spectrum conditions. J Autism Dev Disord. 2011 Nov;41(11)(1):1556–64.
Bird G, Silani G, Brindley R, White S, Frith U, Singer T. Empathic brain responses in insula are modulated by levels of alexithymia but not autism. Brain. 2010 May;133(5)(1):1515–25.
Trevisan DA, Bowering M, Birmingham E. Alexithymia, but not autism spectrum disorder, may be related to the production of emotional facial expressions. Mol Autism. 2016 Nov 11;7(1):46.
Gaigg SB, Cornell AS, Bird G. The psychophysiological mechanisms of alexithymia in autism spectrum disorder. Autism. 2018 Feb 1;22(2):227–31.
Ola L, Gullon-Scott F. Facial emotion recognition in autistic adult females correlates with alexithymia, not autism. Autism. 2020 Jul 21;1362361320932727.
Heaton P, Reichenbacher L, Sauter D, Allen R, Scott S, Hill E. Measuring the effects of alexithymia on perception of emotional vocalizations in autistic spectrum disorder and typical development. Psychol Med. 2012 Nov;42(11):2453–9.
Allen R, Davis R, Hill E. The Effects of Autism and Alexithymia on Physiological and Verbal Responsiveness to Music. J Autism Dev Disord. 2013 Feb 1;43(2):432–44.
Santiesteban I, Gibbard C, Drucks H, Clayton N, Banissy MJ, Bird G. Individuals with Autism Share Others’ Emotions: Evidence from the Continuous Affective Rating and Empathic Responses (CARER) Task. J Autism Dev Disord [Internet]. 2020 May 28 [cited 2020 Sep 21]; Available from: https://doi.org/10.1007/s10803-020-04535-y.
Shah P, Hall R, Catmur C, Bird G. Alexithymia, not autism, is associated with impaired interoception. Cortex. 2016 Aug 1;81:215–20.
Mul C, Stagg SD, Herbelin B, Aspell JE. The feeling of me feeling for you: Interoception, alexithymia and empathy in autism. J Autism Dev Disord. 2018 Sep 1;48(9):2953–67.
Milosavljevic B, Carter Leno V, Simonoff E, Baird G, Pickles A, Jones CRG, et al. Alexithymia in Adolescents with Autism Spectrum Disorder: Its Relationship to Internalising Difficulties, Sensory Modulation and Social Cognition. J Autism Dev Disord. 2016 Apr 1;46(4):1354–67.
South M, Rodgers J. Sensory, emotional and cognitive contributions to anxiety in autism spectrum disorders. Front Hum Neurosci. 2017;11:20.
Albantakis L, Brandi M-L, Zillekens IC, Henco L, Weindel L, Thaler H, et al. Alexithymic and autistic traits: Relevance for comorbid depression and social phobia in adults with and without autism spectrum disorder. Autism. 2020 Jul 14;1362361320936024.
Costa AP, Loor C, Steffgen G. Suicidality in Adults with Autism Spectrum Disorder: The Role of Depressive Symptomatology, Alexithymia, and Antidepressants. J Autism Dev Disord. 2020 Oct 1;50(10):3585–97.
Moseley RL, Gregory NJ, Smith P, Allison C, Baron-Cohen S. A ‘choice’, an ‘addiction’, a way ‘out of the lost’: Exploring self-injury in autistic people without intellectual disability. Mol Autism. 2019 Apr 11;10(1):18.
Pickard H, Hirsch C, Simonoff E, Happé F. Exploring the cognitive, emotional and sensory correlates of social anxiety in autistic and neurotypical adolescents. J Child Psychol Psychiatry. 2020 Mar;jcpp.13214.
Morie KP, Jackson S, Zhai ZW, Potenza MN, Dritschel B. Mood Disorders in High-Functioning Autism: The Importance of Alexithymia and Emotional Regulation. J Autism Dev Disord. 2019 Jul;1(7):2935–45. 49(.
Oakley BFM, Jones EJH, Crawley D, Charman T, Buitelaar J, Tillmann J, et al. Alexithymia in autism: Cross-sectional and longitudinal associations with social-communication difficulties, anxiety and depression symptoms. Psychol Med. 2020 Oct;8:1–13.
Huggins CF, Donnan G, Cameron IM, Williams JHG. A systematic review of how emotional self-awareness is defined and measured when comparing autistic and non-autistic groups. Res Autism Spectr Disord. 2020 Sep;1:77:101612.
Berthoz S, Hill EL. The validity of using self-reports to assess emotion regulation abilities in adults with autism spectrum disorder. Eur Psychiatry. 2005 May;20(3):291–8.
Vorst HCM, Bermond B. Validity and reliability of the Bermond–Vorst Alexithymia Questionnaire. Personal Individ Differ. 2001 Feb;30(3):413–34.
Samson AC, Huber O, Gross JJ. Emotion regulation in Asperger’s syndrome and high-functioning autism. Emotion. 2012 Aug;12(4):659–65.
Kooiman CG, Spinhoven P, Trijsburg RW. The assessment of alexithymia: A critical review of the literature and a psychometric study of the Toronto Alexithymia Scale-20. J Psychosom Res. 2002 Dec;53(6):1083–90.
Preece D, Becerra R, Robinson K, Dandy J. Assessing Alexithymia: Psychometric Properties and Factorial Invariance of the 20-Item Toronto Alexithymia Scale in Nonclinical and Psychiatric Samples. J Psychopathol Behav Assess. 2018 Jun 1;40(2):276–87.
Loas G, Braun S, Delhaye M, Linkowski P. The measurement of alexithymia in children and adolescents: Psychometric properties of the Alexithymia Questionnaire for Children and the twenty-item Toronto Alexithymia Scale in different non-clinical and clinical samples of children and adolescents. PLOS ONE. 2017 May;25(5):e0177982. 12(.
Parker JDA, Eastabrook JM, Keefer KV, Wood LM. Can alexithymia be assessed in adolescents? Psychometric properties of the 20-item Toronto Alexithymia Scale in younger, middle, and older adolescents. Psychol Assess. 2010;22(4):798–808.
Preece DA, Becerra R, Boyes ME, Northcott C, McGillivray L, Hasking PA. Do self-report measures of alexithymia measure alexithymia or general psychological distress? A factor analytic examination across five samples. Personal Individ Differ. 2020 Mar 1;155:109721.
Marchesi C, Ossola P, Tonna M, De Panfilis C. The TAS-20 more likely measures negative affects rather than alexithymia itself in patients with major depression, panic disorder, eating disorders and substance use disorders. Compr Psychiatry. 2014 May 1;55(4):972–8.
Tuliao AP, Klanecky AK, Landoy BVN, McChargue DE. Toronto Alexithymia Scale–20: Examining 18 Competing Factor Structure Solutions in a U.S. Sample and a Philippines Sample. Assessment. 2020 Oct;1(7):1515–31. 27(.
Preece DA, Becerra R, Allan A, Robinson K, Chen W, Hasking P, et al. Assessing alexithymia: Psychometric properties of the Perth Alexithymia Questionnaire and 20-item Toronto Alexithymia Scale in United States adults. Personal Individ Differ. 2020 Nov;1:166:110138.
Bagby RM, Taylor GJ, Parker JDA, Dickens SE. The development of the Toronto Structured Interview for Alexithymia: item selection, factor structure, reliability and concurrent validity. Psychother Psychosom. 2006;75(1):25–39.
Montebarocci O, Surcinelli P. Correlations between TSIA and TAS-20 and their relation to self-reported negative affect: A study using a multi-method approach in the assessment of alexithymia in a nonclinical sample from Italy. Psychiatry Res. 2018 Dec;270:187–93.
Rosenberg N, Rufer M, Lichev V, Ihme K, Grabe H-J, Kugel H, et al. Observer-Rated Alexithymia and its Relationship with the Five-Factor-Model of Personality. Psychol Belg. 2016 May 26;56(2):118–34.
Ormel J, Jeronimus BF, Kotov R, Riese H, Bos EH, Hankin B, et al. Neuroticism and common mental disorders: Meaning and utility of a complex relationship. Clin Psychol Rev. 2013 Jul 1;33(5):686–97.
Brandes CM, Tackett JL. Contextualizing neuroticism in the Hierarchical Taxonomy of Psychopathology. J Res Personal. 2019 Aug;1:81:238–45.
Tackett JL, Quilty LC, Sellbom M, Rector NA, Bagby RM. Additional evidence for a quantitative hierarchical model of mood and anxiety disorders for DSM-V: The context of personality structure. J Abnorm Psychol. 2008 Nov;117(4):812–25.
Kotov R, Gamez W, Schmidt F, Watson D. Linking “big” personality traits to anxiety, depressive, and substance use disorders: A meta-analysis. Psychol Bull. 2010 Sep;136(5):768–821.
Williams ZJ, Everaert J, Gotham KO. Measuring Depression in Autistic Adults: Psychometric Validation of the Beck Depression Inventory–II. Assessment. 2020 Aug 29;107319112095288.
Cassidy SA, Bradley L, Cogger-Ward H, Shaw R, Bowen E, Glod M, et al. Measurement Properties of the Suicidal Behaviour Questionnaire-Revised in Autistic Adults. J Autism Dev Disord. 2020 Oct;50(10):3477–88.
Pelton MK, Crawford H, Robertson AE, Rodgers J, Baron-Cohen S, Cassidy S. A Measurement Invariance Analysis of the Interpersonal Needs Questionnaire and Acquired Capability for Suicide Scale in Autistic and Non-Autistic Adults. Autism Adulthood. 2020 May;27(3):193–203. 2(.
Cohen J. The earth is round (p < .05). Am Psychol. 1994;49(12):997–1003.
Meade AW. A taxonomy of effect size measures for the differential functioning of items and scales. J Appl Psychol. 2010;95(4):728–43.
Edelen MO, Stucky BD, Chandra A. Quantifying ‘problematic’ DIF within an IRT framework: Application to a cancer stigma index. Qual Life Res. 2015 Jan 1;24(1):95–103.
Feliciano P, Daniels AM, Snyder LG, Beaumont A, Camba A, Esler A, et al. SPARK: A US Cohort of 50,000 Families to Accelerate Autism Research. Neuron. 2018;97(3):488–93.
Hu C-P, Yin J-X, Lindenberg S, Dalğar İ, Weissgerber SC, Vergara RC, et al. Data from the Human Penguin Project, a cross-national dataset testing social thermoregulation principles. Sci Data. 2019 Apr 17;6(1):32.
IJzerman H, Lindenberg S, Dalğar İ, Weissgerber SSC, Vergara RC, Cairo AH, et al. The Human Penguin Project: Climate, social integration, and core body temperature. Collabra Psychol. 2018 Oct 19;4(1):37.
Dietz PM, Rose CE, McArthur D, Maenner M. National and State Estimates of Adults with Autism Spectrum Disorder. J Autism Dev Disord [Internet]. 2020 May 10 [cited 2020 Oct 5]; Available from: https://doi.org/10.1007/s10803-020-04494-4.
Daniels AM, Rosenberg RE, Anderson C, Law JK, Marvin AR, Law PA. Verification of parent-report of child autism spectrum disorder diagnosis to a web-based autism registry. J Autism Dev Disord. 2012 Feb 1;42(2):257–65.
IJzerman H, Coan JA, Wagemans FMA, Missler MA, Beest I van, Lindenberg S, et al. A theory of social thermoregulation in human primates. Front Psychol [Internet]. 2015 [cited 2020 Sep 24];6. Available from: https://www.frontiersin.org/articles/10.3389/fpsyg.2015.00464/full.
Stewart N, Chandler J, Paolacci G. Crowdsourcing Samples in Cognitive Science. Trends Cogn Sci. 2017 Oct 1;21(10):736–48.
Palan S, Schitter C. Prolific.ac—A subject pool for online experiments. J Behav Exp Finance. 2017.
Stekhoven DJ, Bühlmann P. MissForest—non-parametric missing value imputation for mixed-type data. Bioinformatics. 2012 Jan 1;28(1):112–8.
Stekhoven DJ. missForest. Nonparametric Missing Value Imputation using Random Forest. 2013.
Golino HF, Gomes CMA. Random forest as an imputation method for education and psychology research: its impact on item fit and difficulty of the Rasch model. Int J Res Method Educ. 2016 Oct 1;39(4):401–21.
Constantino JN, Gruber CP. Social Responsiveness Scale–Second Edition (SRS-2): Manual. 2nd ed. Torrance: Western Psychological Services; 2012.
McDermott CR, Farmer C, Gotham KO, Bal VH. Measurement of subcategories of repetitive behaviors in autistic adolescents and adults. Autism Adulthood. 2020 Mar;2(1)(1):48–60.
Bodfish JW, Symons FJ, Parker DE, Lewis MH. Varieties of repetitive behavior in autism: Comparisons to mental retardation. J Autism Dev Disord. 2000 Jun 1;30(3):237–43.
Beck AT, Steer RA, Brown GK. BDI-II, Beck Depression Inventory: Manual. 2nd ed. San Antonio: Psychological Corporation; 1996. 38 p.
Spitzer RL, Kroenke K, Williams JBW, Löwe B. A brief measure for assessing generalized anxiety disorder: the GAD-7. Arch Intern Med. 2006;166(10):1092–7.
Leary MR. A brief version of the Fear of Negative Evaluation Scale. Pers Soc Psychol Bull. 1983 Sep 1;9(3):371–5.
Carleton RN, Collimore KC, McCabe RE, Antony MM. Addressing revisions to the Brief Fear of Negative Evaluation scale: Measuring fear of negative evaluation across anxiety and mood disorders. J Anxiety Disord. 2011 Aug 1;25(6):822–8.
Kroenke K, Spitzer RL, Williams JBW. The PHQ-15: Validity of a new measure for evaluating the severity of somatic symptoms. Psychosom Med. 2002 Apr;64(2):258–66.
Goldberg LR, Johnson JA, Eber HW, Hogan R, Ashton MC, Cloninger CR, et al. The international personality item pool and the future of public-domain personality measures. J Res Personal. 2006 Feb 1;40(1):84–96.
Tellegen A, Waller NG. Exploring personality through test construction: Development of the Multidimensional Personality Questionnaire. In: Boyle GJ, Matthews G, Saklofske DH, editors. The SAGE Handbook of Personality Theory and Assessment: Personality Measurement and Testing. Thousand Oaks: SAGE; 2008. pp. 261–92.
McConachie H, Mason D, Parr JR, Garland D, Wilson C, Rodgers J. Enhancing the validity of a quality of life measure for autistic people. J Autism Dev Disord. 2018;48(5):1596–611.
R Core Team. R: A language and environment for statistical computing [Internet]. Vienna, Austria: R Foundation for Statistical Computing; 2020. Available from: https://www.R-project.org/.
Li C-H. Confirmatory factor analysis with ordinal data: Comparing robust maximum likelihood and diagonally weighted least squares. Behav Res Methods. 2016 Sep;48(3):936–49.
Rosseel Y. lavaan: An R Package for Structural Equation Modeling. J Stat Softw. 2012;48(2).
Bentler PM. Comparative fit indexes in structural models. Psychol Bull. 1990;107(2):238–46.
Tucker LR, Lewis C. A reliability coefficient for maximum likelihood factor analysis. Psychometrika. 1973 Mar;38(1):1–10.
Steiger JH. Structural model evaluation and modification: An interval estimation approach. Multivar Behav Res. 1990 Apr;25(2):173–80.
Hu L, Bentler PM. Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Struct Equ Model Multidiscip J. 1999 Jan;1(1):1–55. 6(.
DiStefano C, Liu J, Jiang N, Shi D. Examination of the weighted root mean square residual: Evidence for trustworthiness? Struct Equ Model Multidiscip J. 2018 May 4;25(3):453–66.
Yu C-Y. Evaluating cutoff criteria of model fit indices for latent variable models with binary and continuous outcomes [Internet] [PhD Thesis]. [Los Angeles, CA]: University of California Los Angeles; 2002. Available from: https://www.statmodel.com/download/Yudissertation.pdf.
Savalei V. Improving fit indices in structural equation modeling with categorical data. Multivar Behav Res [Internet]. 2020 Feb 13 [cited 2020 Jul 3]; Available from: https://doi.org/10.1080/00273171.2020.1717922.
Xia Y, Yang Y. RMSEA, CFI. and TLI in structural equation modeling with ordered categorical data: The story they tell depends on the estimation methods. Behav Res Methods. 2019 Feb;51(1)(1):409–28.
Maydeu-Olivares A. Assessing the size of model misfit in structural equation models. Psychometrika. 2017 Sep 1;82(3):533–58.
Shi D, Maydeu-Olivares A, Rosseel Y. Assessing fit in ordinal factor analysis models: SRMR vs. RMSEA. Struct Equ Model Multidiscip J. 2020 Jan 2;27(1):1–15.
Reeve BB, Hays RD, Bjorner JB, Cook KF, Crane PK, Teresi JA, et al. Psychometric evaluation and calibration of health-related quality of life item banks: Plans for the Patient-Reported Outcomes Measurement Information System (PROMIS). Med Care. 2007;45(5):22–31.
Kline RB. Principles and practice of structural equation modeling. Fourth edition. New York: The Guilford Press; 2016. 534 p. (Methodology in the social sciences).
Rodriguez A, Reise SP, Haviland MG. Evaluating bifactor models: Calculating and interpreting statistical indices. Psychol Methods. 2016;21(2):137–50.
Rodriguez A, Reise SP, Haviland MG. Applying bifactor statistical indices in the evaluation of psychological measures. J Pers Assess. 2016;98(3):223–37.
Revelle W, Condon DM. Reliability from α to ω: A tutorial. Psychol Assess. 2019;31(12):1395–411.
Bonifay WE, Reise SP, Scheines R, Meijer RR. When are multidimensional data unidimensional enough for structural equation modeling? An evaluation of the DETECT multidimensionality index. Struct Equ Model Multidiscip J. 2015 Oct 2;22(4):504–16.
Green SB, Yang Y. Reliability of summed item scores using structural equation modeling: An alternative to coefficient alpha. Psychometrika. 2009;74(1):155–67.
Samejima F. Estimation of latent ability using a response pattern of graded scores. Psychom Monogr Suppl. 1969;34(4):p..t. 2):100–0.
Bock RD, Aitkin M. Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika. 1981;46(4):443–59.
Chalmers RP. mirt: A Multidimensional Item Response Theory Package for the R Environment. J Stat Softw [Internet]. 2012 [cited 2020 Jun 28];48(6). Available from: http://www.jstatsoft.org/v48/i06/.
Cai L, Monroe S. A new statistic for evaluating item response theory models for ordinal data [Internet]. Los Angeles, CA: University of California, National Center for Research on Evaluation, Standards, and Testing S. (CRESST); 2014 p. 1–28. Report No.: CRESST Report 839. Available from: https://eric.ed.gov/?id=ED555726.
Monroe S, Cai L. Evaluating Structural Equation Models for Categorical Outcomes: A New Test Statistic and a Practical Challenge of Interpretation. Multivar Behav Res. 2015 Nov 2;50(6):569–83.
Maydeu-Olivares A, Joe H. Assessing Approximate Fit in Categorical Data Analysis. Multivar Behav Res. 2014 Jul 4;49(4):305–28.
Cao M, Tay L, Liu Y. A Monte Carlo study of an iterative Wald test procedure for DIF analysis. Educ Psychol Meas. 2017;77(1):104–18.
Williams ZJ. irt_extra: Additional functions to supplement the mirt R package [Internet]. Nashville, TN. 2020 [cited 2020 May 4]. Available from: http://doi.org/10.13140/RG.2.2.10226.04803/1.
Benjamini Y, Hochberg Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J R Stat Soc Ser B Stat Methodol. 1995;57(1):289–300.
Stover AM, McLeod LD, Langer MM, Chen W-H, Reeve BB. State of the psychometric methods: patient-reported outcome measure development and refinement using item response theory. J Patient-Rep Outcomes. 2019;3(1):50.
Mattila AK, Salminen JK, Nummi T, Joukamaa M. Age is strongly associated with alexithymia in the general population. J Psychosom Res. 2006 Nov 1;61(5):629–35.
Lane RD, Sechrest L, Riedel R. Sociodemographic correlates of alexithymia. Compr Psychiatry. 1998 Nov 1;39(6):377–85.
Salminen JK, Saarijärvi S, Äärelä E, Toikka T, Kauhanen J. Prevalence of alexithymia and its association with sociodemographic variables in the general population of finland. J Psychosom Res. 1999 Jan 1;46(1):75–82.
Kurz AS. Bayesian robust correlations with brms (and why you should love Student’s t) [Internet]. A. Solomon Kurz. 2019 [cited 2020 Sep 27]. Available from: https://solomonkurz.netlify.app/post/bayesian-robust-correlations-with-brms-and-why-you-should-love-student-s-t/.
Wetzels R, Wagenmakers E-J. A default Bayesian hypothesis test for correlations and partial correlations. Psychon Bull Rev. 2012 Dec;19(6):1057–64.
Kruschke JK. Bayesian estimation supersedes the t test. J Exp Psychol Gen. 2013 May;142(2):573–603.
Williams ZJ. BayesianTools: R functions to perform general-purpose Bayesian estimation and hypothesis testing using brms [Internet]. Nashville, TN; 2020. Available from: http://doi.org/10.13140/RG.2.2.26089.31845.
Bürkner P-C, brms. An R package for Bayesian multilevel models using Stan. J Stat Softw. 2017;80(1).
Kirk RE. Practical significance: A concept whose time has come. Educ Psychol Meas. 1996 Oct;56(5):746–59.
Kruschke JK, Liddell TM. The Bayesian new statistics: Hypothesis testing, estimation, meta-analysis, and power analysis from a Bayesian perspective. Psychon Bull Rev. 2018;25(1):178–206.
Makowski D, Ben-Shachar MS, Chen SHA, Lüdecke D. Indices of effect existence and significance in the Bayesian framework. Front Psychol [Internet]. 2019 [cited 2020 Mar 24];10. Available from: https://www.frontiersin.org/articles/10.3389/fpsyg.2019.02767/full.
Makowski D, Ben-Shachar MS, Lüdecke D. bayestestR. Describing Effects and their Uncertainty, Existence and Significance within the Bayesian Framework. J Open Source Softw. 2019;4(40):1541.
Wagenmakers E-J, Wetzels R, Borsboom D, van der Maas HLJ. Why psychologists must change the way they analyze their data: The case of psi: Comment on Bem (2011). J Pers Soc Psychol. 2011 Mar;100(3):426–32.
Jeffreys H. Theory of probability. 3rd ed. Oxford [Oxfordshire]: New York: Clarendon Press; Oxford University Press; 1961. 459 p. (Oxford classic texts in the physical sciences).
Caylor JS, Sticht TG, Fox LC, Ford JP. Development of a Simple Readability Index for Job Reading Material. In New Orleans, LA; 1973 [cited 2020 Oct 5]. Available from: https://eric.ed.gov/?id=ED076707.
Margol-Gromada M, Sereda M, Baguley DM. Readability assessment of self-report hyperacusis questionnaires. Int J Audiol. 2020 Jul 2;59(7):506–12.
Orrù G, Gemignani A, Ciacchini R, Bazzichi L, Conversano C. Machine Learning Increases Diagnosticity in Psychometric Evaluation of Alexithymia in Fibromyalgia. Front Med [Internet]. 2020 [cited 2020 Oct 5];6. Available from: https://www.frontiersin.org/articles/10.3389/fmed.2019.00319/full.
Haviland MG, Louise Warren W, Riggs ML. An Observer Scale to Measure Alexithymia. Psychosomatics. 2000 Sep;41(5)(1):385–92.
Hiirola A, Pirkola S, Karukivi M, Markkula N, Bagby RM, Joukamaa M, et al. An evaluation of the absolute and relative stability of alexithymia over 11 years in a Finnish general population. J Psychosom Res. 2017 Apr 1;95:81–7.
Hollocks MJ, Lerh JW, Magiati I, Meiser-Stedman R, Brugha TS. Anxiety and depression in adults with autism spectrum disorder: A systematic review and meta-analysis. Psychol Med. 2019;49(4):559–72.
Embretson SE. The new rules of measurement. Psychol Assess. 1996;8(4):341–9.
Sekely A, Taylor GJ, Bagby RM. Developing a short version of the Toronto Structured Interview for Alexithymia using item response theory. Psychiatry Res. 2018 Aug;266:218–27.
Hendryx MS, Haviland MG, Gibbons RD, Clark DC. An Application of Item Response Theory to Alexithymia Assessment Among Abstinent Alcoholics. J Pers Assess. 1992 Jun 1;58(3):506–15.
Watters CA, Taylor GJ, Bagby RM. Illuminating the theoretical components of alexithymia using bifactor modeling and network analysis. Psychol Assess. 2016 Jun;28(6):627–38.
Preece D, Becerra R, Allan A, Robinson K, Dandy J. Establishing the theoretical components of alexithymia via factor analysis: Introduction and validation of the attention-appraisal model of alexithymia. Personal Individ Differ. 2017 Dec;1:119:341–52.
Cameron K, Ogrodniczuk J, Hadjipavlou G. Changes in Alexithymia Following Psychological Intervention: A Review. Harv Rev Psychiatry. 2014 Jun;22(3):162–78.
Norman H, Marzano L, Coulson M, Oskis A. Effects of mindfulness-based interventions on alexithymia: a systematic review. Evid Based Ment Health. 2019 Feb;22(1)(1):36–43.
Flake JK, Fried EI. Measurement Schmeasurement: Questionable Measurement Practices and How to Avoid Them. PsyArXiv [Internet]. 2019 Jan 17 [cited 2020 Oct 5]; Available from: https://osf.io/hs7wm.

TAS20Supplement10.06.20.docx

Download PDF

Journal Publication

published 02 Mar, 2021

Read the published version in Molecular Autism →

Version 1

posted

You are reading this older preprint version

Read the latest preprint version →

WITHDRAWN: Improving the Measurement of Alexithymia in Autistic Adults: A Psychometric Investigation and Refinement of the Twenty-item Toronto Alexithymia Scale

Status:

Journal Publication

Version 1

Editorial Note

Abstract

Figures

Background

Methods

Results

Discussion

Limitations

Conclusions

List Of Abbreviations

Declarations

References

Supplementary Files

Status:

Journal Publication

Version 1