Participants
Participants were recruited as part of the Infant Screening Project, a longitudinal study conducted in collaboration with Boston Children’s Hospital/Harvard Medical School and Boston University. All participants were born with minimal gestational age of 36 weeks, birth weight over 2.5kg, and with no known genetic or neurological disorders. This was a prospective longitudinal study with infants recruited based on three different likelihoods of receiving a later ASD diagnosis: (1) Elevated familial likelihood infants had an older sibling with a confirmed ASD diagnosis (Sib); (2) Elevated Screening likelihood infants were defined by low scores on the Communication and Symbolic Behavior Scales (CSBS) parent-questionnaire at 12–14 months of age (Screener) [42]. Low CSBS scores were considered as those at or below 1.25 standard deviations below the mean, with cutoff scores of 27 for 12 months, 28 for 13 months, and 32 for 14 months out of a total score of 57; (3) Low Likelihood (LL) participants consisted of infants who had no immediate family history of ASD and passed the CSBS screener.
Autism outcomes were conducted at 24 and 36 months of age, however a number of participants reached 24 or 36 months during the COVID pandemic when in-person visits were not possible. In-person evaluations included the Autism Diagnostic Observation Schedule-Second Edition (ADOS-2; [43]), the Mullen Scales of Early Learning (MSEL; [44]), and a parent-child interaction. During COVID pandemic, remote evaluations included the Brief Observation of Symptoms of Autism (BOSA; [45]), parent-child interaction, the Autism Symptom Interview (ASI; [46]), and the Vineland Adaptive Behavior Scales – Third Edition, Parent Interview Form (Vineland-3; [47]). For toddlers meeting criteria on the ADOS/BOSA or coming within three points of cutoffs, a licensed clinical psychologist reviewed assessment scores and videos and provided their best clinical judgment regarding an ASD diagnosis. 140 participants were assessed via the ADOS and 20 were assessed via the BOSA. Final outcome groups were defined by ASD diagnosis and likelihood group, resulting in the following four groups: (1) ASD (2) Sib-noASD, (3) Screener-noASD, and (4) LL-noASD. Sample characteristics across groups are shown in Table 1. The analysis included a total of 160 participants at 12–14 months.
Behavioral measures and questionnaires
Communication and Symbolic Behavior Scales, Developmental Profile (CSBS)
The CSBS [42] is a standardized assessment scale designed for screening for developmental delays and evaluating social communication and language skills for infants and toddlers between 6 and 24 months of age. The CSBS includes 24 questions and assesses both language skills and symbolic development, including gestures, facial expressions, and play. Questions are rated on a 3-point scale, with the following categories: Not yet, sometimes, and often.
Autism Diagnostic Observation Schedule-Second Edition (ADOS-2)
The ADOS-2 [43] is a semi-structured standardized observational assessment designed to evaluate the current level of ASD symptoms of the participant. During this session, the responses and behaviors of toddlers are systematically observed and rated on two primary components: (1) Social affect, which assesses aspects of social communication, interaction, play/imagination and (2) Restricted and repetitive behavior, focusing on the presence and severity of behaviors indicative of restricted interests and repetitive actions.
Mullen Scales of Early Learning (MSEL)
The MSEL [44] is a standardized developmental assessment for children 0–68 months of age. The non-verbal and verbal developmental quotients are calculated based on the age equivalents of the 4 subscales (non-verbal: Fine Motor, Visual Reception; verbal: Expressive Language, and Receptive Language) and the child’s chronological age.
Brief Observation of Symptoms of Autism (BOSA)
The BOSA [45] is an assessment for autism which consists of a 12–14 min interaction between an individual and a caregiver or clinician, which can be administered in both in-person and telehealth settings. The BOSA offers a social context with standardized materials and activities, adapted from the Brief Observation of Social Communication Change [48] and ADOS-2, and can be coded and evaluated by clinicians trained in the ADOS. High convergent validity between BOSA and ADOS-2 were established (Toddler Module Overall Total and Calibrated Severity Score: 0.74 (p < 0.001); [45]).
Autism Symptom Interview (ASI)
The ASI [46] is a brief (15–20 min) caregiver interview designed to facilitate quick classification of children with ASD for research studies. With items derived from the Autism Diagnostic Interview-Revised [49], the ASI includes 55 questions in 4 domains of functioning: language, social communication, peer interaction, and restricted and repetitive behaviors, as well as diagnostic history of the child.
Vineland Adaptive Behavior Scales– Third Edition, Parent Interview Form (Vineland-3)
The Vineland-3 [47] is a semi-structured interview with a caregiver which assesses a child’s day-to-day adaptive functioning across four domains including communication, daily living skills, socialization, and motor skills. Scores for each item range from 0 to 2 with the following categories: the participant never performs this action (score = 0), the participant sometimes or is partly capable of performing this action (score = 1), or the participant performs the action in daily life (score = 2).
Repetitive Behavior Scale-Revised (RBS-R)
The RBS-R [50] is a 44-question survey designed to evaluate RRBs in individuals with autism across development. It covers six subdomains of RRBs: stereotyped, self-injurious, compulsive, routine, sameness, and restricted behaviors. Caregivers rate each behavior on a 4-point scale, indicating the frequency of occurrence of each behavior in the past month. Scale ranges from 0 to 3, with the following categories: behavior does not occur (score = 0), behavior occurs and is a mild problem (score = 1), behavior occurs and is a moderate problem (score = 2), behavior occurs and is a severe problem (score = 3).
EEG Assessment
EEG Data Acquisition
Two-to-five minutes of non-task-related baseline EEG data were acquired while infants seated on their caregiver’s lap, watched a video of abstract moving shapes in a dimly lit, sound-attenuated room. Research assistants refrained from social interaction with the infant but sometimes blew bubbles across the room or presented a quiet toy (e.g., a ball) to the infant if they became fussy. EEG data were collected with 128 channel Geodesic Sensor Nets connected to a NetAmps 300 amplifier, sampled at 500Hz with a 0.1 Hz high-pass analog filter, and online re-referencing to the vertex (Cz) through NetStation software (Electrical Geodesics, Inc (EGI), Eugene, OR, USA). Impedances were kept below 100KΩ in accordance with the impedance capabilities of the high-impedance amplifiers inside an electrically shielded room.
EEG pre-processing
The continuous EEG data were first exported from NetStation to MATLAB format (version R2017a). Data preprocessing, artifact removal, and data quality assessment were carried out via the Harvard Automated Processing Pipeline for EEG (HAPPE 1.0; [51], a preprocessing pipeline optimized for developmental EEG data). All files were batch processed using the Batch Electroencephalography Automated Processing Platform (BEAPP; [52]) software. In order to optimize artifact rejection performance given the short lengths of EEG in infant data, a spatially distributed subset of channels providing whole-head coverage was processed through HAPPE (electrodes: 9, 11, 22, 24, 33, 124, 122, 36, 45, 52, 58, 62, 70, 83, 92 ,96, 104, 108, 122, 124, 28, 19, 4, 117, 13, 112, 41, 47, 37, 55, 87, 103, 23, 98, 65, 67, 77, 90, 75; Fig. 1). For each EEG, a 1 Hz digital high-pass filter and a 100 Hz low-pass filter was applied in preparation for independent component analysis (i.e. ICA). Data were then resampled with interpolation to 250 Hz (resampling was performed after filtering to avoid aliasing higher frequencies when resampling). Electrical line noise was removed at 60 Hz via CleanLine [53]. Artifacts were rejected (e.g., eye blinks, movement, and muscle activity) through wavelet-enhanced ICA with automated component rejection via EEGLAB [54] and the Multiple Artifact Rejection Algorithm [55, 56]. Post-artifact rejection, any channels removed during the bad channel rejection were interpolated through spherical interpolation to reduce spatial bias in re-referencing. The EEG data were then re-referenced to the average reference, detrended to the signal mean, and segmented into contiguous 2-s windows. Additional segments were then rejected using HAPPE’s amplitude and joint probability criteria.
EEG rejection criteria
EEG recordings were rejected using the following HAPPE data quality measures: Fewer than 20 segments (40 seconds of total EEG), percent good channels < 80%, percent independent components rejected > 80%, mean artifact probability of components kept > 30%, and percent variance retained < 25%. All data quality metrics were similar across outcome groups (Table 2). Of 178 participants with EEG recordings, 13 were excluded due to fewer than 20 segments, and 5 were excluded for having more than 80% good channels.
EEG Spectral decomposition and parameterization analysis
The power spectral density at each electrode, for each 2-second segment, was calculated in the BEAPP Power Spectral Density (PSD) module using a multitaper spectral analysis with three orthogonal tapers. For each electrode, the PSD was averaged across segments, and then further averaged across posterior regions of interest (electrodes: 70, 75, 83, 67, 77, 62; Fig. 1). This posterior electrode cluster over the parietal-occipital cortex was determined a priori and was selected based on 1) prior research indicating that the dominant oscillation of alpha typically originates in parietal-occipital cortex [13, 34] and 2) Carter-Leno et al., (2018)’s finding on the relation between parietal alpha power and BAP-Q.
The PSD was then further analyzed using a modified version of SpecParam v1.0.0 (Also known as FOOOF, https://github.com/fooof-tools/fooof; in Python v3.6.8) to model periodic and aperiodic components of the power spectra. SpecParam required modification for use in this age range, as power spectrum models showed poor model fit (increased mean squared error) for frequencies between 10-20Hz (see Wilkinson et al., 2023 for more details on modifications to the original algorithm). The SpecParam model was used in the fixed mode (no spectral knee) with peak_width_limits set to [0.5, 18.0], max_n_peaks = 7, and peak_threshold = 2. Further analyses were subsequently restricted to 2.5-50Hz given elevated error between 2-2.5, and 50-55Hz. Mean \({R}^{2}\) for the full sample was 0.997 (STD = 0.009). Mean estimated error for the sample was 0.01 (STD = 0.010). The mean \({R}^{2}\)and mean estimated error for each group was as follows: Sib-noASD (\({R}^{2}\)= 0.997, STD = 0.005; Error = 0.012, STD = 0.008), Screener-noASD (\({R}^{2}\)= 0.998, STD = 0.012; Error = 0.013, STD = 0.008), LL-noASD (\({R}^{2}\)= 0.996, STD = 0.012; Error = 0.014, STD = 0.011), and ASD (\({R}^{2}\)= 0.997, STD = 0.005; Error = 0.012, STD = 0.010).
SpecParam outputs include an aperiodic offset and slope to describe the estimated aperiodic 1/f signal. Each individual’s periodic power spectrum was determined by subtracting their SpecParam-estimated aperiodic spectrum from their original absolute power spectrum. Spectral analysis was performed to get a measure of periodic alpha power by binning power of each detected oscillation into the following the traditional “canonical” alpha band: 6–9 Hz. To characterize individual PAA and PAF, the periodic spectrum was smoothed using a savgol filter (scipy.signal.savgoal_filter, window length = 101, polyorder = 8). Maxima were identified within 6 − 12 Hz, and the frequency and amplitude for the maxima were extracted. Only those with an identifiable peak in the 6 − 12 Hz range was included in the analyses for PAA and PAF. We selected this frequency range based on previous developmental work suggesting 6 Hz is an appropriate lower bound for the alpha rhythm in infants from 5 months upwards [58], and extended our upper bound to 12 Hz to enable identification of peaks that are outside of the canonical band. Figure 2 displays PSDs for the whole sample to illustrate there was a clear peak in the 6 − 12 Hz range. There were two participants where FOOOF identified two peaks in 6 − 12 Hz range in the posterior ROI; in these participants the higher value was taken as their PAF. Peaks were not identified in eleven participants. To characterize the aperiodic exponent (slope of the spectra), we used χ in the 1/fχ model fit.
Table 1
Sample Characteristics at 12–14 months
| Groups Combined N = 160 | LL-noASD N = 80 | Sib-noASD N = 34 | Screener-noASD N = 21 | ASD N = 25 | P value |
Age, days (SD) | |
| 399.4 (24.05) | 400.1 (23.0) | 403.3 (29.4) | 400.8 (17.7) | 390.4 (22.9) | 0.21 |
Sex, N (%) | |
Female | 67 (41.8) | 42 (52.5) | 16 (47.0) | 4 (19.0) | 5 (20.0) | 0.004 |
Ethnicity, N (%) | 0.2 |
Hispanic | 7 (4.3) | 3 (3.8) | 1 (2.9) | 0 | 3 (12.0) | |
Non-Hispanic | 152 (95.0) | 76 (95.0) | 33 (97.0) | 21 (100.0) | 22 (88.0) | |
Not Answered | 1 (0.7) | 1 (1.2) | 0 | 0 | 0 | |
Race, N (%) | 0.15 |
White | 130 (81.3) | 62 (77.5) | 27 (79.4) | 17 (81.0) | 24 (96.0) | |
Asian | 10 (6.2) | 3 (3.8) | 5 (14.3) | 2 (9.5) | 0 | |
Black/African American | 3 (1.8) | 2 (2.5) | 1 (2.9) | 0 | 0 | |
More than one race | 16 (9.9) | 12 (15.0) | 1 (2.9) | 2 (9.5) | 1 (4.0) | |
Not Answered | 1 (0.8) | 1 (1.2) | 0 | 0 | 0 | |
Income, N (%) | 0.99 |
<$40,000 | 7 (4.3) | 1 (1.2) | 0 | 2 (9.5) | 4 (16.0) | |
$40,000-$99,999 | 23 (14.3) | 10 (12.5) | 6 (17.1) | 2 (9.5) | 5 (20.0) | |
>$100,000 | 119 (74.3) | 68 (85.0) | 21 (61.8) | 15 (71.4) | 15 (60.0) | |
Not Answered or Don’t Know | 11 (6.9) | 1 (1.2) | 7 (20.0) | 2 (9.5) | 1 (4.0) | |
MSEL at 12m, M(SD) | |
Verbal developmental quotient | 99.33 (16.58) | 101.66 (14.04) | 103.45 (16.4) | 90.6 (15.85) | 93.74 (21.33) | 0.006 |
Non-Verbal developmental quotient | 114.83 (15.74) | 115.87 (15.38) | 116.31 (13.08) | 110.51 (19.36) | 113.18 (16.96) | 0.48 |
EEG at 12m, M(SD) | |
Periodic alpha power | 0.93 (0.29) | 0.93 (0.3) | 0.96 (0.33) | 0.92 (0.26) | 0.88 (0.23) | 0.85 |
PAA | 0.36 (0.11) | 0.36 (0.11) | 0.37 (0.13) | 0.35 (0.10) | 0.34 (0.08) | 0.87 |
PAF | 8.04 (0.93) | 7.92 (0.99) | 8.21 (0.88) | 8.32 (0.79) | 7.88 (0.86) | 0.17 |
Aperiodic exponent | 1.11 (0.12) | 1.10 (0.12) | 1.09 (0.15) | 1.16 (0.10) | 1.10 (0.10) | 0.39 |
RRB Data at 24m Included in Analysis, N | |
| 133 | 67 | 30 | 17 | 19 | |
RRB at 24m, M(SD) | | | | | | |
| 5.18 (7.12) | 3.0 (3.1) | 4.03 (6.08) | 6.88 (8.61) | 13.11 (11.07) | < 0.001 |
P values indicate the obtained p-values from each statistical test conducted for group-level differences: ANOVA- Age, MSEL, EEG, and RRB; Chi-square test- Sex, Ethnicity, Race, and Income |
Table 2
| Groups Combined N = 160 | LL-noASD N = 80 | Sib-noASD N = 34 | Screener-noASD N = 21 | ASD N = 25 |
EEG quality metrics, Mean (SD) |
Number of Segments retained after processing | 97.28 (35.1) | 96.91 (37.96) | 102.32 (27.21) | 98.05 (37.13) | 90.92 (34.25) |
Percent Good Channels | 93 (4.2) | 93 (4.2) | 93 (4.5) | 94 (2.6) | 92 (4.6) |
Percent ICs Rejected | 39 (11) | 40 (10) | 41 (11) | 35 (11) | 39 (11) |
Percent Variance Kept Post Waveleted Data | 65.2 (14.94) | 65.62 (15.57) | 63.34 (14.84) | 64.73 (14.16) | 66.65 (14.25) |
Mean Artifact Probability of Kept ICs | 0.13 (0.05) | 0.13 (0.05) | 0.14 (0.05) | 0.13 (0.04) | 0.12 (0.04) |
Statistical Analyses
All statistical analyses and visualizations were done using the Python and R programming language. To explore whether EEG at 12–14 months differed between outcome groups, we performed a one-way analysis of variance with covariates (ANCOVA) with each EEG measure (periodic alpha power, PAF, PAA, or aperiodic exponent) as the dependent variable, outcome groups as a between-participant variable, and age (in days), sex, and non-verbal MSEL as a covariates.
To evaluate the effect of outcome group on the relationship between EEG measures (periodic alpha power, PAF, PAA, or aperiodic exponent) and RRBs, linear regression models included a two-way interaction between outcome group and the relevant EEG measure (setting ASD group as the main reference group). As there were more males in the Screener-noASD and ASD group, sex was included as a covariate in all models. In addition, since our EEG measures were collected between 12-to-14 months, and as previous research found associations between age and cognitive measures with EEG measures (alpha power and PAF), age and non-verbal MSEL were included as covariates in all the linear regression models. Standardize_parameters function in the R package effectsize (version 0.0.1; [59]) was used to re-fit and compute standardized model parameters (standardized coefficients). To characterize interaction effects within the models, marginal effects analyses [60] were performed. R package emmeans [61] and ggplot2 [62] were used to plot estimated regression lines and the interaction plot of estimated marginal effects. Post hoc comparisons were conducted between sub-groups using the Tukey method at 95% confidence level. In addition, we computed spearman correlations within each group to measure the magnitude of the relation between EEG at 12-14m and RRBs at 24m.
-
Model = RRB at 24m ~ EEG measure * Outcome group + age(days) + sex + non-verbal MSEL
-
Standardize_parameters (model, method = "refit") for standardized coefficients
-
emmeans to conduct post hoc analyses: marginal effects of interaction of EEG and Outcome group.
-
Spearman correlations