English sentences
In reading time, an interaction between music expertise and sentence type was found, F(2, 166) = 3.10, p = .048, ηp2 = .036 (Fig. 2a). Participants spent the most time reading random word lists and least time reading original sentences; this effect was stronger in musicians. No main effect of music expertise or sentence type was found. The planned comparisons for semantic processing (original vs. semantically incorrect) and linguistic regularity (original vs. random word list) showed no main effect or interaction. In syntactic processing (semantically incorrect vs. random word), an interaction between sentence type and music expertise was observed, F(1, 83) = 4.06, p = .047, ηp2 = .047: the reading time difference between the two conditions was larger in musicians than non-musicians. Thus, musicians’ reading fluency was more affected by syntactic irregularities than non-musicians.
In reading efficacy, musicians and non-musicians did not differ significantly in accuracy or RT of question answering for any sentence type.
In saccade length, no significant effect was observed (Fig. 2b).
In eye movement pattern, the two representative patterns were shown in Fig. 2c. In the dispersed pattern, a scan path typically started with a fixation at a widely distributed region covering the whole sentence (Red, 56%), and then remained in this region, with a small probability to move to the sentence beginning (Green, 5%). In contrast, in the sequential pattern, a scan path typically started at the middle (Red, 64%). Then, the second fixation was most likely at the sentence beginning (Green, 88%), and continued to the rest the sentence. The two patterns were significantly different, as the data log-likelihoods of the dispersed patterns given the representative dispersed HMM were significantly higher than those given the representative sequential HMM, t(110) = 15.947, p < .001, d = 1.514, and vice versa for the sequential patterns, t(146) = 12.146, p < .001, d = 1.00228. To quantify participants’ eye movement pattern along the dispersed-sequential pattern dimension, following previous studies34,39,41, we defined D-S scale as (D – S)/(|D| + |S|), where D refers to the log-likelihood of the eye movement data being generated by the representative dispersed pattern HMM, and S for the representative sequential pattern HMM. A more positive value indicated higher similarity to the dispersed pattern.
In D-S scale, ANOVA showed no significant effect. In the planned comparisons, no effect was observed in semantic or syntactic processing comparisons. In linguistic regularity (original vs. random word), an interaction between sentence type and music expertise was observed, F(1, 83) = 4.430, p = .038, ηp2 = .051. Musicians showed a more sequential pattern when reading original sentences than random word lists, t(83) = -3.297, p = .008, d = 3.317, whereas non-musicians did not, t(83) = − .278, p = .992, n.s. (Fig. 2d). This suggested that musicians’ eye movement planning behaviour was affected more by linguistic (semantic and syntactic) irregularities than non-musicians.
Chinese sentences
In reading time, similar to the English reading results, an interaction between music expertise and sentence type was found, F(2, 166) = 3.41, p = .035, ηp2 = .039 (Fig. 3a): musicians spent longest time reading random word lists, and shortest time reading original sentences, whereas non-musicians spent longer time reading random word lists than semantically incorrect and original sentences. No main effect of sentence type or music expertise was observed. In the planned comparisons, no effect was observed in semantic processing (original vs. semantically incorrect) or syntactic processing (semantically incorrect vs. random word), whereas in linguistic regularity (original vs. random word) an interaction between sentence type and music expertise was observed, F(1, 83) = 4.05, p = .047, ηp2 = .047: the sentence type effect was stronger in musicians than non-musicians (Fig. 3a). Thus, musicians’ Chinese reading time was more affected by linguistic regularity than non-musician.
In reading efficacy, musicians and non-musicians did not differ in accuracy or RT of question answering for any sentence type.
In average saccade length (Fig. 3b), a main effect of sentence type was observed, F(2, 166) = 5.170, p = .007, ηp2 = .059: participants had longest saccade lengths when reading original sentences, and shortest when reading random word lists. In the planned comparisons, in semantic processing (original vs. semantically incorrect), no effect was observed. In linguistic processing (original vs. random word), a main effect of sentence type was observed, F(1, 83) = 7.790, p = .007, ηp2 = .086. In syntactic processing (semantically incorrect vs. random), a main effect of sentence type was observed, F(1, 83) = 5.937, p = .017, ηp2 = .067.
In eye movement pattern (Fig. 3c), the disperse pattern typically started with a fixation at a widely distributed region covering the whole sentence, and then remained in this region. The sequential pattern typically started at the middle (Red, 91%), and then to the sentence beginning (Green, 93%); then continued to the rest the sentence. The two patterns were significantly different: the data log-likelihoods of the dispersed pattern given the representative dispersed HMM were significantly higher than those given the representative sequential HMM, t(165) = 13.1391, p < .001, d = 1.0198; vice versa for the data log-likelihoods of the sequential patterns, t(91) = 14.9432, p < .001, d = 1.5579.
In D-S scale, no significant effect was found. Also, no effect was found in the planned comparisons (Fig. 3d).
Musical phrases
In viewing time, a main effect of music expertise was observed, F(1, 83) = 21.516, p < .001, ηp2 = .206; this effect interacted with sentence type, F(1, 83) = 20.650, p < .001, ηp2 = .199 (Fig. 4a). Musicians spent more time viewing random segments than original phrases, t(83) = -7.64, p < .001, d = .922; this was not observed in non-musicians, t(83) = -1.12, p = .679, n.s.
In viewing efficacy, musicians had higher accuracy than non-musicians in the recognition of original phrases, t(84) = 7.81, p < .001, d = 1.685, and random segments, t(84) = 4.94, p < .001, d = 1.065. Musicians also had longer RT than non-musicians for original phrases, t(84) = 2.48, p = .015, d = .536, and random segments, t(84) = 2.75, p = .007, d = .593. In auditory musical phrase matching, musicians had higher accuracy than non-musicians for both original phrases, t(84) = 9.282, p < .001, d = 2.002, and random segments, t(84) = 5.858, p < .001, d = 1.263. They did not differ in RT.
In saccade length (Fig. 4b), a main effect of music expertise was observed, F(1, 83) = 14.11, p < .001, ηp2 = .145. An interaction between sentence type and music expertise was also observed, F(1, 83) = 10.37, p = .002, ηp2 = .111: musicians had longer saccade lengths when viewing original phrases than random segments, t(83) = 5.319, p < .001, whereas non-musicians did not, t(83) = .701, p = .896, n.s. Thus, musicians were more sensitive to irregularities in music reading reflected in average saccade length.
In eye movement pattern (Fig. 4c), the dispersed pattern typically started with a fixation at a widely distributed region (Red and Green, 79%); then remained in these regions. The sequential pattern typically started with a fixation located at the first three bars (Red, 100%), and then stayed in the same region, with a small probability to continue to the rest of the phrase (Green, 12%), or move to the phrase beginning (Blue, 10%). The two patterns were significantly different: Data log-likelihoods of the dispersed patterns given the representative dispersed HMM were significantly higher than those given the representative sequential pattern HMM, t(91) = 8.14542, p < .001, d = 0.849, and vice versa for the data log-likelihoods of sequential patterns, t(79) = 11.7085, p < .001, d = 1.309. In D-S scale, no effect was observed (Fig. 4d).
Here we observed musicians’ sensitivity to irregularities in music reading reflected in both viewing time and saccade length. In a separate analysis, we calculated normalized viewing time difference between original and random segment conditions as (O - R)/(O + R), where O and R refer to viewing time in the original and random segment condition respectively (rSB = .99). We found that musicians’ viewing time difference was associated with their auditory musical phrase matching accuracy: higher accuracy was correlated with longer viewing time for random segments relative to original notations, r(41) = − .409, p = .006. Similarly, in musicians, larger normalized saccade length difference between original and random segment conditions (rSB = .97) was associated with higher auditory musical phrase matching accuracy, r(41) = .335, p = .028. These results suggested that the viewing time and saccade length effects in musicians were related to their expertise in matching music notations to corresponding auditory musical phrases.
Tibetan sentences
In viewing time, a main effect of music expertise was found, F(1, 83) = 4.505, p = .037, ηp2 = .051 (Fig. 5a): musicians spent more time viewing than non-musicians. No main effect of sentence type was observed. In viewing efficacy, musicians and non-musicians did not differ in accuracy or RT of word recognition of any sentence type.
In saccade length (Fig. 5b), an interaction between sentence type and expertise was observed, F(1, 83) = 12.419, p < .001, ηp2 = .130: musicians had marginally longer average saccade lengths when viewing original sentences than random word lists, t(83) = 2.513, p = .065, whereas non-musicians had marginally shorter average saccade lengths when viewing original sentences than random word lists, t(83) = -2.541, p = .061.
In eye movement pattern (Fig. 5c), the disperse pattern typically started with a fixation at a widely distributed region (Red and Green, 92%); then remained in these regions. The sequential pattern typically started with a fixation at a widely distributed region (Red, 98%), and then had a small probability to move to the end (Blue, 7%) or the sentence beginning (Green, 8%). The two patterns were significantly different (Data log-likelihoods of the dispersed patterns given the representative dispersed HMM were significantly higher than those given the representative sequential HMM, t(72) = 15.7654, p < .001, d = 1.8452; vice versa for the sequential patterns, t(98) = 4.75833, p < .001, d = 0.4782). In D-S scale, no significant effect was observed (Fig. 5d).