2.1 | Sample recruitment and selection criteria
Professional as well as semi-professional musicians and music students with different majors (e.g, artistic/ artistic-pedagogical/ pedagogical with an artistic focus), aged between 18 and 60 years and suffering from MPA, are included in the study. Further inclusion criteria are sufficient knowledge of the German language and compliance with Covid-regulations for local study appointments. Concerning the presence of a performance-disruptive MPA only musicians are included reporting a moderate or severe level of psychological distress (≥ 5 out of 10) and/or impaired functioning due to their fear of musical performance situations. This is assured by asking the subjects to rate on a scale ranging from 0 “not at all” to 10 “maximum possible” 1) how much they suffer due to their fear of musical performance situations and 2) how much they feel impaired in their functioning (i.e. every day or musical performance) due to their fear of musical performance situations. The musicians are included when rating at least one of the questions with at least 5 out of 10. Exclusion criteria comprise current or recent (< 6 months ago) psychotherapy because of the MPA and severe comorbid psychiatric or physical disorders, possessing a contraindication for exposure therapy (e.g. epilepsy, serious disorders of the cardiovascular system, psychotic disorders, borderline personality disorder, substance dependence or acute suicidality). Further exclusion criteria are concerning the characteristics and design of the study (e.g. small VR-cabin, assessment of blood samples, standing first-person view of the VR videos). Therefore, the subjects have to assure they are not suffering from severe claustrophobia or blood-injection-injury type phobia and can sing or play their instrument blindfolded and standing in the VR-cabin. For this reason and also because of the comparability of the HRV assessment, musicians with larger instruments and playing in a sitting position such as drummer, pianists, harpists and cellists cannot take part in the study. An overview of all inclusion and exclusion criteria can be found in Table 1. The subjects are recruited via local advertisements, flyers, radio announcements, and mailing lists of different musical universities, orchestras, music schools and theater groups in Bavaria.
Table 1
Overview of the selection criteria for sample recruitment
Selection Criteria | Description |
Inclusion Criteria | Age between 18 to 60 years |
| Anxiety and/or avoidance of musical performance situations (MPA) |
| Moderate level (≥ 5/10) of physiological distress and/ or impaired functioning due to MPA |
| Sufficient knowledge of the German language |
| Compliance with the Covid-19-regulations for local study appointments |
| Written informed consent |
Exclusion Criteria | Physiological contraindications for an exposure therapy (i.e. epilepsy, serious disorder of the cardiovascular system) |
| Psychological contraindications for an exposure therapy (i.e. substance dependence, psychotic disorder, acute suicidality) |
| Severe Claustrophobia |
| Severe Blood-injection-injury type phobia |
| Use of sedative medication or tranquilizers (i.e. beta-blockers) for study appointments |
| Instrument is not portable, too big for the VR-cabin, or not playable while standing |
| Not able to play instrument blindfold |
| Current or recent (< 6 months ago) psychotherapy due to the MPA |
2.2 | Study design
The prospective, randomized trial has a pre-post design and includes a follow-up period of 6 months (see Fig. 1). Musicians (N = 46) with MPA will be examined before and after therapy.
At a first screening appointment (S), inclusion and exclusion criteria will be checked. The study information will be provided and written informed consent will be obtained. The study was approved by the ethics committee of the University of Würzburg (No. 194/21-sc). The project is in line with the Helsinki Declaration (as of October 2013, Fortaleza, Brazil) and Good Clinical Practice (GCP). Eligible musicians will be randomized by sex (male vs. female) in two therapy arms, i.e. VRET or PMR. The randomization is carried out by assigning the musicians, at the end of the screening appointment (S), to a previously defined random order of treatment conditions that has been generated by a computer. The investigators know about the randomization after the screening assessment and the musicians will be informed about their assigned intervention at the end of the pre- treatment assessment (T0) (single-blind randomization). Thereafter, screening questions concerning onset, form, and severity of MPA based on the criteria for specific phobia according to ICD 10 (F40.2) and social anxiety disorder according to DSM V (300.23, F40.10) will be addressed. A clinical interview for diagnosing mental disorders (Mini-DIPS-OA, 37, 38) is conducted by a professional (DB and KW) to properly assess psychiatric comorbidities. Furthermore, information on the family history for affective disorders and cardiovascular diseases as well as current medication will be collected by questionnaires.
For each treatment condition (VRET vs. PMR) four individual therapy sessions within two weeks will take place. Subjective as well as behavioral data will be collected at T0, T1 and T2, to describe the severity of MPA (see 2.5). For the assessment of the behavioral and physiological data a behavioral assessment test (BAT) in VR is conducted, during which the participant performs a musical audition (see 2.5.1).
2.3 | VRET
At the end of the pre assessment (T0) in the CAVE (computer assisted virtual environment) the subjects are given a psychoeducational manual in preparation for the VRET. They are instructed to read the manual at home and to complete short written exercises included in the manual. Contents of the manual are extracted from standardized CBT manuals and include the function and components of fear as well as their interplay (vicious circle of fear), information about MPA in discrimination to non-pathological stage fright and its treatment due to exposure therapy. Exercises concerning individual components of MPA, individual safety- and avoidance behaviors, and an individual anxiety hierarchy of the VR scenarios (see 2.3.1) were included in the manual to deepen the understanding of important key contents and in preparation for the exposure therapy. Before and after every VRET session, the study participants are given a questionnaire to assess important factors relating to the training progress and potential changes of the MPA due to the exposure training. Furthermore, blood pressure and heart frequency are being measured in every training session, directly before and after the first exposure.
The first training session (VRET 1) starts with discussing the contents of the manual with the patients to make sure they understand the key points and mechanisms of behavioral exposure therapy. Sessions two to four (VRET 2–4) start with discussing the practice experience at home since the last session, reinforcing therapy compliant behavior and supporting the patient in case of difficulties with the exposure concept. The VR scenario for the exposure training is selected in every training session (VRET 1–4) and with every patient individually using the anxiety hierarchy. The selection criteria for the first scenario in VRET 1 is a moderate level of fear (approx. 60), which is considered high enough to elicit a fear reaction as suggested by Neudeck (39) but with the option for increasing the challenge for the next sessions (VRET 2–4). Individual concerns or preferences of the musicians regarding the scenario are considered. In preparation for the exposure, the expected physiological, cognitive, and behavioral reaction will be explored with the patients. Furthermore, they are asked to give fear ratings on a scale from 0 “no fear at all” to 100 “maximum fear” before (anticipatory and maximal expected fear), meanwhile (fear during the waiting phase) and after (maximal fear meanwhile the performance, fear after the performance) the exposure. After completing the scenario, the exposure experience is discussed with the patients focusing on important learning experiences (cognitive, behavioral, and physiological) for unlearning their fear of musical performance situations and on the proper execution of the exposure training (omitting safety or avoidance behaviors). If required, therapeutic support is given and instructions for further exposure scenarios are being discussed with the patients. The patients are exposed again to the scenario until either their fear of and during the performance situation has reduced to a minimum or until they have drawn an important learning experience regarding their fear and feel confident to proceed to another (more feared) scenario. Therefore, the exposure sessions accompanied by a therapist are aiming at 1) a between-session habituation/ reduction of the fear or 2) changes in the cognitive fear structure in the process of unlearning fear of musical performance situations. At the end of every training session (VRET 1–4) the musicians are instructed to practice similar performance situations in real life at home. Therefore a worksheet is given to them to facilitate the implementation. In the last session (VRET 4) positive learning experiences within the scope of the VRET are summarized together with the subject and a small theoretical input regarding relapse prevention and the handling of future performance situations is given.
The virtual environment is generated using the programs GoProVR Player Version 3.0 and Steam VR Version 1.20 and displayed via a HTC Vive head-mounted display. The VR scenario creates the exposure using 360° video sequences. Maximum duration of each session is one hour.
2.3.1 | Description of 360° videos
Since the VR scenarios will be selected individually for every patient, an overview of the 360° videos is presented in Table 2 and in Fig. 2. The videos were filmed at locations of the University of Music in Würzburg and at the Center for Mental Health at the University Hospital in Würzburg. All participants seen in the videos gave their written informed consent.
Each video starts with a short 1.5-minute waiting phase in a backstage waiting room, where the participant can accommodate to the VR and prepare for the performance situation. The subjective fear rating is assessed verbally using a subjective units of distress (SUD) scale (0-100) before and after the feared situation as well as during the waiting phase. Scenario A consists of a professional jury (one man, two women) sitting in the audience of a large concert hall. The participant is located on a large stage at a distance of 4–5 meters from the jury. The participant is asked by the jury to introduce him-/herself and the two pieces selected for the audition and then start to perform. Scenario B takes place in the same concert hall and has the order of sequence as scenario A, but differs in that the participant is performing behind a folding screen/paravent (similar to real audition situations) and thus unable to see the jury or the audience, and is interrupted by the jury while playing. Hence, scenario B has a total duration of 6.5 minutes (incl. the waiting phase). In scenario C, participants have to perform in a constrained area, i.e. on a small stage in a small concert hall in front of a professional jury (one man, three women). Two jurors (women) are sitting at a table in front of the first row in close distance to the musician. Similar to the procedure of the audition in scenarios A and B, after personal and musical introduction, the participant start to perform. After 2.5 minutes of performance, the participant is interrupted by the jury and asked to leave the stage. During the performance, the jurors take notes, carefully observe the participant and initiate eye contact amongst each other. Scenario D shows a concert situation in front of an audience consisting of 21 middle-aged people. It takes place in the same small concert hall as scenario C. When the participant enters the stage, the audience starts to clap hands. After ending the applause, the audience attentively listens to the performance of the musician. The scenario ends with the audience clapping hands enthusiastically after the musician finished the performance. Scenario E takes place in a classroom in front of an audience consisting of 17 music students displaying bored and distractive behaviors like talking to their neighbors, moving and changing seating positions, checking their phones, coming late to class, or leaving early. The Scenario has a duration of 6.5 minutes (incl. waiting phase). Scenario F takes place in the same classroom with the same audience of music students. In contrast to scenario E, the students in scenario F are listening attentively to the performance of the musician. The scenario ends with applause after the musician finished the performance. In Scenario G, the musician has to perform in front of the conductors and some ensemble members (five men, four women) of their (fictitious) favorite ensemble, which they have applied for. They are standing in close distance to the ensemble and are being asked by the ensemble director to present two pieces successively. Scenario H is similar to scenario G with the difference that another man is talking and that the subjects are being asked to explain their motivation of applying for the ensemble, similar to a job interview. The time for playing/ singing until getting interrupted by the jury is longer in scenario G (4 minutes per piece) than in Scenario H (3 minutes per piece). Scenario I takes place in a video studio presenting a live recording for a television broadcast. After entering the video studio, the musician stands in front of a green screen illuminated by floodlights. The musician is filmed and recorded by a camera and a microphone right in front of them. The musician can see a sound engineer through a small window, who is giving them instructions during the live recording. The total duration including waiting phase is 6.75 minutes.
Table 2
Overview of the 360° videos used for the VRET
Scenario | Description |
A | Audition in front of a professional jury in a big concert hall |
B | Audition behind a folding screen in a big concert hall |
C | Audition in front of a professional jury in a smaller concert room |
D | Concert in front of an elderly audience in a smaller concert room |
E/F | Audition in front of a class of music students |
G/H | Audition in front of the conductors and ensemble members |
I | Live-recording for a television broadcast in a recording studio |
2.4 | PMR
The content and procedure of the PMR training is aimed as equal as possible to the content and procedure of the VRET. The control group is given a psychoeducational manual in preparation for the PMR training sessions at the end of the first assessment in the CAVE (T0, Pre). They are instructed to read the manual at home and to complete small written exercises included in the manual. Contents of the PMR manual resemble contents of the VRET manual (e.g. function and components of fear, vicious circle of fear, information about MPA in discrimination to stage fright) adjusted to the different therapy method (e.g. treatment of MPA due to PMR). The use of PMR to reduce MPA is explained in the manual in terms of an anxiety management training. Psychoeducational contents specific to the VRET such as individual safety and avoidance behaviors, learning and unlearning anxiety are excluded from the PMR manual and replaced by PMR specific contents (e.g. the relaxation response, classical conditioning). Same as for the VRET the participants of the PMR training are given a questionnaire to assess important factors relating to the training progress and potential changes of MPA due to the relaxation training before and after every training session. Furthermore, blood pressure and heart frequency are being measured in every training session, directly before and after the PMR.
The content of the PMR manual is discussed with the patients in the first training session (PMR 1) and the patients have the opportunity to ask questions. They are introduced into the right execution of the PMR by explaining and practicing the correct way of straining and tensing the muscles together with the participant (PMR 1). Sessions two to four (PMR 2–4) are started with discussing the practice experience at home since the last session, reinforcing therapy compliant behavior and supporting the patient in case of difficulties with the therapeutic method. After that the musician is instructed to take a comfortable sitting position (e.g. take off shoes and glasses, unfasten waist belt, close eyes) and the PMR instruction is read out loud to the participant (PMR 1–4). The instruction used is a version by Bernstein and Borkovec according to the concept of progressive muscle relaxation in 16 steps by Jacobsen (40, p.56–58) with the three respective phases focusing-tension-relaxation. After having executed the 16 steps of tensing and relaxing certain groups of muscles, a one- to two-minute body scan is added. The PMR ends with a retraction phase instructing the participant to move, stretch and come back to the here and now. At the end of every session (PMR 1–4), the relaxation experience is discussed with the subject and if needed therapeutic support is given. The patients are instructed to practice the PMR at home to channel automation and to use the PMR for performance situations in case of beginning symptoms of anxiety. Same as for the VRET, a worksheet for documenting training sessions and experiences at home is given to the participants. Furthermore, the participants of the PMR group receive two audio files with the spoken PMR-instructions (female and male voice) to practice at home. In the last session (PMR 4) positive experiences within the scope of the PMR training are summarized and a short psychoeducational input regarding relapse prevention is given to the musicians.
2.5 | Assessments: Before and after therapy, and at follow-up
In the course of the study different behavioral, neurobiological, physiological, and psychological assessments (see Table 3) are conducted at the different points of measurement (S, T0, VRET 1–4/ PMR 1–4, T1, T2). To record the fear reaction of the participants, various parameters of the emotion fear are measured. In addition to basic psychometry and SUDs, the physiological reaction (heart rate, blood pressure), and the endocrine reaction (cortisol in blood and saliva) are assessed. Molecular influencing factors are determined in the blood and therapy-related dynamics as well as exposure-related changes of the (epi-) genetic markers, the proteome, and metabolome are examined. Primary and secondary outcome parameters are described in detail in further chapters (see 2.6 and 2.7).
Table 3
Overview of assessments arranged to the type of measurement
Assessment | Screening (S) | Pre (T0) | VRET 1–4/ PMR 1–4 | Post (T1) | Follow-up (T2) |
Behavioral | | | | | |
BAT | | X | | X | X |
Neurobiological | | | | | |
Blood sampling | | X | | X | X |
Saliva sampling | | X | | X | X |
Physiological | | | | | |
HRV | | X | | X | X |
Heart rate | | X | X | X | X |
Blood pressure (RR) | | X | X | X | X |
Clinical and psychological | | | | | |
Diagnostic criteria for MPA as social phobia (DSM-5) and as specific phobia (ICD-10) | X | | | | |
Mini DIPS | X | | | | |
Bühnenangstfragebogen (BAF) | | X | | X | X |
Kenny Music Performance Anxiety Inventory (K-MPAI) | | X | | X | X |
Fragebogen zum Sicherheits- und Vermeidungsverhalten bei Musikerinnen und Musikern | | X | | X | X |
Social Interaction Anxiety Scale (SIAS) | | X | | X | X |
Social Phobia Scale (SPS) | | X | | X | X |
Beck Depression Inventory (BDI-II) | | X | | X | X |
Beck Anxiety Inventory (BAI) | | X | | X | X |
Anxiety Sensitivity Index (ASI-3) | | X | | | |
State Trait Anxiety Inventory (STAI-Trait) | | X | | | |
Childhood Trauma Questionnaire (CTQ) | | X | | | |
Immersive Tendencies Questionnaire (ITQ) | | X | | | |
Uncertainity Tolerance Scale (UGTS) | | X | | | |
Positive and Negative Affect Schedule (PANAS) | | X | | X | X |
Questionnaire CAVE Pre_before BAT | | X | | | |
Questionnaire CAVE Pre_after BAT | | X | | | |
Questionnaire CAVE Post_before BAT | | | | X | |
Questionnaire CAVE Post_after BAT | | | | X | |
Questionnaire CAVE Follow-up_before BAT | | | | | X |
Questionnaire CAVE Follow-up_after BAT | | | | | X |
SUDs | | X | | X | X |
Questionnaire_before VRET/PMR | | | X | | |
Questionnaire_after VRET/PMR | | | X | | |
Questionnaires: BAF (41), K-MPAI (42), Fragebogen zum Sicherheits- und Vermeidungsverhalten bei Musikerinnen und Musikern (43, p.207), SIAS (44, 45), SPS (45, 46), BDI-II (47), BAI (48, 49), ASI-3 (50), STAI-Trait (51, 52), CTQ (53), ITQ (54), UGTS (55), PANAS (56) and further questionnaires being assessed before and after each BAT and therapy training session.
2.5.1 | Behavioral assessment in the CAVE
In order to assess MPA in a condition as real as possible on the different levels (subjective and physiological), a BAT is performed in the CAVE (software from VTplus GmbH, Institute of Psychology I at the University of Würzburg). The CAVE system is a room sized 4 × 3 × 3 m), in which a virtual environment is projected onto the four walls and floor via stereoscopic images. The stereoscopic impression is created by passive interference filtering glasses worn by the subjects (57). The test person can move around freely with his/her instrument without any problems. During this behavioral test, the participants are instructed to present two different pieces of music over approximately 10 minutes playtime. In that time of the behavioral test the HRV will be measured simultaneously, using appropriate sensors and electrodes (Movisens®). Additionally, subjective fear ratings (SUDs) will be assessed.
After finishing the preparations for the BAT (e.g. attaching the microphone, head tracker, and electrodes for measuring the heart rate), blood pressure and heart rate are measured. The musicians are instructed to behave and interact with the jury (e.g. respond to questions) as if it was a real audition. Also, they are briefed about the subjective fear ratings ranging from 0 “no fear at all” to 100 “maximum fear”, that are assessed throughout the BAT over a microphone. When entering the CAVE, a physiological baseline for the HRV is recorded. Therefore the participants have to sit and to stand respectively one minute in a neutral CAVE (white walls) while doing nothing. After measuring the baseline, the subjects are asked to rate their anticipatory and maximum expected fear of the audition. Subsequently, they are told that the audition will start now and the CAVE shifts to display a waiting room in which the musicians find themselves waiting in front of the door of a big concert hall. To help the participants immersing into the situation, short, standardized instructions are read at the beginning of the waiting phase and when entering the stage. At the beginning and at the end of the waiting phase, the participants are asked to rate their actual fear (0-100). Thereafter, participants “enter the stage” by shifting the CAVE to display a concert hall with other musicians waiting in the background and people sitting in the audience. The musicians are instructed to wait for another minute until the jury will arrive, and to feel free to look around and prepare for their performance. Also they are asked again to rate their current fear (0-100). Finally the jury, consisting of two men and two women, pops up in the first row of the audience and another fear-rating (0-100) is assessed. When the jury appears they seem busy (e.g. looking down at their notes or around). After 30 seconds all the jury members lay back and start to look up at the participant and a male juror begins to greet the participant. He asks the musician to introduce her-/himself and the first piece he/she is going to perform. When the musician has finished answering, the juror reacts confirming the choice of the piece and asks the musician to start playing/ singing when ready. After three minutes, the participants are being interrupted by the jury even though they have not finished their first piece yet. Another fear rating is assessed asking for the maximum fear while playing (0-100) and the actual fear after having finished the first piece (0-100). Finally, a female jury member asks the musician for the second piece that the musician is going to present and prompts the participant to start the performance. Following the same procedure as for the first piece, the musician is being interrupted by the jury at latest after having played three minutes, which is followed by the fear ratings to assess the maximum fear while playing (0-100) and the actual fear after having finished the second piece (0-100). Thereafter, the female jury member says goodbye and asks the participants to wait outside. The audition is ended and the BAT ends with again recording a physiological baseline for the HRV in a neutral CAVE (e.g. white walls), same as at the beginning. After recording the baseline, the participants are asked for the last time to rate their actual fear (0-100) and are then asked to leave the CAVE. Outside the CAVE, blood pressure and heart rate are measured again and microphone, head tracker and electrodes are being removed.
In addition, during the BAT, the musical performance is recorded by a recorder (Zoom H4n®) which is attached to a microphone stand in front of the musician.
The total duration of the CAVE examination is estimated to last approximately 120 minutes of which 50 minutes are estimated for the BAT, its preparations and debriefing (e.g. attaching the microphone, head tracker and electrodes for measuring the heart rate, recording the physiological baseline and measuring heart rate and blood pressure before and after the BAT), 40 minutes to fill out the questionnaires, and 30 minutes to take blood and saliva samples in the beginning and at the end of the examination.
Concerning the pieces of music, the participants are told two weeks in advance to present a slow and a virtuosic piece of music of moderate to high degree of difficulty, similar to a real audition. Participants have to present the same two pieces of music at all time points (T0 to T2).
2.5.2 | Blood samples
At the CAVE examination (BAT, T0/T1/T2) blood samples include 2x 1.6 ml EDTA for (epi-)genetics, 1x 1.6 ml EDTA for blood count, 2x 9.0 ml EDTA for proteins, 2x 9.0 ml serum for metabolome and endocrine analysis as well as saliva samples of 2x 1.1 +/- 0.3ml. In total, this amounts to 3 x 37.8 ml blood samples as part of the CAVE examination.
2.6 | Primary Outcome Parameters
The primary outcome parameters are therapy effects on MPA measured by the Bühnenangstfragebogen (BAF) as well as changes in cardiac reactivity using HRV. Therapy response criteria for MPA are defined by a decrease of at least 50% of the BAF total score compared to T0. Remission ist defined by no psychological distress or impaired functioning due to the symptoms. Regarding the cardiac reactivity, HRV is not yet sufficiently established as a biomarker for music performance and there is no reliably data in this area so far which is why we evaluate this data exploratively. However, in case of a successful reduction of MPA we expect a significant higher HRV during the BAT after therapy compared to before therapy.
2.6.1 | Psychometric assessments for MPA
Specific questionnaires on performance anxiety have already been established since the 1990 (4, 58). As dimensional measures of MPA the German Bühnenangstfragebogen (BAF) (41) and a german translation of the revised Kenny Music Performance Anxiety Inventory (K-MPAI-R, 42) are assessed to measure subjective symptoms of MPA. Since the K-MPAI (K-MPAI-R, 42) has not been validated yet and is considered less sensitive to detect chances in MPA over time than the BAF (2, 43), we only consider the BAF (41) as primary outcome measure in this study.
The BAF (41) is a validated german translation of the Performance Anxiety Questionnaire by Cox and Kenardy (58) and consists of 20 Items assessing MPA separately for solo and ensemble performances. The items are measuring cognitive and bodily symptoms of MPA and have to be rated by their frequency of occurrence during performance situations (“How often do you experience those thoughts or feelings meanwhile a musical performance situation?”) on a five-point scale ranging from 1 “never” to 5 “always”.
BAF scores are assessed before the BAT at pre- (T0), post- (T1) and after six month follow-up-treatment (T2) to detect changes in MPA symptoms over time. We hypothesize that musicians of the experimental group show significantly lower overall score measured by the BAF (41) at T1 and at T2 compared to T0 representing a successful VRET with lasting effects. Furthermore, we expect this reduction of the BAF (41) overall scores to be significantly stronger the experimental group (VRET) than for the control group (PMR).
2.6.2 | Primary outcome: Heart rate variability
Heart rate variability (HRV) describes the variability in the time intervals between consecutive heartbeats. Since our cardiovascular system has to adapt constantly to changing conditions to adjust to physical and psychological challenges, the oscillations of a healthy heart are complex and constantly changing (59). Measuring HRV provides a non-invasive method to assess the cardiac autonomic modulation. Since its easy application and tolerability to the proband, HRV is widely used and well established in different scientific fields, not only applied in humans but also animal studies. Today, common HRV devices record a one-channel electrocardiography, from which the heart rate and the HRV can be derived, but are also capable to measure additional parameters such as body temperature, acceleration, and air pressure.
There are many different approaches to calculate HRV. A detailed description about HRV metrics and norms, and the different parameters is given by Ginsberg and Shaffer (59). Giving a brief summary, HRV can be described using time-domain bound approaches like the standard deviation of normal-to-normal intervals (SDNN) and the root mean square of successive differences (RMSSD), frequency-domain bound approaches like high frequency power (HF) and low frequency power (LF), and non-linear approaches. All parameters are rooted in the variability observed between consecutive R waves (RR-intervals), which is why they are highly correlated. Till this day, the superiority of any parameter across settings has not been demonstrated, even though it has been shown that some parameters represent different aspects better than others (59, 60, 61, 62). Selecting the recommended measure and interpreting the data, context factors and subject variables have to be considered e.g. method and length of recording (24h, short-term (~ 5min), ultra-short-term (< 5min)), sampling frequency, removal of artifacts, respiration, position, movement, tasks and demand characteristics, age, sex, heart rate and health status (59).
In the scientific discourse, some authors emphasize the value of HRV even as a „biomarker“ (63, 64) respectively as a „psychophysiological marker for physical and mental health“ (62, 65). A current meta-analysis on HRV in patients with anxiety disorders found that especially resting-state HRV in patients with anxiety disorders, in general, was significantly lower than in healthy controls with a low to a medium effect size of Hedges’ g 0.39 reflecting a „robust feature of anxiety disorders“ (66, p. 9–10).
With regard to music performance, HRV has already been investigated in some studies and described as a correlate of fear in the context of performance anxiety (8, 14, 67) but has not been established as a Biomarker. Just a few studies suppose a correlation between autonomic cardiac modulation and the flow experience during music performance (68, 69). Concerning therapeutic interventions, a randomized-controlled trial investigated a single-session biofeedback training on MPA and HRV with positive effects on HRV of a slow breathing group in comparison to controls (65). Two studies on anxiety levels during music performance using VR already investigated HRV, although the sample size was very small (30, n = 5, 34). In particular, Williamon et al. used HRV to calculate the LF/HF ratio with a subgroup of violinists (n = 7) and found significant differences emerged in physiological responses for a pre-performance period (real audition vs. simulated) which interestingly did not correspond to higher anxiety scores assessed with the STAI-Y1 (70). Referring to the impact of context factors as mentioned above the type of instrument (wind, strings, singing, etc.) and its position of playing could play a decisive role in HRV analysis which has not yet been investigated sufficiently in musicians.
In our study, we use primarily a biobehavioral approach (71) investigating the following parameters of HRV: heart frequency [bpm], SDNN I [ms], RMSSD [ms], pNN50 I [%], LF [ms2], HF [ms2] and LF/HF-Ratio. The HRV is assessed meanwhile the BAT and we expect a higher HRV to represent less cardiac stress. Therefore, we assume that a successful reduction of MPA goes along with a significant higher HRV in the BAT at T1 and T2 compared to T0.
2.7 | Secondary outcome measures
Blood-based parameters are easy to access in patients and therefore could have an immense use in diagnostic and predictive ways within the personalized medicine approach. In the present study, we focus on stress-related endocrine measurements, neuropeptides and DNA-methylation as measurable parameters in the peripheral blood to explore 1) their utility as a proxy for the fear-related phenotype, 2) potential predictive use for therapy response and 3) the dynamic changes within the therapy course.
Stress is one of the important factors contributing to the development and relapse of various anxiety disorders. In a recent meta-analysis, cumulative HPA-axis parameters and autonomous nervous system markers were associated with anxiety arousal and fear severity (72). Given the first reports on CBT-related attenuation of HPA system activation in anxiety disorders, we hypothesize that successful VR exposure treatment leads to the reduction of stress elicited cortisol levels within the BAT to both time points, T1 and T2 and that this reduction is related to clinical response. Furthermore, we will analyze whether the magnitude of cortisol release during the T0 BAT influences the clinical response to T1 treatment effects in a predictive way. Lastly, we investigate, how cortisol levels in T0 and T1 can project on MPA symptoms 12 months after the treatment (T2).
Neuropeptides serve as messenger substances or as co-transmitters, i.e. with 5-HT, GABA or norepinephrine. A number of neuropeptides have been shown to modulate anxiety-related behavior in various preclinical models (73). In the present study, we focus on neuropeptides with modulating effects on humoral and behavioral stress response, and myocardial as well as autonomic nerve system function (ANS), such as Atrial natriuretic Polypeptide (ANP) (Marazziti et al., 2021), neuropeptide Y (NPY) (74), oxytocin (OXT) (75), and the enzyme salivary Alpha-Amylase (AA) (76).
Genetic factors contribute to the aetiology of anxiety disorders (AD) with heritability estimates ranging around 20–40% for special phobias and social anxiety disorder (77). Following this, environment acts as an important contributor to molecular processes which lead to the development of pathological anxiety states in both conditions (78). In this context, epigenetic processes have been increasingly studied in AD as a possible link between environment and molecular function (79). Epigenetic modifications describe gene regulatory processes without changing the original DNA, such as micro-RNA (miRNA), histone modifications and DNA-methylation (DNAm), the latter being the most studied epigenetic mechanism in psychiatry to date. An increasing number of epigenetic studies suggest that the dynamics of DNAm and subsequent gene regulatory changes modulate the effects of psychological therapies and could serve as a possible biomarker for short- and long-term therapy success (80). In the present study we investigate mDNA of the candidate genes related to the endocrine stress reactivity (Glucocorticoid receptor gene (NR3C1), FK binding protein 5 (FKBP5), corticotropin releasing hormone receptor 1(CRHR1)), which is hypothesized to be modulated by successful exposure therapy. Furthermore, Norepinephrine Transporter (NET) and Oxytocin Receptor (OXTR) genes were selected for DNAm analysis due to their implication in the regulation of autonomous function and social anxiety.
Regarding cardiac stress by measuring heart frequency and blood pressure and regarding MPA by assessing SUDs during the BAT, a significant difference indicating less cardiac stress or lower ratings of MPA during the BAT between the time before and after therapy is expected.
Summarizing, an overview of all primary and secondary outcome parameters and its measures is given in Table 4.
Table 4
Overview of the constructs and measures
Construct | Measure |
Primary Outcome | |
MPA symptoms | Bühnenangstfragebogen (BAF) |
Cardiac reactivity | Heart rate variability (HRV) |
Secondary Outcome | |
MPA during BAT | Subjetive units of discomfort (SUDs) |
Social anxiety symptoms | Social Interaction Anxiety Scale (SIAS) |
Social phobia symptoms | Social Phobia Scale (SPS) |
Depression symptoms | Beck Depression Inventory (BDI-II) |
Anxiety symptoms | Beck Anxiety Inventory (BAI) |
Anxiety sensitivity | Anxiety Sensitivity Index (ASI-3) |
Trait anxiety | State Trait Anxiety Inventory (STAI-Trait) |
Childhood-history of abuse and neglect | Childhood Trauma Questionnaire (CTQ) |
Immersive tendencies | Immersive Tendencies Questionnaire (ITQ) |
Uncertainty Tolerance | Uncertainity Tolerance Scale (UGTS) |
State affect | Positive and Negative Affect Schedule (PANAS) |
Cardiac stress | Heart rate, Blood pressure |
Psychophysiological stress response | Salivary Cortisol hormone |
| Salivary Alpha-Amylase hormone Atrial natriuretic Polypeptide (ANP) |
| Neuropeptide Y (NPY) |
Psychosocial stress response | Oxytocin (OXT) |
DNA-methylation in peripheral blood | Glucocorticoid Receptor (NR3C1), FK binding protein 5 (FKBP5), Corticotropin releasing hormone receptor 1 (CRHR1), Oxytocin Receptor (OXTR), Norepinephrin Transporte (NET) |
Subjective MPA symptoms | Kenny Music Performance Anxiety Inventory (K-MPAI) |
Use of MPA specific avoidance and safety behaviors | Fragebogen zum Sicherheits- und Vermeidungsverhalten bei Musikerinnen und Musikern |
2.8 | Data processing and statistical analysis
The primary parameter for the evaluation of the therapeutic effects of this study is the MPA questionnaire, with the expected superiority of VRET group compared to the relaxation training group regarding the decrease of the reported symptomatology immediately after the therapy.
For the primary analyses, an intention to treat (ITT) analysis will be performed for all included participants, except for those withdrawing consent during the study or not receiving at least one treatment. Missing values will be imputed using a last value carried forward approach. No interim analysis will be performed. The analysis is performed by ANCOVA with measurement repeat factor time (pre-post), the factor group (experimental-control), and the baseline values of the respective variable as covariates.
Regarding HRV, we analyze the heart rate (bpm) and RMSSD as an HRV indicator. Heart rate and RMSSD each serve as a model outcome in multivariate analyzes with multiple predictors (e.g. age, gender, instrument group).
2.8.1 | Sample size calculation and number of VR sessions
There are only a few studies that have examined VR and in particular VRET for treating MPA (33, 34). Samples sizes were very small (30, case study, n = 5, 32, n = 17 ) and with regard to their methods extremely heterogeneous. Likewise, the study situation on the use of PMR for performance anxiety is extremely small (e.g. music-assisted PMR in pianists, (19); PMR vs. propranolol, (7). Corresponding meta-analyses are lacking.
A recent meta-analysis by Carl et al. (81) from 30 studies with 1057 patients examined the effectiveness of VR exposure for anxiety and anxiety-related disorders in general. Here, a large effect size was reported for the VRET, with a Hedges' g of g = 0.88 compared to waiting list conditions (g = 0.88, SE = 0.10, 95% CI: 0.69–1.07). Compared to an active control condition, such as relaxation exercise including PMR, the effect size was g = 0.78 (SE = 0.27, 95% CI 0.25–1.31). This corresponds to an effect size of f = 0.39, which we used in GPower (version 3.1.9.7, alpha = 0.05, power = 0.95, r = 0, with 2 groups and 2 measurements (pre/post)) to determine the sample size for a significant interaction used in the between-group and time-point repeated measures ANOVA. The total sample size was calculated as n = 46. Taking into account a drop-out of around 10% during the study, 52 patients are defined as a sufficient total sample size for the planned examination of MPA.
Regarding the number of VR sessions mentioned in this meta-analysis (81) in the case of specific phobias, such as fear of performing, very good effects were achieved with 4 sessions. Even with just one intervention, there was an effect size of g = 0.88 with n = 77 participants (82). In studies on social phobia, which defines complex symptoms with relevant limitations in several social situations in addition to performance anxiety, Harris et al. (83) still report a g = 0.91 with 4 sessions (81, Table 2). The only VR study on performance anxiety listed in this meta-analysis conducted a total of 6 exposure sessions on music students with a Hedges' g of g = 0.14. However, the examined music students did not suffer from clinically diagnosed performance anxiety, which reflects the weak evidence within this form of anxiety disorder as described earlier (n = 17; 32). The authors of this meta-analysis state in summary that “there was no effect for the number of treatment sessions in either VRET versus control (p = 0.87) or in vivo comparisons (p = 0.67)“ (81, p.30).
In addition, an even more recent meta-analysis (11 studies with 626 patients) examined the benefit of cognitive-behavioral therapy with the additional use of VR (Virtual Reality-Assisted Cognitive Behavioral Therapy, VRCBT) and was able to demonstrate positive effects of VRCBT in anxiety disorders under these conditions (84, p.7, SMD − 0.92), although calculations of effect size on the number of sessions were missing. The authors conclude that further research is needed to confirm the benefit of VRCBT with different subtypes of anxiety disorders (84, p.8). In this context, MPA, a subentity of social anxiety, can be listed here.
In the light of this knowledge and taking into account that the majority of our participants in the study does not suffer from any general social phobia, but rather a very defined performance anxiety as mentioned above, 4 sessions for the sample size are considered as sufficient and we do not expect any loss of power.