This work takes preliminary steps toward creating a conversational agent that can give advice convincingly and with empathic verisimilitude. The study explores whether a voice assistant should express emotional support and empathy about a personal problem or provide informational, advice-only support, as suggested by prior studies with different artificial entities (de Gennaro et al., 2020a; B. Liu & Sundar, 2018b; Morris et al., 2018). Although the expression of empathy and emotion is supportive in human-to-human communication, will the same hold when we converse with an artificial entity, or will we reject it due to its artificiality, uncanniness, age-related emotional or social needs, or biases? Vocal cues such as pitch and speech rate, meanwhile, are salient in judging the personality of voices, triggering existing stereotypes, and identifying with and developing relationship bonds (Chang et al., 2018; H. Liu et al., 2010). We designed two age stimuli by changing speech rate and pitch on the Voiser platform2: a "mature voice" (speed: 0.9, pitch: -4) resembling a roughly 60-year-old male and a "young voice" (speed: 1.25, pitch: 6) resembling a roughly 20-year-old male, to understand how such vocal cues affect older adults' relationship building with voice assistants. We used only male voices to keep the scope of the work focused and because voice assistants are typically given female voices by default. After producing the voices on the platform, we asked 60 participants to guess the voice assistants' ages as a manipulation check; all participants agreed on the intended age ranges.
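The two voice conditions can be summarized as a small configuration sketch (Python). The speed and pitch values are the Voiser settings reported above; the data structure and helper function are ours for illustration and are not part of any platform API:

```python
# Illustrative definition of the two age-related voice stimuli.
# "speed" and "pitch" are the Voiser settings reported in the text;
# the dictionary itself is an assumption, not the platform's API.
VOICE_CONDITIONS = {
    "mature": {"speed": 0.9, "pitch": -4, "intended_age": "~60-year-old male"},
    "young": {"speed": 1.25, "pitch": 6, "intended_age": "~20-year-old male"},
}

def describe(condition: str) -> str:
    """Return a human-readable summary of one voice condition."""
    c = VOICE_CONDITIONS[condition]
    return f"{condition}: speed={c['speed']}, pitch={c['pitch']} ({c['intended_age']})"
```

Keeping the stimulus parameters in one place like this makes the manipulation easy to report and reproduce.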
This study expects the evidence to support similarity attraction theory when the participants' age and identity traits match the CA's perceived vocal cues. A mixed-methods study is planned to explore which voice characteristics of conversational agents Turkish older adult users prefer and which personality traits they associate with voice pitch. One purpose of this research is to understand how a specific in-group versus out-group social cue (an 'older' versus 'younger' voice) influences older people's perceptions of conversational agents. Based on similarity attraction and social identification theory, it can be postulated that older adults may identify and bond more strongly with a conversational agent that has a mature voice. The other purpose is to explore how older adults' perceptions vary depending on the VA's level of empathic expression.
Using the CASA paradigm, SAT, and UVM as a foundation, our goal is thus to provide a preliminary understanding of: 1) Do older adults categorize voice assistants as "tool-like" or "human-like"? 2) What are older adults' perceptions of having social interactions, depending on their cultural biases, age-related emotional or social needs, or stereotypes? 3) How do older adults' reactions to conversational agents change when the agent's voice characteristics and level of empathic expression vary?
We set three hypotheses for the quantitative part of the study:
H1: Participants will trust more and feel more supported by a VA with a mature voice.
H2: Participants will perceive stronger support and trust with a VA with more empathic expression.
H3: Participants’ perceived self-efficacy toward new technologies will increase after the conversation.
3.1 Research Procedure and the Experimental Set-up with Wizard of Oz
Our research process was based on a multi-method approach comprising three successive stages: pre-test, momentary test, and post-test. In the pre-test interviews, we first focused on understanding participants' initial impressions of new technologies, synthetic voices, and robots, and their perceived self-efficacy toward new technologies. We also aimed to understand how they handle stressful situations and their age-related patterns in how they conceive of empathy. Participants were also asked to rate their self-efficacy in using new technologies on a 5-point scale based on the Tsai and Tsai (2003) internet self-efficacy construct.
This research used a voice-based CA prototype tested with the Wizard of Oz technique within a multi-method approach. In a Wizard of Oz setting, participants are led to believe they are interacting with an autonomous system while its actions are actually operated by a remote experimenter, or "wizard" (Dahlbäck et al., 1993; Large et al., 2019; Medhi Thies et al., 2017). Participants were told they were interacting with a conversational agent that responded to their answers automatically; in fact, the wizard communicated with them through a pre-planned script.
To test our voice assistant prototype, we pre-recorded audio clips to be played in response to a wide range of user utterances. We built a soundboard in PowerPoint, hyperlinking each audio clip to a button for its response. The wizard played the appropriate button as the voice assistant's reply to the user's response or request. To convince participants that the audio was being played by Google Home (Assistant), we connected our laptop to the device via Bluetooth. The Google Home remained visible but muted, letting us operate the prototype from the computer during the tests.
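The wizard's soundboard is essentially a lookup from the participant's utterance type to a pre-recorded clip. A minimal sketch (Python; the intent labels and file names are hypothetical — in the study this mapping lived in PowerPoint hyperlinks, not code):

```python
# Hypothetical soundboard: maps the wizard's classification of the
# participant's utterance to a pre-recorded audio clip to play.
SOUNDBOARD = {
    "greeting": "audio/greeting.wav",
    "small_talk_mood": "audio/ask_mood.wav",
    "suggestion_meditation": "audio/suggest_meditation.wav",
    "sensitive_question": "audio/sensitive_question.wav",
    "wrap_up": "audio/goodbye.wav",
}

def select_clip(intent: str) -> str:
    """Return the clip for the wizard-chosen intent, with a
    fallback prompt for unrecognized utterances."""
    return SOUNDBOARD.get(intent, "audio/could_you_repeat.wav")
```

The fallback clip mirrors a common Wizard-of-Oz practice of keeping a generic re-prompt ready for off-script utterances.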
During testing, we observed participants' gestures and how they positioned themselves while conversing, to see whether they used human-like conversational norms such as nodding, uh-hums, or facing the device.
In post-test interviews, we focused on exploring the user experience in depth, including usage patterns, needs and challenges, participants' ontological perception of voice assistants, their tendency to use conversational norms when interacting with a voice assistant, their mood change, and their change in perceived self-efficacy after the conversation.
3.2 Momentary Test Design and the Dialog Flow
We aimed to present older adult users with an empathic, advice-giving voice assistant on the topic of the Covid-19 quarantine. First, to determine the dialogue flow, we surveyed 60 older adult users about their Covid-19 experiences and emotions. Second, to measure the voice assistant's degree of empathic expression, another 60 older adult users rated our voice assistant's empathy based on the pre-prepared dialogue flow script. Before rating, all 60 raters were given definitions of empathy and example statements expressing high and low empathy, consistent with our theoretical framework. Ratings were made on a five-point scale (1 = low empathy; 5 = high empathy). In the high empathic expression condition, statements expressing empathy were added, such as "I feel very sorry that you felt so anxious during the Covid pandemic lockdown. Most people experienced the same issues, but your age group suffered the worst harm" and "I am truly impressed by your strong character. Many people could not handle this stressful quarantine time so resiliently." In the low empathic condition, statements were mostly formal and focused only on advice-giving, such as "Feelings of anxiety, being trapped, and numbness have also been reported before due to long isolation. Try to stay calm" and "You can try a meditation application on your own if you feel stressed and anxious due to lockdown."
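The empathy rating round amounts to a manipulation check: the high-empathy script should score clearly higher on the five-point scale than the low-empathy one. A minimal sketch (Python; the rating lists and the one-point criterion are made-up placeholders, not study data):

```python
from statistics import mean

def manipulation_check(high_ratings, low_ratings, min_gap=1.0):
    """Compare mean 5-point empathy ratings of the two scripts.
    Returns both means and whether the high-empathy script was rated
    at least `min_gap` points higher (an illustrative criterion)."""
    hi, lo = mean(high_ratings), mean(low_ratings)
    return hi, lo, (hi - lo) >= min_gap

# Placeholder ratings for illustration only:
hi, lo, ok = manipulation_check([5, 4, 5, 4], [2, 1, 2, 2])
```

In practice, such a check would be run on the full set of 60 raters' scores, ideally with a paired significance test rather than a fixed gap.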
We designed and conducted a study with a 2×2 within-subjects factorial design; the factors were voice characteristics (mature vs. young) and level of empathic expression (high vs. low), and we measured their effects on social (perceived support and trust) and functional (perceived self-efficacy toward the voice assistant) outcomes for older adult users, using validated scales.
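The 2×2 within-subjects design yields four conditions that every participant experiences; a sketch of enumerating them (Python; the condition labels are ours):

```python
from itertools import product

VOICES = ("mature", "young")       # voice characteristic factor
EMPATHY = ("high", "low")          # empathic expression factor

# Each participant experiences all four voice x empathy combinations.
CONDITIONS = [f"{v}-voice/{e}-empathy" for v, e in product(VOICES, EMPATHY)]
```

In a real session, the order of the four conditions would typically be counterbalanced across participants (e.g., via a Latin square) to control for order effects.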
Table 1
Sources for Construct Items

| Construct | Adapted from |
| --- | --- |
| Perceived support | van der Zwaan et al. (2012) |
| Trust | Klein (2007) |
| Perceived self-efficacy | Tsai and Tsai (2003) |
Depending on their group, participants interacted with the voice assistants through a four-part conversation: an initial greeting, small talk, suggestions, and sensitive questions. First, we asked participants to summarize their moods during the COVID-19 pandemic. A suggestion session followed the greeting and small-talk sessions, and the conversation then gradually moved to sensitive questions, after which the CA wrapped up. The two dialogue types covered the same conversational topics and suggestions but differed in the CA's level of empathy; the high-empathy flow gave the CA the semblance of personalized, empathic expression. We chose the experience of the Covid pandemic quarantine as the topic because it was the most sensitive and timely situation for older adults in Turkey to discuss. Adapting previous related work, we created our four-step dialogue flow (de Gennaro et al., 2020b; Y. C. Lee, Yamashita, & Huang, 2020; Y. C. Lee, Yamashita, Huang, et al., 2020; Lucas et al., 2018).
3.3 Participants
Participants ranged in age from 65 to 75 and were required to have no prior experience with voice assistants (e.g., Alexa, Siri, Cortana). All participants used computers, smartphones, or tablets at least once daily. The Galatasaray University ethics board approved the study, and participants signed written informed consent for the use of the collected data before taking part.
2 A platform that converts text to speech with humanoid synthetic voices. The platform allowed us to control the gender, pitch, and speech rate of the agent. https://voiser.net/