Skip to main content

Effects of communication style, anthropomorphic setting and individual differences on older adults using voice assistants in a health context

Abstract

Background

Voice assistants enable older adults to communicate regarding their health as well as facilitate ageing in place. This study investigated the effects of communication style, anthropomorphic setting, and individual differences on the trust, acceptance, and mental workload of older adults using a voice assistant when communicating health issues.

Methods

This is a mixed-methods study utilising both quantitative and qualitative methods. One hundred and six older adults (M = 71.8 years, SD = 4.6 years) participated in a 2 (communication style: social- vs. task-oriented; between-subject)\(\times\) 2 (anthropomorphic setting: ordinary profession vs. medical background; within-subject) mixed design experiment. The study used multivariate analysis of variance (MANOVA) to examine the effects of communication style, anthropomorphic setting of the voice assistant, and participants’ use frequency of digital devices on the trust, technology acceptance, and mental workload of older adults using a voice assistant in a health context. End-of-study interviews regarding voice assistant use were conducted with participants. Qualitative content analyses were used to assess the interview findings about the communication content, the more trustworthy anthropomorphic setting, and suggestions for the voice assistant.

Results

Communication style, anthropomorphic setting, and individual differences all had statistically significant effects on older adults’ evaluations of the voice assistant. Compared with a task-oriented voice assistant, older adults preferred a social-oriented voice assistant in terms of trust in ability, integrity, and technology acceptance. Older adults also had better evaluations for a voice assistant with a medical background in terms of trust in ability, integrity, technology acceptance, and mental workload. In addition, older adults with more experience using digital products provided more positive evaluations in terms of trust in ability, integrity, and technology acceptance.

Conclusions

This study suggests that when designing a voice assistant for older adults in the health context, using a social-oriented communication style and providing an anthropomorphic setting in which the voice assistant has a medical background are effective ways to improve the trust and acceptance of older adults of voice assistants in an internet-of-things environment.

Peer Review reports

Introduction

Off-the-shelf voice assistants (for example, Amazon Alexa and Google Home) provide a cost-effective means for older adults to communicate health information at home and thus facilitate ageing in place. In the internet-of-things (IoT) era, IoT voice assistants have become a gateway for users to connect with smart spaces [1]. People can converse with their IoT smart devices, such as smart lights or smart TVs, as with a friend via an IoT voice assistant in a smart home environment [1]. An IoT voice assistant could act as a health assistant for older adults, such as by providing updated dietary health information [2], asking health-related questions [3], reducing loneliness [4], and encouraging fitness exercise [5, 6], all of which could reduce pressure on the health care system during the coronavirus disease-2019 (COVID-19) pandemic [7]. In addition, an IoT voice assistant benefits older adults because interaction with voice assistants is more natural for such adults than interaction with smartphones or computers [2].

The challenges in designing a voice assistant for older adults involve the problems of having trust in the voice assistant, technology acceptance and mental workload. Older adults may abandon using the voice assistant as they may not consider it trustworthy, specifically in sensitive contexts (for example, health care) [8]. According to Mayer et al., the three factors that determine trustworthiness are the ability, integrity, and benevolence of the trustee [9]. “Ability” is defined as a “group of skills, competencies, and characteristics that enable a party to have influence within some specific domain”. “Integrity” means “the trustee adheres to a set of principles that the trustor finds acceptable”. “Benevolence” is “the extent to which a trustee is believed to want to do good to the trustor, aside from an egocentric profit motive” [9]. Technology acceptance is also important for older adults because the current generation of older adults who did not grow up using this type of technology have more difficulty understanding technological functions and instructions than younger people [10]. Mental workload is also of interest since it can reflect older adults’ frustration level in interacting with a voice assistant through subjective self-assessment [11]. “Mental workload” is defined as the degree to which tasks require limited physical and psychological resources of human beings [12]. Trust, technology acceptance, and mental workload are frequently used as indicators of user experience in human–computer studies, while higher trust, technology and lower mental workload indicate better performance with respect to user experience [13,14,15].

The objective of this study is to understand older adults’ requirements and preferences regarding voice assistants in a health-related context. The study examines the voice assistant as a tool to provide health information and guide older adults in following a healthy lifestyle. We designed an experiment to investigate the effects of anthropomorphic setting (professional background vs. ordinary occupation), voice assistant communication style (social- vs. task-oriented), and individual differences between older adults in their trust, acceptance, and mental workload while interacting with a health-related IoT voice assistant.

Related research and hypothesis

Effect of communication style

The communication style of a voice assistant may affect user evaluation. People are more likely to trust artificial agents with a happy tone and more familiar language styles [16, 17]. Studies have examined the effect of voice assistant communication style in various scenarios. Bickmore found that engaging in small talk with a voice assistant can be effective in settings such as customer service, health care, and weather information, but it may not contribute much in more serious settings, such as financial transactions or military training [18]. In an online shopping task, a social-oriented voice assistant could enhance trust at a website for older adults with high internet competency, but for those with low internet competency, a task-oriented voice assistant provided more significant cooperative and functional advantages [14].

In actual physician–patient communication, physicians seem to use a more task-oriented than social-oriented communication style. Yoo et al. found that text messaging between adolescents with asthma and clinicians was focused more on task-focused behaviours (345 times in 314 text messages) over socioemotional behaviours (237 times in 314 text messages) [19]. Chinese medical personnel seldom use affective descriptors but, rather, focus on technical tasks [20]. Bradley et al. found that the consultative style (i.e., doctor and patient as a collaborative team) brings higher satisfaction to patients than the authoritative style (i.e., doctor as an expert; more task-oriented) [21]. If a voice assistant is to behave as a health agent for older adults, it could be designed to communicate similarly to a real doctor. In this study, our focus is on examining whether the communication style (social-oriented vs. task-oriented) of voice assistants affects older adults’ evaluation of voice assistants in the health care context. Thus, we propose the following hypothesis:

H1: The communication style of a voice assistant (social- vs. task-oriented) has a significant effect on the trust, acceptance, and mental workload of older adults in the health-related context.

Effect of anthropomorphic setting

Previous studies have examined the effect of anthropomorphic setting on trust in robots. People consider voice assistants with female voices to be friendlier [22] and are more inclined to use voice assistants with a familiar appearance and mannerisms similar to those of humans [23]. Mo et al. found that older adults perceived social robots with in-family attributes (i.e., where the robot is viewed as a member of the older adult’s family) as more social and trustworthy than those without family attributes (i.e., where the robot is not a member of the older adult’s family) [24]. Zhao and Rau found that young people preferred a voice assistant that combined both work and life functions rather than one that handled either item separately [13]. Biermann et al. found that robots with a technical appearance with only the necessary mechanical structures were trusted more than anthropomorphic robots or robots in the care context [25].

In the field of health, people may place special trust in “experts”. A study found that consumers chose expert labels as the most- or second-most trustworthy among food product labels [26]. In China, doctors are regarded as experts and rarely challenged by patients [27]. In this study, we use real-life experience as a metaphor to design voice assistants as doctors, and it can be assumed that the “professional background” of the voice assistant may affect the user experience evaluation of the voice assistant. Thus, we propose the following hypothesis:

H2: The anthropomorphic setting of a voice assistant (professional background vs. ordinary occupation) has a significant effect on the trust, acceptance, and mental workload of older adults in the health-related context.

Effect of individual differences

Older adults’ attitudes and behaviours towards voice assistants may also be affected by individual differences, such as age, income, living situation and prior experience with technology. Attitudes towards voice assistants may change as adults grow older and their needs change [28]. Low-income older adults who use voice assistants like the ability to by notified regarding harmful events, such as changes in health indicators [29]. The lonelier that an older adult is, the more likely he or she is to perceive a voice assistant as human, and as the older adults increasingly anthropormorphise the voice assistant, their loneliness decreases [4]. Ghorayeb et al. found that among older adults who have already tried smart home monitoring technologies, acceptance increases over time and with use [30]. Such adults expressed fewer concerns than inexperienced smart home participants regarding privacy, trust, and usability and more concerns regarding utility [30]. Therefore, investigating the effect of participants’ individual differences (e.g., age, gender, living situation, prior technology use) on the user experience of health-related voice assistants is of interest to us. Thus, we propose the following:

H3: Individual differences among older adults (for example, age, gender, living situation, and prior technology use) have a significant effect on trust, acceptance, and mental workload in the health-related context.

Materials and methods

Experimental design

A 2 (communication style: social- vs. task-oriented; between-subject)\(\times\) 2 (anthropomorphic setting: ordinary profession vs. medical background; within-subject) mixed design was adopted. Demographic variables (i.e., age, gender, living situation, use experience of voice assistant, and frequency of using digital products) of the older adults were set as covariates. The sample size required to achieve a statistical power of 0.80 should not be less than 98 (effect size f = 0.25, \(\alpha\) = 0.05, 1-\(\beta\) = 0.8, number of groups = 2, number of measurements = 2, Corr among rep measures = 0.5), as calculated using G*Power 3.1 [31].

Regarding anthropomorphic setting, the voice assistant wore an introduction card (introducing the voice assistant’s work address, title and occupation). This study used cartoon characters for the voice assistants so that the participants could sense their occupational differences directly through the different uniforms. By reviewing the relevant literature [22, 32], we found that users think that female voices are amiable and gentle. To control for the gender variables of the voice assistant in the experiment, female characters were selected. Chinese voice assistant devices generally use Xiao + words as the wakeup words. For example, the wakeup word for the voice assistant produced by Baidu is XiaoDu, while that for the voice assistant produced by Xiaomi is Xiaoai. In addition, in terms of the "name" itself, the "Xiao + word" name is the most popular among users [33]. Because Li is a common Chinese surname, the wakeup word for the voice assistant was set as Xiao Li in the experiment. We used the name Xiao Li for the voice assistant with an ordinary profession and the name Doctor Li for the voice assistant with a medical background. The anthropomorphic settings of the voice assistants are presented in Fig. 1.

Fig. 1
figure 1

Anthropomorphic settings for voice assistants (a) Xiao Li and (b) Doctor Li

The task-oriented communication style of the voice assistant was formal. Its interaction with the user was limited to providing instructions and other information to help the user complete a task. There was no social conversation beyond the initial greeting of the initial wakeup call. In contrast, the social-oriented communication style of the voice assistant was conversational. In addition to providing necessary guidance and information, the voice assistants maintained informal conversations through small talk, questions, exclamatory feedback, and encouragement [4]. We obtained common questions regarding relevant scenes from older adults in the community and then searched the web for relevant questions to obtain answers. First, we extracted task-oriented answers according to the requirements and then transformed the task-oriented answers into social-oriented ones by adding small talk, caring statements, and mood words. Seven participants (who did not participate in the formal experiment) participated in the pre-experiment, which verified that the experiment could be completed smoothly and the script content and experiment process correctly understood. An example of a conversation is presented in Table 1.

Table 1 Design of the voice assistant interaction (Task Script Example: Diet)

Apparatus

We converted the script to Portable Document Format (PDF) and then implemented the reading function using the native language package of Microsoft Edge (Microsoft Xiaoxiao Online (Natural)). The recording sounds like the voice of a young woman. Subsequently, we used recording software to record all the scripts.

The “Wizard of Oz” method was applied. The participants interacted with a computer that they believed to be autonomous but that was actually operated by a person [34]. This method has been extensively used in human–computer interaction studies [13, 35]. The “Wizard of Oz” method was used instead of interacting with an off-the-shelf voice assistant for the following reasons. First, the accuracy of the off-the-shelf voice assistant is insufficient, and there may be “search bubbles” during the interaction [36]. Second, we set the communication style as a between-subject variable and designed the dialogue scripts accordingly. Thus, it was more convenient to control the voice assistant using “Wizard of Oz”.

During the experiment, we stored the recording on a computer and played the voice assistant’s voice through a Huawei mini–Bluetooth speaker (Model CM510, Huawei Technologies Co., China) when interacting with older adults. The experimental setup is shown in Fig. 2.

Fig. 2
figure 2

Experimental setup

Participants

One hundred and six older adults (61–84 years old) were recruited between January and February 2021 through word of mouth from communities in Panzhihua City (urban), Sichuan Province, China. In 2021, the city's domestic product (GDP) was 113.395 billion yuan (1 yuan = 0.147 US dollars), which ranked 65th among China's 333 mainland prefecture-level cities [37]. The per capita GDP of Panzhihua City is 93,500 yuan, while the per capita GDP of China is 80,976 yuan [38]. The inclusion criteria were age ≥ 60 years, living independently in the community, able to communicate in Mandarin Chinese or Sichuanese and not having visual and hearing impairments. There was no professional psychological screening. However, at the beginning of the experiment, the experimenter communicated with the participants to confirm that the participants had relatively healthy and normal psychological cognitive function. As noted, an exclusion criterion was the inability to communicate verbally in Mandarin Chinese or Sichuanese. We recruited a heterogeneous group of older adults of different ages, living situations, and technology use experience. The study’s ethics were approved by the university of the authors.

Tasks and procedure

The experimental procedure lasted 25 min for each participant. First, the participants signed consent forms and completed a questionnaire eliciting demographic information, including age, gender, living situation, voice assistant use experience, and weekly use of digital products. Next, the experimenter showed the participant how to use the voice assistant. Subsequently, each participant performed three tasks that required interacting with the voice assistant. Each task lasted approximately two min, and all participants were required to read the script shown in Table 1 (Diet), Appendix Table 1 (Exercise) and A2 (Medical consultation). The participants asked the voice assistant the same set of questions. The three tasks were as follows:

Task 1: Diet

The older adults were instructed to ask the voice assistant questions related to food, including food combinations, therapeutic effect, and the context in which food is consumed. Finally, the older adults provided corresponding responses based on the answers provided by the voice assistant.

Task 2: Exercise

The older adults asked the voice assistant questions related to their fitness. Subsequently, they asked the voice assistant how to perform a chest expansion exercise. Thereafter, they followed the instructions of the voice assistant and performed the chest expansion exercise according to the provided rhythm.

Task 3: Medical consultation

The older adults asked the voice assistant questions regarding cold symptoms, suggestions for medical treatment after the illness, and preventive care. Finally, the older adults furnished corresponding responses based on the answers provided by the voice assistant.

To reduce the influence of order, the appearance of Xiao Li and Doctor Li was set to occur randomly. Subsequently, the participants completed a questionnaire on trust, acceptance, and mental workload. Finally, a short interview of the participants was conducted, comprising the following questions: (1) Do you have any other questions for Xiao Li/Doctor Li? (2) Whom do you trust more, Xiao Li or Doctor Li? Why? (3) Do you have any suggestions or comments on the voice assistant? The experimental procedure is shown in Fig. 3.

Fig. 3
figure 3

Experimental procedure

Mengyao Ma was the experimenter and interviewer (23-year-old female), and Runting Zhong trained the interviewer. As there was only one experimenter, we can ensure the consistency of the experiment. Prior to the experiment, the team discussed and categorised the responses provided by the older adults. During the experiment, the experimenter (Mengyao Ma) recorded the interview data using videos and notes. After the experiment, the interview data were first independently coded and classified into categories by the two research team members (Runting Zhong and Mengyao Ma). Then, the team members met and compared their coding differences. In the event of disagreement, the two team members discussed their reasoning and reached a consensus. Finally, the team counted the frequency and percentages of the participants’ opinions for each category using Microsoft Excel. The open-ended responses were analysed using magnitude coding, a process that quantifies the answers and highlights the most frequent comments [39, 40].

Measures

All measurement items are provided in the Appendix Table 3. Trust was assessed using an 8-item trust scale with two subscales to address the trust of users in the ability and integrity of the voice assistant [14]. In addition, technology acceptance was assessed using a 5-point Likert scale (1 = totally disagree, 5 = totally agree) and included perceived ease of use, perceived usefulness, ease of learning, and intention to use [41,42,43,44]. Mental workload was assessed with the National Aeronautics and Space Administration – Task Load Index (NASA-TLX). The index has six major components (mental, physical and temporal demands, frustration, effort and performance) [45]. We used a 5-point Likert scale to measure each of the six components. An example question of the NASA-TLX questionnaire is as follows: “It takes brain power to complete the task”. Cronbach’s \(\alpha\) for each dimension ranged from 0.547 to 0.879. For an exploratory study, a Cronbach’s \(\alpha\) of over 0.6 is considered acceptable [46].

Statistical analysis

Multivariate analysis of variance (MANOVA) was conducted to examine the effects of communication style and anthropomorphic setting. The within-subject variable was anthropomorphic setting (Xiao Li vs. Doctor Li), while the between-subject variable was communication style (social- vs. task-oriented). In addition, the covariates were five demographic variables (age, gender, living situation, use experience of voice assistant, and frequency of using digital products), which were included in the first MANOVA model. Among the demographic variables, frequency of using digital products significantly impacted the dependent variables. Therefore, we added frequency as an independent variable in a subsequent test. A p value < 0.05 was considered to indicate a significant difference. The effect sizes were calculated using the partial \({\eta }^{2}\). A partial \({\eta }^{2}\) of 0.01 was considered small, 0.09 medium, and 0.25 large [43, 47]. Statistical analysis was conducted using IBM SPSS version 26.0 (IBM, Armonk, NY, USA). Furthermore, the interview data regarding the recommendations by the participants were analysed using content analysis.

Results

Descriptive analysis

One hundred and six older Chinese adults (mean age = 71.8 years, SD = 4.6 years) participated in the study. The demographic information of the participants is presented in Table 2. There were no significant differences in demographic information (gender, age, living situation, use experience of voice assistant, frequency of using digital products) between the social-oriented and task-oriented groups (ps > 0.05). Fifteen participants (14.2%) could not read the script. Therefore, the researcher read the script for these participants, who repeated what they heard. None of the participants had previously used a smart speaker. However, 22 participants (20.8%) had used similar products, such as voice assistants for smartphones, automobiles, or televisions. In the subsequent analysis, we defined the weekly frequency of using digital devices as follows: “never use” was “low”; “1–7 times per week” was “medium”; “every day” was “high”.

Table 2 Demographic information of the participants (N = 106)

Trust in ability

As presented in Table 3, a significant interaction effect between communication style and frequency was observed for trust in ability: F (2, 100) = 3.349, p = 0.039, \({\eta }^{2}\) = 0.063. Therefore, a simple main effect analysis is applied. For the social-oriented communication style, frequency had a significant effect on trust in ability: F (2, 100) = 4.638, p = 0.012, \({\eta }^{2}\) = 0.085. In addition, a Bonferroni post hoc test revealed that participants who used digital products daily had significantly higher trust in ability than those who used them 1–7 times per week (p = 0.019). Similarly, for the task-oriented communication style, frequency had a significant effect on trust in ability: F (2, 100) = 24.872, p < 0.001, \({\eta }^{2}\) = 0.332. Significant differences between frequency groups were observed (low vs. medium, p < 0.001; low vs. high, p < 0.001; medium vs. high, p = 0.011).

Table 3 Trust in ability

Trust in integrity

As presented in Table 4, a significant interaction effect between communication style and frequency was observed for trust in integrity: F (2, 100) = 12.49, p < 0.001, \({\eta }^{2}\) = 0.200. Therefore, a simple main effect analysis is applied. For the social-oriented communication style, frequency had no significant effect on trust in integrity: F (2, 100) = 0.656, p = 0.521, \({\eta }^{2}\) = 0.013. In contrast, for the task-oriented communication style, frequency had a significant effect on trust in integrity: F (2, 100) = 31.033, p < 0.001, \({\eta }^{2}\) = 0.383. Significant differences between the frequency groups of older adults were observed (low < medium < high; ps < 0.001), indicating that participants with a higher frequency of using digital devices had higher trust in the integrity of the health-related voice assistant.

Table 4 Trust in integrity

Technology acceptance

As presented in Table 5, we observed a significant interaction effect between communication style and frequency for trust in technology acceptance: F (2, 100) = 5.842, p = 0.004, \({\eta }^{2}\) = 0.105. For the social-oriented communication style, frequency had a significant effect on acceptance: F (2, 100) = 5.240, p = 0.007, \({\eta }^{2}\) = 0.095. The post hoc analysis revealed that participants who used digital products daily had significantly higher trust in the ability of the voice assistant than those who used them 1–7 times per week (p = 0.005). In addition, for the task-oriented communication style, frequency had a significant effect on acceptance: F (2, 100) = 23.579, p < 0.001, \({\eta }^{2}\) = 0.320. Significant differences between the groups were observed (low vs. medium, p < 0.001; low vs. high, p < 0.001; medium vs. high, p = 0.026).

Table 5 Technology acceptance

Furthermore, there was a significant interaction effect between anthropomorphic setting and frequency: F (2, 100) = 6.86, p = 0.002, \({\eta }^{2}\) = 0.121. For the anthropomorphic setting Xiao Li, frequency had a significant effect on acceptance: F (2, 100) = 23.277, p < 0.001, \({\eta }^{2}\) = 0.318. Post hoc analysis revealed significant differences between groups (low vs. medium, p = 0.011; low vs. high, p < 0.001; medium vs. high, p < 0.001). Moreover, for the anthropomorphic setting Doctor Li, frequency had a significant effect on acceptance: F (2, 100) = 23.579, p < 0.001, \({\eta }^{2}\) = 0.168. Participants who used digital products every day had significantly higher acceptance than those who never used digital products (p = 0.009) or used digital products 1–7 times per week (p < 0.001).

Mental workload

As presented in Table 6, no significant interaction effects were observed between variables. The anthropomorphic setting had a significant effect on mental workload: F(1, 100) = 11.575, p = 0.001, \({\eta }^{2}\) = 0.104. Participants interacting with Xiao Li had a significantly higher mental workload than those interacting with Doctor Li.

Table 6 Mental workload

Interview analysis

Questions for Xiao Li/Doctor Li

Thirty-eight older adults participated in the interviews. Interviewees were randomly selected by the study team based on the participant's interests. Regarding questions for Xiao Li/Doctor Li, the participants mainly disclosed their physical conditions and diseases to obtain information on drugs, treatments, and nursing from the voice assistant. Participants mentioned that they wanted to ask Xiao Li/Doctor Li additional questions about their health conditions, such as hypertension. An example quote was, “What medicine should I take for hypertension? What shall I do?” The most frequently mentioned diseases were hypertension (7 participants), colds (5), gastrointestinal disease (4), eye disease (4), lumbar disease (3), leg trouble (2), postoperative care (2), insomnia (2), and COVID-19 (1). Regarding the illness-related information they provided, the participants mostly offered full details, and they even described certain illnesses in detail, including how they were contracted.

More trustworthy anthropomorphic setting

Eight participants mentioned that the voice assistant was very useful, and six considered Doctor Li more trustworthy than Xiao Li. One participant stated, “Doctors must be professional, and I certainly trust them more.” Another participant stated, “I think Doctor Li is more professional than Xiao Li. After all, she graduated from a medical university and has experience.” However, two participants believed that there was no difference between the two voice assistants because their answers were relatively fixed.

Suggestions for voice assistant developers

Four participants hoped that voice assistant developers would pay more attention to health care in the future: “We are old now with pain all over the body, and I hope the developers of this [voice assistant] will pay more attention to health care.” Three participants suggested that the voice assistant should be combined with real hospitals and doctors: “It would be more convincing if a voice assistant can simulate face-to-face consultation with a doctor, tones, and words.”

Participants provided suggestions regarding technological issues. Three participants desired their voice assistants to be more portable and able to be used without the internet. In addition, three participants noted the problem that the current range of use was not sufficiently wide: “It would be great if the voice assistant could answer questions about the instructions for digital equipment, how to maintain electrical appliances, and how to operate digital equipment needed in travel.” Considering that the memory of older adults declines rapidly, two participants suggested that the voice assistant could print the answers on paper to enable forgetful people to review the answers. Furthermore, one participant suggested a demonstration function: “When encouraging an old man to exercise, a character could be simulated to exercise with the older adults to increase interest in the activity.”

Discussion

General discussion

Social-oriented communication style is preferred in the health context

Regarding the effect of communication style, the participating older adults evaluated the social-oriented voice assistant significantly higher in trust in ability, integrity, and acceptance. Therefore, H1 was partly supported. The results of the current study differ from the previous studies [14, 48]. A possible reason is that the previous studies were conducted in the context of shopping, while this study was conducted in a health-related context. A social-oriented communication style may help older adults experience the impression of caring and empathy, thus gaining a more positive evaluation. In addition, a social-oriented communication style uses small talk, questions, feedback, and encouragement, which may result in the impression that the voice assistant is providing more information, such as showing a friendly attitude or offering a positive response to older adults.

There are differences between interacting with a voice assistant and interacting with real doctors. In the IoT environment, Chinese users wish to be dominant over things and to establish feelings of superiority and authority, which would indicate the user’s absolute control over things. The user is the “boss”, and the things are “obedient and smart subordinates” that provide the user support [49]. However, in real patient-doctor communication, the responses of patients to decision-making and information sharing are distinctively doctor-centred [50]; i.e., the doctor is the dominant individual in the communicative exchange. This phenomenon has an important implication for designing elderly friendly IoT health-related services. That is, when designing an IoT health-related service for older Chinese, the conversation design should differ from that of real patient-doctor communication, and a social-oriented communication style should therefore be preferentially considered.

Voice assistants with medical backgrounds are preferred in a health context

Regarding the effect of anthropomorphic setting, the analysed older adults evaluated the voice assistant with a medical background (i.e., Doctor Li) more positively. That is, they had higher trust in Doctor Li’s ability and integrity as well as higher technology acceptance and a lower mental workload with Doctor Li, although the communication content was consistent with that used for the other assistant. Therefore, H2 was supported. According to [51], people more easily accept robots that are in line with their stereotypes regarding such devices. This finding indicates that a stereotype of real-life experiences may also be generalised to an anthropomorphic setting in a smart environment. Older adults would trust an “expert” more in the smart environment in terms of communicating regarding health-related issues. The anthropomorphic setting of the “doctor” caused the participating older adults to feel that this voice assistant was more professional. These results confirm that the use of stereotypes that correspond to a specific context is an effective way to improve the trust of older adults in voice assistants.

This study found that the participating older adults had a higher level of trust in health-related voice assistants. The participants were willing to disclose detailed personal information, such as the type of disease they were suffering from. However, considered from another perspective, overtrust may also raise privacy concerns. Overtrust means that individuals blindly follow the voice assistant or take risks because they believe the voice assistant can perform functions that it does not have [52]. Therefore, it is necessary to include privacy protection mechanisms in the design of products and services for older adults, such as reminders regarding them how to answer, appropriate behaviour, and avoiding risky behaviour.

Older adults with a higher frequency of using digital products evaluated voice assistants more positively

This study also found that older adults with a higher frequency of using digital products evaluated the voice assistants more positively. That is, they had higher trust in the ability and integrity of voice assistants and exhibited higher technology acceptance. Age, gender and living situation did not significantly influence user evaluation. Therefore, H3 was partly supported. This result is in line with Ghorayeb et al. [30]. For most older adults, a voice assistant is a new IoT technology. Therefore, a lack of prior experience in using digital products may cause older adults to be doubtful about the performance of voice assistants. Thus, for older adults, previous experience using digital devices is beneficial for the adoption of voice assistants in the first interaction. To bridge the digital divide between older adults and youth, it is necessary for older adults to take the first step in using a voice assistant. In practice, sharing experiences with peers and children could serve as an important way to enhance the understanding of and trust in voice assistants of older adults.

Implications

This study investigated the attitudes of Chinese older adults towards voice assistants in a health-related context. Specifically, we investigated the effect of communication style and anthropomorphic setting on the trust, acceptance, and mental workload of older adults. The study participants were older Chinese adults who were using voice assistants (smart speakers) for the first time. We found that the participants had a relatively high level of trust in and acceptance of the voice assistant in a health-related context. In addition, the participants held more positive attitudes towards a social-oriented communication style and a voice assistant with a medical background. Furthermore, those participants with a higher frequency of digital product use had a more positive attitude towards the voice assistant. Therefore, when designing a voice assistant for older adults in the health context, it is necessary to create an assistant that requires older adults to interact with it. Adopting a more social-oriented communication style (for example, providing guidance, using small talk, asking questions, and furnishing feedback and encouragement) and providing an anthropomorphic setting for the voice assistant in which the assistant has a medical background (for example, that of a doctor, nurse, or fitness coach) could help older adults trust and accept the voice assistant more. Furthermore, functions such as connecting with real hospitals and doctors, portability, offline accessibility, operation guidance, printing, and a demonstration function are also requirements for older adults. The results of this study reveal the preferences of older adults for voice assistants in an intelligent environment and provide suggestions for improving the user experience of older adults utilizing voice assistants.

Limitations and future studies

Our study has several limitations. First, it only included community-dwelling older adults from a city in China, and the dialogue script was localised according to traditional medical beliefs in terms of diet, daily practices, and medication use [53,54,55]. Therefore, generalisation of the results requires caution. Future studies can examine whether older adults prefer a voice assistant that acts like a traditional medicine doctor rather than a modern medical doctor.

Second, the study used the “Wizard of Oz” method to simulate communication between older adults. The responses of the participating older adults may differ from those in a realistic setting. Future studies should validate the conclusions of this study in an actual smart environment.

Third, we investigated the effect of communication style and anthropomorphic setting in a health care context. Future studies could examine whether the effect of communication style and anthropomorphic setting could be generalised to other use contexts, such as educational or daily life contexts.

Conclusions

This study investigated the effect of the communication style (socially-oriented vs. task-oriented) and anthropomorphic setting (ordinary profession vs. medical background) of a voice assistant on the trust (ability and integrity), acceptance, and mental workload of older adults using such an assistant. The results indicated that older adults preferred a social-oriented voice assistant and a voice assistant with a medical background. In addition, older adults with more digital product experience evaluated voice assistance more positively. Furthermore, the analysed older adults identified functions such as connecting with real hospitals and doctors, portability, offline accessibility, operation guidance, printing, and demonstration functions as requirements. The results suggest that when designing a voice assistant for older adults, using a social-oriented communication style and providing an anthropomorphic setting in which the voice assistant has a medical background are effective ways to improve the trust and acceptance of voice assistants. Our results have implications for designers seeking to improve the acceptability and trust of older adults and support ageing in place. 

Availability of data and materials

The materials and data to support the study findings are available from the first author [RZ] upon request.

Abbreviations

IoT:

Internet of things

MANOVA:

Multivariate analysis of variance

NASA-TLX:

National Aeronautics and Space Administration-Task Load Index

PDF:

Portable Document Format

GDP:

Gross domestic product

M:

Mean

SD:

Standard deviation

SE:

Standard error

CI:

Confidence interval

VA:

Voice assistant

P:

Participant

References

  1. Li Z, Rau PLP. Talking with an IoT-CA: Effects of the Use of Internet of Things Conversational Agents on Face-to-Face Conversations. Interact Comput. 2021;33:238–49.

    Article  Google Scholar 

  2. Pradhan A, Lazar A, Findlater L. Use of Intelligent Voice Assistants by Older Adults with Low Technology Use. ACM Transactions on Computer-Human Interaction. 2020;27:1–27.

  3. Kim S. Exploring how older adults use a smart speaker-based voice assistant in their first interactions: qualitative study. JMIR Mhealth Uhealth. 2021;9:e20427.

    Article  Google Scholar 

  4. Jones VK, Hanus M, Yan C, Shade MY, Blaskewicz Boron J, Maschieri BR. Reducing Loneliness Among Aging Adults: The Roles of Personal Voice Assistants and Anthropomorphic Interactions. Front Public Health. 2021;9:750736.

    Article  Google Scholar 

  5. Luo Y, Lee B, Choe EK. TandemTrack: Shaping Consistent Exercise Experience by Complementing a Mobile App with a Smart Speaker. In: CHI ’20: CHI Conference on Human Factors in Computing Systems. 2020.

  6. Jansons P, Fyfe J, Via JD, Daly RM, Gvozdenko E, Scott D. Barriers and enablers for older adults participating in a home-based pragmatic exercise program delivered and monitored by Amazon Alexa: a qualitative study. BMC Geriatr. 2022;22:1–10.

    Article  Google Scholar 

  7. Chen K. Use of gerontechnology to assist older adults to cope with the covid-19 pandemic. J Am Med Dir Assoc. 2020;21:983–1003.

    Article  Google Scholar 

  8. Rheu M, Ji YS, Wei P, Huh-Yoo J. Systematic review: trust-building factors and implications for conversational agent design. International Journal of Human-Computer Interaction. 2021;37:81–96.

    Article  Google Scholar 

  9. Mayer RC, Davis JH, Schoorman FD. An integrative model of organizational trust. Acad Manag Rev. 1995;20:709–34.

    Article  Google Scholar 

  10. Koon LM, McGlynn SA, Blocker KA, Rogers WA. Perceptions of digital assistants from early adopters aged 55+. Ergonomics in Design. 2020;28:16–23.

    Article  Google Scholar 

  11. Dehais F, Lafont A, Roy R, Fairclough S. Neuroergonomics Approach to Mental Workload, Engagement and Human Performance. Frontiers in Neuroscience. 2020;14:268.

    Article  Google Scholar 

  12. Wickens CD. Multiple resources and mental workload. Hum Factors. 2008;50:449–55.

    Article  Google Scholar 

  13. Zhao J, Rau P-LP. Merging and synchronizing corporate and personal voice agents: Comparison of voice agents acting as a secretary and a housekeeper. Computers in Human Behavior. 2020;108:106334.

    Article  Google Scholar 

  14. Chattaraman V, Kwon WS, Gilbert JE, Ross K. Should AI-Based, conversational digital assistants employ social- or task-oriented interaction style? A task-competency and reciprocity perspective for older adults. Comput Hum Behav. 2019;90:315–30.

    Article  Google Scholar 

  15. Sun N, Rau PLP, Zhong R. Understanding patients’ trust toward community healthcare providers under different primary care settings in China: Impact of hospital ownership and registration mode. Human Factors and Ergonomics in Manufacturing & Service Industries. 2019.

  16. Torre I, Goslin J, White L. If your device could smile: People trust happy-sounding artificial agents more. Comput Hum Behav. 2020;105:106215.

    Article  Google Scholar 

  17. Kim Y, Kwak SS, Kim M. Am I acceptable to you? Effect of a robot’s verbal language forms on people’s social distance from robots. Comput Hum Behav. 2013;29:1091–101.

    Article  Google Scholar 

  18. Bickmore T, Cassell J. Social Dialongue with Embodied Conversational Agents. USA: Northeastern University; 2005.

    Book  Google Scholar 

  19. Yoo W, Kim SY, Hong Y, Chih MY, Shah DV, Gustafson DH. Patient-clinician mobile communication: analyzing text messaging between adolescents with asthma and nurse case managers. Telemedicine journal and e-health : the official journal of the American Telemedicine Association. 2015;21:62.

    Article  Google Scholar 

  20. Rau P-LP, Guo Z, Zhong R, Kim S. Chinese Pain Descriptors Used by Medical Personnel: A Case Study in Beijing. In: Rau P-LP, editor. Cross-Cultural Design. Applications in Health, Learning, Communication, and Creativity. Cham: Springer International Publishing; 2020. p. 176–90.

  21. Bradley G, Sparks B, Nesdale D. Doctor communication style and patient outcomes: gender and age as moderators. J Appl Soc Psychol. 2001;31:1749–73.

    Article  Google Scholar 

  22. Mitchell WJ, Ho C-C, Patel H, MacDorman KF. Does social desirability bias favor humans? Explicit–implicit evaluations of synthesized speech support a new HCI model of impression management. Comput Hum Behav. 2011;27:402–12.

    Article  Google Scholar 

  23. Li D, Rau PLP, Li Y. A Cross-cultural Study: effect of robot appearance and task. Int J Soc Robot. 2010;2:175–86.

    Article  Google Scholar 

  24. Mo F, Zhou J, Yi S. How to Enhance Intergenerational Communication? The Influence of Family Orientation and Generation When Using Social Robots as an Intermediary. In: International Conference on Human Aspects of IT for the Aged Population. Springer, Cham; 2017. p. 348–59.

  25. Biermann H, Brauner P, Ziefle M. How context and design shape human-robot trust and attributions. Journal of Behavioral Robotics. 2021;12:74–86.

    Article  Google Scholar 

  26. Rupprecht CDD, Fujiyoshi L, McGreevy SR, Tayasu I. Trust me? Consumer trust in expert information on food product labels. Food Chem Toxicol. 2020;137:111170.

    Article  CAS  Google Scholar 

  27. Wang W, Lau Y, Loo A, Chow A, Thompson DR. Medication adherence and its associated factors among Chinese community-dwelling older adults with hypertension. Heart Lung. 2014;43:278–83.

    Article  Google Scholar 

  28. Heerink M, Kröse B, Evers V, Wielinga B. Assessing acceptance of assistive social agent technology by older adults: the almere model. Int J Soc Robot. 2010;2:361–75.

    Article  Google Scholar 

  29. Chung J, Bleich M, Wheeler DC, Winship JM, McDowell B, Baker D, et al. Attitudes and perceptions toward voice-operated smart speakers among low-income senior housing residents: comparison of pre- and post-installation surveys. Gerontology and Geriatric Medicine. 2021;7:23337214211005868.

    Article  Google Scholar 

  30. Ghorayeb A, Comber R, Gooberman-Hill R. Older adults’ perspectives of smart home technology: Are we developing the technology that older people want? - ScienceDirect. International Journal of Human-Computer Studies. 2020;147.

  31. Faul F, Erdfelder E, Lang AG, Buchner A. G*Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behav Res Methods. 2007;39:175–91.

    Article  Google Scholar 

  32. Louwerse MM, Graesser AC, McNamara DS, Lu S. Embodied Conversational Agents as Conversational Partners. Appl Cogn Psychol. 2009;23:1244–55.

    Article  Google Scholar 

  33. Baidu AI Interaction Design Lab. What name will wake you up with? My smart little friend. https://mp.weixin.qq.com/s/SuXKyhqFD4x5Yikbn9OEDg. Accessed 1 Aug 2022.

  34. Hanington B, Martin B. Universal Methods of Design: 100 Ways to Research Complex Problems, Develop Innovative Ideas, and Design Effective Solutions. 2012.

  35. Li Z, Rau P-LP. Effects of Self-Disclosure on Attributions in Human-IoT Conversational Agent Interaction. Interacting with Computers. 2019;31:13–26.

  36. Kowalski J, Jaskulska A, Skorupska K, Abramczuk K, Biele C, Kopeć W, et al. Older Adults and Voice Interaction: A Pilot Study with Google Home. In: Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems. New York, NY, USA: Association for Computing Machinery; 2019. p. 1–6.

  37. In 2021, the per capita GDP of Panzhihua City in Sichuan ranked 65th in the mainland cities, with a serious decline! https://baijiahao.baidu.com/s?id=1726731479292618075&wfr=spider&for=pc. Accessed 18 Aug 2022.

  38. National Bureau of Statistics. National data. https://data.stats.gov.cn/easyquery.htm?cn=C01. Accessed 18 Aug 2022.

  39. Portz JD, Vehovec A, Dolansky MA, Levin JB, Boxer R. The development and acceptability of a mobile application for tracking symptoms of heart failure among older adults. Telemed J E Health. 2018;24:161–5.

    Article  Google Scholar 

  40. Zhong R, Rau P-LP. A mobile phone–based gait assessment app for the elderly: development and evaluation. JMIR Mhealth Uhealth. 2020;8:e14453.

  41. Davis FD, Bagozzi RP, Warshaw PR. User acceptance of computer technology: a comparison of two theoretical models. Manage Sci. 1989;35:982–1003.

    Article  Google Scholar 

  42. Renaud K, Van Biljon J. Predicting technology acceptance and adoption by the elderly : a qualitative study. In: Research Conference of the South African Institute of Computer Scientists & Information Technologists on It Research in Developing Countries: Riding the Wave of Technology. 2008. p. 210–9.

  43. Venkatesh V, Morris MG, Davis GB, Davis FD. User acceptance of information technology: toward a unified view. MIS Q. 2003;27:425–78.

    Article  Google Scholar 

  44. Wang L. Variables contributing to older adults’ acceptance of information technology in China, Korea and USA. Doctoral Dissertation. Tsinghua University; 2010.

  45. Hart SG, Staveland LE. Development of NASA-TLX (Task Load Index): Results of Empirical and Theoretical Research. North-Holland: Elsevier Science Publisher B. V; 1988.

    Google Scholar 

  46. Nunnally J, Bernstein I. Psychometric Theory. New York: McGraw-Hill Publishing Company; 1994.

    Google Scholar 

  47. Cohen J, Cohen P, West SG, Aiken LS. Applied multiple regression/correlation analysis for the behavioral sciences. Third Edition. Routledge; 2013.

  48. Whang C, Im H. “I Like Your Suggestion!” the role of humanlikeness and parasocial relationship on the website versus voice shopper’s perception of recommendations. Psychol Mark. 2021;38:581–95.

    Article  Google Scholar 

  49. Rau P-LP, Huang E, Mao M, Gao Q, Feng C, Zhang Y. Exploring interactive style and user experience design for social web of things of Chinese users: A case study in Beijing. A case study in Beijing International Journal of Human-Computer Studies. 2015;80:24–35.

    Article  Google Scholar 

  50. Ting X, Yong B, Yin L, Mi T. Patient perception and the barriers to practicing patient-centered communication: A survey and in-depth interview of Chinese patients and physicians. Patient Educ Couns. 2016;99:364–9.

    Article  Google Scholar 

  51. Tay B, Jung Y, Park T. When stereotypes meet robots: The double-edge sword of robot gender and personality in human–robot interaction. Comput Hum Behav. 2014;38:75–84.

    Article  Google Scholar 

  52. Chen J. The overtrust in human-robot interaction. Adv Psychol. 2020;10:1842–6.

    Article  Google Scholar 

  53. Kennedy BM, Rehman M, Johnson WD, Magee MB, Leonard R, Katzmarzyk PT. Healthcare providers versus patients’ understanding of health beliefs and values. Patient Experience Journal. 2017;4:29–37.

    Article  Google Scholar 

  54. Sun N, Rau PLP, Li Y, Owen T, Thimbleby H. Design and evaluation of a mobile phone-based health intervention for patients with hypertensive condition. Comput Hum Behav. 2016;63:98–105.

    Article  Google Scholar 

  55. Zhong R, Rau PLP, Li S. Measuring Chinese Medical Beliefs of Chinese Adults. Toronto: Springer, Cham; 2016. p. 729–36.

Download references

Acknowledgements

We would also like to thank all the participants who volunteered in this study.

Funding

This study is supported by the National Natural Science Foundation of China 72101100, and the Philosophy and Social Science Program of Jiangsu Province 2020SJA0861.

Author information

Authors and Affiliations

Authors

Contributions

Both authors contributed to the study conception and design. Material preparation, data collection and analysis were performed by [RZ] and [MM]. The first draft of the manuscript was written by [RZ]. Both authors revised the manuscript. Both authors read and approved the final manuscript.

Corresponding author

Correspondence to Runting Zhong.

Ethics declarations

Ethics approval and consent to participate

Ethical approval was approved by the ethics committee of Department of Social Sciences, Jiangnan University. All methods were carried out in accordance with the Declaration of Helsinki. Written informed consent was obtained from each participant before the experiment. All participants’ information was kept confidential.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table A1.

Design of the voice assistant interaction (Task Script: Exercise) Table A2. Design of the voice assistant interaction (Task Script: Medical consultation) Table A3. Questionnaires used in this study

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhong, R., Ma, M. Effects of communication style, anthropomorphic setting and individual differences on older adults using voice assistants in a health context. BMC Geriatr 22, 751 (2022). https://doi.org/10.1186/s12877-022-03428-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12877-022-03428-2

Keywords