On the Influence of Multi-cue User Interfaces in Eliminating Cognitive Load of Repetitive Tasks in Augmented Reality

doi:10.21203/rs.3.rs-4778962/v1


Research Article


This work is licensed under a CC BY 4.0 License

Version 1


Multi-cue user interfaces (MCUIs) are a typical set of graphical user interfaces that are commonly used in augmented reality because they simultaneously encode multiple visual cues into interaction tasks. Previous investigations have shown the advantageous impacts of MCUIs on usability for one-time tasks. With the increasing prevalence of MCUIs in industrial environments, it is imperative to investigate the influence of MCUIs on users’ cognitive load for repetitive tasks. We replicated the repetitive processes of sorting parcels in modern warehouses and conducted a series of empirical investigations to assess the influence of MCUIs on cognitive load and perceived usability. The results showed the effectiveness of MCUIs in reducing cognitive load and physical fatigue and revealed correlations between the participants’ task efficiency and the quantity of visual cues. In addition, the study revealed a positive impact of MCUIs on overall perceived usability, although two specific aspects, ease of use and overall satisfaction, were not affected.

multi-cue user interface

cognitive load

augmented reality

repetitive task

usability study

Multi-cue user interfaces (MCUIs), prevalent in augmented reality, encode a variety of visual cues to convey information pertinent to ongoing tasks and are spatially integrated within the augmented environment. Augmented reality, which melds virtual and physical elements by overlaying computer-generated graphics onto real-world objects (Poupyrev et al. 2002), facilitates the concurrent display of MCUIs and geographic information. The adaptability of MCUIs in presenting task-relevant data enhances user perception and task performance. For instance, MCUIs support task execution by effectively associating and presenting interaction-related information (Eswaran et al. 2023). Consequently, MCUIs have been widely applied in industrial scenarios, such as Boeing’s airplane assembly (Al-Ameen et al. 2015) and parcel sorting in contemporary warehouses (Hou et al. 2013, Merenda et al. 2018, Radkowski et al. 2015), and their efficacy in diverse contexts has been substantiated (Xie et al. 2022).

Nonetheless, MCUIs come with side effects, such as information overload (Karelaia 2006) and perceptual distraction (Gil et al. 2018). The presentation of multiple cues can increase cognitive load, because users must locate and interpret all relevant cues within their field of view (Moon and Ryu 2021). Additionally, the dispersed arrangement of cues in augmented spaces can complicate spatial navigation during information retrieval tasks (Steffen et al. 2019). While prior research has highlighted these challenges and their impact on cognitive performance in short and one-off tasks (Gibson 2017, Pusch and Lécuyer 2011, Zhou and Jia 2023), the effect of MCUIs on cognitive load for repetitive tasks remains underexplored.

Given that many industrial tasks, such as parcel sorting in warehouses, are repetitive and involve operations performed at regular intervals (Mathiassen et al. 2003), understanding the influence of MCUIs on cognitive load is crucial. Over time, these tasks can lead to physical fatigue and a gradual decline in cognitive functions like learnability and memory (Serafini and Sanz 2016). Although previous studies have developed tools to mitigate these effects, such as instant indication interfaces (Kim et al. 2020), they have not adequately addressed the specific impact of MCUIs on cognitive load and related dimensions, including work efficiency and overall satisfaction (Hou et al. 2013, Radkowski et al. 2015). Therefore, with the growing adoption of MCUIs in industry, it is imperative to investigate their influence on cognitive load for repetitive tasks.

To bridge this gap, our study introduces the MCUI prototype tailored for repetitive tasks, replicating warehouse parcel sorting processes. We conducted empirical research to assess the MCUIs' specific impact on cognitive load and related factors like perceived usability. The study's contributions are two-fold: it elucidates the effect of MCUIs on cognitive load during repetitive tasks using robust eye-tracking data and examines the perceived usability of MCUIs in an industrial setting, providing actionable insights for MCUI design in augmented reality.

The paper is structured as follows: Section 2 reviews literature on MCUIs, augmented reality, repetitive tasks, and perceived usability. Section 3 details the methodology, including experimental specifications and MCUI functionalities. Section 4 analyses the data and presents the findings. Section 5 concludes with the study's main insights and implications for future research.

2.1 Augmented reality and spatially distributed user interface

Augmented reality (AR) superimposes computer-generated graphics onto the real world (Brannon Barhorst et al. 2021), facilitating interaction with both virtual and physical artifacts (Maiti et al. 2018). AR's immersive experience is delivered through various devices, including head-mounted displays and smartphones (Spittle et al. 2023). Given its capability for simultaneous interaction with virtual and physical objects, AR is now widely adopted in critical applications, such as military tasks (Mao and Chen 2021), automotive tasks (Pratticò et al. 2021), medical tasks (Venkatesan et al. 2021), and construction tasks (Behzadan et al. 2015).

Owing to its hybrid virtual-physical nature, AR is naturally compatible with user interfaces that distribute multiple visual cues in space. The aforementioned AR applications share the common feature of displaying multiple visual cues that are relevant to the current tasks. Given the large number of virtual and physical objects involved (Devagiri et al. 2022), there is sufficient space, positions, and objects to display the visual cues.

AR aids in repetitive tasks, traditionally characterized by fixed operational processes over extended periods (Cirulis and Ginters 2013, Stoltz et al. 2017). For instance, Ocampo and Tavakoli (2019) compared AR user interfaces for a gaming task with virtual reality interfaces and demonstrated that AR interfaces were more capable of reducing cognitive load. AR can present visual cues for target parcels and related operations. Boeing's use of Microsoft HoloLens to display assembly instructions alongside component information exemplifies this application (Blaga et al. 2021). Head-mounted AR devices facilitate hands-free interaction and allow users to view various cues during repetitive tasks (Papadopoulos et al. 2021).

AR is particularly beneficial for repetitive industrial tasks. The industrial sector increasingly employs AR for tasks like electronic engineering and warehouse parcel sorting (Wang et al. 2020). Alves et al. (2021) developed an AR-based quality control system to enhance efficiency for repetitive tasks in industrial production. Comparative studies have shown AR's superiority for repetitive tasks over other technologies. Loch et al. (2016) compared AR applications with a video-assisted system in a manual repetitive assembly process in terms of performance, user acceptance, and mental workload, and showed that AR was more helpful to users in repetitive tasks.

Integrating multiple visual cues in AR is essential, as users require comprehensive information to navigate the augmented space and interact with the hybrid virtual and physical objects (Heemsbergen et al. 2021). The effectiveness of multiple visual cues for repetitive industrial tasks has been demonstrated in warehouse operations (Egger and Masood 2020) and in maintenance and assembly (Henderson and Feiner 2011). Previous research has examined the performance of workers in the process of parcel sorting within modern warehouses. Yan et al. (2022) investigated whether navigation to the target parcel could be enhanced by various visual cues displayed on an AR headset. A study conducted by Murauer et al. (2018) found that AR feedback improved learning efficiency and reduced the cognitive burden of new workers more effectively than text input. AR has been found to be more effective than other forms of media in terms of lowering cognitive load and boosting task performance (Buchner et al. 2022).

2.2 Multi-cue user interface

Multi-cue user interfaces (MCUIs) refer to user interfaces that encode multiple visual cues to facilitate interactive operations (Silva et al. 2015) (see Fig.1 for an example). MCUIs have been commonly used in conventional desktop and mobile systems, where they are effective for information encoding and comprehension (McNab and Hess 2009, Bertoni et al. 2013). They provide simultaneous points of contact between applications and users (Fani et al. 2022), enhancing the efficiency of information processing.

MCUIs are utilized across virtual reality, wearable device interaction, and body gestural interaction. Wearable MCUI systems, for instance, deliver stimuli for human-machine collaborative work in industrial settings (Fani et al. 2022). In virtual e-commerce, the use of MCUIs can enhance the enjoyment and engagement of online purchasing and consequently increase consumers’ desire to make purchase decisions (Li et al. 2023). Paralinguistic Digital Affordance (PDA) systems utilise MCUIs to mitigate the caustic and contradicting characteristics found in single-cue interfaces (Sumner et al. 2020). When building natural interfaces that utilise dynamic gestures, it is essential to establish multi-cue semantics between trajectories and commands in the realm of human-computer interaction (Golash and Jain 2021). This is further supported by a study on attention-based design and user decision-making, which indicates that display mode and position in an interactive interface affect the user's ability to selectively attend to information (Amin et al. 2021).

In AR, previous studies primarily focused on the positive effects of MCUIs, particularly on the aspects of realism and presence. Fani et al. (2018) developed a fabric-based wearable display for multi-cue delivery that increases realism by adding additional tactile cues. Jin et al. (2022) found that MCUIs had positive effects on user presence and engagement in HMD-based AR. MCUIs also benefit remote collaboration in immersive technologies such as VR, AR, and MR, increasing social presence and task performance while reducing information overload (Kim et al. 2020).

However, the impact of MCUIs on cognitive load and task performance requires further investigation. Research has demonstrated that attention resources are limited (Lindsay 2020). Additionally, the use of MCUIs in AR has been shown to effectively attract user attention (Gong et al. 2022). Nevertheless, several studies showed that cognitive overload can occur with excessive information in learning tasks (Wu et al. 2018, Buchner et al. 2022). Given these findings, the research community has yet to reach a consensus on MCUIs' impact on cognitive load in repetitive AR tasks, and further investigations are needed.

2.3 Cognitive load of MCUIs

Cognitive load refers to the mental resources a person has available for solving problems or completing tasks at a given time (Oviatt 2006). It consists of three main types (Sweller et al. 1998). Intrinsic cognitive load is incurred by the nature of the task being processed, such as its difficulty, and is therefore inherent to the task. Extraneous (extrinsic) cognitive load is usually caused by external factors such as mood, work organisation, time pressure, and environmental noise, and varies with the task requirements. Germane (associated) cognitive load is the load placed on working memory during schema formation and automation. These loads are cumulative, with intrinsic and extraneous loads being performance-based and germane load being learning-based (Schnotz and Kürschner 2007).

Previous studies on cognitive load emphasised that the bottleneck of cognitive load lies in the limited attention and working memory capacity available for information processing (Broadbent et al. 2023, Otermans et al. 2022). Like other factors such as work process, task disruption, time pressure, and work stress, cognitive load is an essential factor that determines workers’ well-being in working environments (Nilsen and Kongsvik 2023). A high level of cognitive load results in psychological overload and reduced working performance throughout longitudinal tasks (Wei et al. 2022), and highly difficult memory tasks negatively influence cognitive functions and consequently learning and work performance (Wang et al. 2021). High cognitive load can also negatively affect assembly task performance, resulting in reduced employee productivity (Biondi et al. 2021). These studies showed the strong relationship between the level of intrinsic cognitive load and cognitive resources (Skulmowski and Xu 2022), as well as its strong influence on employees’ task performance (Malan 2019).

The level of intrinsic cognitive load is heavily influenced by the number of elements that need to be processed at one time, and the interactivity of these elements has a particularly strong impact (Kalyuga 2009, Sweller and Chandler 1994). Presenting too much visual information may overload the learners' cognitive abilities, impairing the information selection and organization process (Albus et al. 2021). At the same time, a single cue is often ambiguous during interactions, which highlights the necessity of multiple cues (Severijnen et al. 2023). Previous studies have shown that using visual, movement (Urakami and Seaborn 2023), gesture (Bai et al. 2020), and speech cues (Chen et al. 2021) provided by machines or humans is more effective in identifying objects than using a single cue. Similar findings reported that multiple cues were practically helpful for users communicating with robots (Hetherington et al. 2021), in terms of understanding interaction intentions and interpreting interaction feedback (Kassem et al. 2022).

In AR, MCUIs display multiple visual cues simultaneously in the hybrid space. The presence of multiple visual cues in the user interface increases the level of complexity, hence affecting the intrinsic cognitive load (Buchner et al. 2022). The complex graphical interface of MCUIs also increases the demands on working memory and the difficulty of task comprehension (Shen et al. 2020), thus negatively affecting the associated (germane) cognitive load (Wang et al. 2023). However, once users become proficient with the interface, the memory burden and task comprehension difficulty for repetitive tasks decrease (Pan et al. 2022), potentially reducing cognitive load.

2.4 Perceived usability of MCUIs

Previous studies showed that MCUIs provided users with rich information about task structures and decision policies, thus improving overall task performance (Amin et al. 2021). MCUIs have a positive effect on operations involving complex information selection (Zhou and Jia 2023). In particular, MCUIs demonstrated a positive effect on complex tasks in which multiple cues were associated with one another and only some were visualised.

According to the cue-summation theory (Persike and Meinhardt 2008), MCUIs facilitate users’ learning by helping them quickly locate target information, and displaying more visual cues at a time enhances the users’ learning gains while supplying rich information to memorise (Blanco et al. 2010). For example, colour and location information were simultaneously displayed as auxiliary cues in a customer account management application (McNab and Hess 2009). MCUIs were also used as an effective decision assistant tool that enhanced the users’ decision capabilities by assessing contextual features (Glöckner 2008). In a feedback-based learning strategy development experiment, more participants chose to adopt a multi-cue strategy (Tilton-Bolowsky et al. 2021). Compared with single-cue user interfaces, which responded fast with concise information in the design of multi-sensory tactile devices, MCUIs demonstrated a richer interaction experience overall (Dunkelberger et al. 2018).

Among systems using AR technology, a past study has shown that perceived usability is the strongest factor influencing users' behavioural intentions when using AR systems, followed by ease of use (Papakostas et al. 2021). Kim et al. (2019) showed that, in AR-simulated warehouse work involving order picking and part assembly, user interface design has a greater impact on usability than the type of head-mounted display.

2.5 Lessons learned and hypothesis development

The preceding review draws a picture of recent studies on AR and MCUIs and their characteristics, such as cognitive load, physical features, and perceived usability. It accentuates the significance and necessity of MCUIs in industrial contexts and exposes the insufficient current understanding of MCUIs. Given the increasing applications of MCUIs, the review highlights the need to investigate whether and how MCUIs influence users’ cognitive load throughout repetitive tasks in AR. Based on the literature review, we propose the following hypotheses.

H1: The multi-cue user interfaces can reduce physical fatigue during repetitive tasks in AR.

H2: The multi-cue user interfaces can reduce cognitive load for repetitive tasks in AR.

H3: The multi-cue user interfaces can improve perceived usability throughout repetitive AR task operations.

The primary aim of this study is to investigate the impact of MCUIs on cognitive load with participants engaged in repetitive tasks in AR. To achieve the goal, we developed the prototypes of MCUIs tailored for parcel sorting activities, which are commonly encountered in warehouses. We conducted empirical research with 40 volunteers recruited openly from the local university. These participants executed a specialised set of parcel sorting tasks while we employed both qualitative and quantitative methods, including eye-tracking and NASA-TLX questionnaires, to evaluate cognitive load and perceived usability.

3.1 Ethical note

The study was approved by the university’s ethics committee, and all procedures were in accordance with the APA guidelines. Participants provided informed consent, and their personal data were anonymized in compliance with privacy requirements.

3.2 Design MCUIs for repetitive tasks in AR

In alignment with the requirements for repetitive tasks in AR, we adopted the parcel sorting task as the main task of the MCUI study. Parcel sorting is a frequent task in modern warehouses. It involves multiple processes, such as scanning and relocating postal parcels of varying sizes and shapes, which need to be performed repetitively over a long period. The parcel sorting tasks can embed multiple visual cues in the AR space, as the MCUIs were integrated mainly in the processes of parcel scanning and relocating and displayed via an AR headset.

The MCUIs were designed with two main components: multi-cue indicators and navigation indicators. The multi-cue indicators were implemented for the parcel picking scenario (Fig.2) and were displayed to indicate the presence of multiple target parcels. These indicators matched the exact shapes of the target parcels and used bright green colours for visual clarity during long repetitive tasks. The navigation indicators were designed in two styles: navigation squares and navigation arrows (Fig.3). Specifically, the navigation squares had the same layout as the shelf slots, and a highlighted green square indicated the target slot. The navigation arrows consisted of four semi-transparent arrows surrounding a circle, and the circle was displayed in front of the target slot (Fig.3 b). Both the multi-cue indicators and navigation indicators used simple geometric shapes such as squares and arrows, which ensured the simplicity and intuitiveness of the MCUIs during long-term repetitive tasks.

3.3 Implementation of MCUIs for repetitive parcel sorting

The MCUIs were developed with Unity 3D and HoloKit, an open-source AR toolkit[1]. To mimic parcel boxes, we printed twelve 100 × 100 × 2.5 mm cardboard squares, each carrying a specifically designed QR code pattern (Fig.4), which stood in for real parcels of various shapes, sizes, and weights. These squares could be quickly placed into a slot of the shelf. The patterns were designed in black and white so that they could be robustly recognised by the cameras (Fig.4). Once multiple detected patterns map to the same target slot, the corresponding cardboards are overlaid with green rectangles, which remind the participants that these cardboards can be processed together. Besides the reminder rectangles, the overall layout and the target slots are highlighted in the bottom-left corner.
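The highlighting rule described above (overlay every detected cardboard that shares a target slot with the scanned one) can be illustrated with a minimal Python sketch. The prototype itself was built in Unity, so this is not the authors' code; the marker IDs, the slot numbering, and the `MARKER_TO_SLOT` assignment are hypothetical and chosen only for illustration.

```python
from collections import defaultdict

# Hypothetical assignment of the twelve marker IDs to six shelf slots (0-5).
# The real mapping used in the study is not given in the text.
MARKER_TO_SLOT = {marker: marker % 6 for marker in range(12)}


def cards_to_highlight(scanned_marker, visible_markers, marker_to_slot=MARKER_TO_SLOT):
    """Return the other visible markers that share the scanned marker's target slot."""
    target = marker_to_slot[scanned_marker]
    return sorted(
        m for m in visible_markers
        if m != scanned_marker and marker_to_slot[m] == target
    )


def group_by_slot(visible_markers, marker_to_slot=MARKER_TO_SLOT):
    """Group all currently detected markers by their target slot."""
    groups = defaultdict(list)
    for m in sorted(visible_markers):
        groups[marker_to_slot[m]].append(m)
    return dict(groups)
```

For example, with markers 0, 3, 6, and 7 in view, scanning marker 0 would highlight marker 6 under this hypothetical mapping, because both map to slot 0.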

3.4 Participants

The study recruited 40 volunteers from the local university, including 7 undergraduates, 23 master's students, and 10 doctoral students (22 males and 18 females, M_age = 23.9, SD_age = 3.43). The participants’ backgrounds included computer science, engineering management, machinery manufacturing, and other disciplines. Of these participants, 2 had used headset AR devices, 22 had used mobile AR systems, and 6 had no exposure to any AR systems. Each participant received an allowance of 50 CNY and provided informed consent prior to the study.

3.5 Apparatus

The AR headset used in the study was modified from a mobile phone-based AR headset that uses a see-through mirror to reflect the mobile phone’s screen contents (Fig.5). To allow the eye tracking device to be installed at the same time, the modified headset covered the mirror with a non-transparent plastic film and mounted two front-facing webcams instead. The two webcams had identical specifications (resolution 1920 × 1080 pixels, 1/2.7-inch CMOS, 60 fps). The upper webcam was connected to the eye tracking device to capture world images, and the bottom webcam was connected to the mobile phone to provide real-time images of the experimental scene.

The mobile phone was a Google Pixel 3 device (5.6-inch OLED screen, resolution 2220 × 1080 pixels, Android 9.0 Pie, Qualcomm® Snapdragon™ 670 CPU, 3000 mAh battery, size 6.0 × 2.8 × 0.3 inches, 147 grams). It ran Unity programmes in binocular mode and superimposed the programme-generated contents on the real-world images captured by the front webcam (Fig.6).

To capture real-time eye gazes throughout the study, we adopted Pupil Core, a lightweight and adjustable eye tracking device (Fig.7). It comprised a fixed world camera, an adjustable eye camera (infrared sensing-enabled, 720p, 1/4-inch CMOS, resolution 1280 × 720 pixels, 60 fps), and a main supporting frame with accessories such as a nose pad and cable clips (Fig.7). Because the AR headset and the eye tracking device competed for space when worn together, we removed the original world camera of the eye tracking device and replaced it with the front-facing webcam.

The shelf’s overall size was 1800 × 1000 × 350 mm. The height of the shelf was chosen with reference to the average height of Asian users, so that all participants could reach the shelf while performing the tasks. Each shelf slot was covered with a QR code card, which allowed the cameras to detect whether the current pseudo parcel was placed in the correct slot. The shelf had six slots of the same size (260 × 260 mm) arranged in two rows by three columns (Fig.8). When the participants stood one meter away from the shelf, they could see and reach all the slots.

3.6 Procedures

The study randomly divided the participants into two groups (N_{group A} = 20, N_{group B} = 20). Both groups were given the basic visual cues, e.g., colour markers, location mapping cues, and arrows. Group A had only the basic visual cues, whereas group B could also see multiple associative cues when they picked up a parcel. These multiple cues were displayed to indicate other parcels that could be processed simultaneously.

Procedures for the group A.

(1) The participants picked up one of the cardboard squares that were piled on the table and scanned it with the front camera (Fig.9).

(2) The participants received a beep sound and a green square when the scan was successful (Fig.9). The AR headset also displayed the navigation rectangle to indicate where to place the current parcel (Fig.9). The target location was highlighted as a small green square (Fig.9).

(3) The participants followed the navigation square to place the pseudo parcel into the expected shelf slot. During the process, the slot position indicator, a circle surrounded by four arrows, was displayed in the AR headset’s view, and the application detected which slot the current parcel slid into (Fig.10).

(4) The participants repeated the above task operations until all twelve cardboard squares were successfully delivered to the corresponding slots.

Procedures for the group B.

(1) The participants picked up one of the cardboard squares that were tiled on the table and scanned it with the front camera (same as depicted in Fig.9).

(2) The participants received a beep sound, and a green square was displayed on the cardboard square when the scan was successful (Fig.11). In addition, the AR headset highlighted other cardboard squares that shared the same target slot as the current cardboard square (Fig.11). The navigation rectangle used small green squares to indicate the shelf slot in which to place these pseudo parcels.

(3) The participants needed to collect all the highlighted cardboard squares and, as instructed, place them into the expected shelf slot. The indicator, a circle surrounded by four arrows, guided the participants in delivering the parcels. The application detected whether the parcels were slid into the correct slot and responded with error alert messages otherwise.

(4) The participants repeated the above task operations until the twelve cardboard squares were all correctly delivered.

After completing the tasks, the participants in both groups filled out two questionnaires, the NASA-TLX and the SUS, and took part in a semi-formal interview with the experimenter.

[1] HoloKit: Open-source AR headset for creative spatial computing. https://holokit.io/. Visited on April 19, 2023.

The study collected pre- and post-study questionnaires, eye tracking recordings, and experiment video footage from the forty participants. Based on these data, we analysed the influence of the respective user interfaces on the participants’ cognitive load, as well as the usability and learnability of the user interfaces. Specifically, cognitive load was measured through blink duration, fixation duration, gaze areas, and the NASA-TLX scale.

4.1 Cognitive load

4.1.1 Blink duration

Eye blinks reflect the participants’ visual attention activities (Zagermann et al. 2016), and a long blink duration is used to indicate high cognitive load (Behroozi et al. 2018). We used the Pupil Core eye tracking system to collect the eye movement data of each participant and processed the resulting data to remove outliers and derive averages, which yielded the average blink duration for the two groups of participants. We calculated both the mean and the median to make the results more comprehensive and reliable. The results showed descriptive differences in mean and median eye blink duration between the two groups (M_{group A} = 0.211 s, SD_{group A} = 0.034, low = 0.166, median = 0.205, high = 0.287; M_{group B} = 0.201 s, SD_{group B} = 0.019, low = 0.172, median = 0.195, high = 0.239) (Fig.12).
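The text does not state which outlier rule was applied before averaging. As a minimal Python sketch, assuming Tukey's 1.5 × IQR rule (an assumption, not the authors' documented procedure), the cleaning and summary step could look as follows; the function names and the sample values in the usage note are hypothetical.

```python
import statistics


def remove_outliers_iqr(values, k=1.5):
    """Drop values outside [Q1 - k*IQR, Q3 + k*IQR] (Tukey's rule, assumed here)."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if low <= v <= high]


def summarise(values):
    """Clean the sample, then report the descriptive statistics used in the text."""
    cleaned = remove_outliers_iqr(values)
    return {
        "n": len(cleaned),
        "mean": statistics.mean(cleaned),
        "median": statistics.median(cleaned),
        "sd": statistics.stdev(cleaned),
        "low": min(cleaned),
        "high": max(cleaned),
    }
```

Calling `summarise([0.19, 0.20, 0.21, 0.20, 0.22, 0.21, 0.19, 0.95])` on these hypothetical durations discards the 0.95 value and summarises the remaining seven.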

Because the results were not normally distributed, we adopted a non-parametric analysis to test the statistical difference in eye blink duration between the groups, and no significant difference was found (Mann-Whitney test U = 116, z = -0.720, P = 0.471 > 0.05). This finding indicated that the MCUIs had little effect on blink duration, reflecting a weak change in cognitive load.

4.1.2 Fixation duration

Fixation is a voluntary movement that refers to eye gazes lasting from approximately 200-300 ms to several seconds (Zagermann et al. 2016). High cognitive load often leads to long eye fixation durations (Rudmann et al. 2003, Zagermann et al. 2016). A long eye fixation usually means that a large amount of time is spent interpreting the visual components. During such a fixation, few other meaningful visual components are perceived.

We calculated the mean fixation duration of the two participant groups by processing the obtained eye movement data to remove outliers and derive mean values. There were descriptive differences between the two groups’ mean and median eye fixation durations (M_{group A} = 159.477 ms, SD_{group A} = 20.110, low = 134.346, median = 154.128, high = 205.243; M_{group B} = 156.988 ms, SD_{group B} = 20.174, low = 120.090, median = 154.887, high = 195.010) (Fig.13). Given the non-normal distribution of the data, we tested the difference between the two groups with a non-parametric method, and no significant difference was found (Mann-Whitney test U = 133, z = -0.108, P = 0.914 > 0.05). This finding was consistent with the previous one: the MCUIs had little effect on the participants’ fixation duration, reflecting a weak change in cognitive load.
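The z and P values reported for the two eye-tracking tests follow from the U statistics through the standard normal approximation of the Mann-Whitney test. A minimal Python sketch is given below. The group sizes of 16 and 17 are an assumption that is not stated in the text; they were chosen because they reproduce the reported values and would be consistent with some recordings being dropped during outlier removal. No tie or continuity correction is applied.

```python
import math


def mann_whitney_z(u, n1, n2):
    """Normal approximation of U (no tie correction, no continuity correction)."""
    mean_u = n1 * n2 / 2.0
    sd_u = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12.0)
    return (u - mean_u) / sd_u


def two_sided_p(z):
    """Two-sided p-value for a standard normal z score."""
    return math.erfc(abs(z) / math.sqrt(2.0))


# Assumed group sizes (not given in the text): 16 and 17 usable recordings.
N1, N2 = 16, 17

for label, u in (("blink duration", 116), ("fixation duration", 133)):
    z = mann_whitney_z(u, N1, N2)
    print(f"{label}: z = {z:.3f}, P = {two_sided_p(z):.3f}")
```

Under these assumed sizes, the sketch gives z = -0.720, P = 0.471 for blink duration and z = -0.108, P = 0.914 for fixation duration, matching the reported figures to three decimals.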

4.1.3 Gaze areas

We analysed gaze areas and visualised the related data as heatmaps to reflect the participants’ regions of interest. Heatmaps are effective in discovering the areas that receive the greatest number of eye gazes and have been successfully used in many previous studies (Jyotsna and Amudha 2018, Le Meur et al. 2017, Chandrika et al. 2020).

To probe how the participants looked at the MCUIs during the repetitive parcel sorting tasks, we analysed the participants’ eye gazes. Since the participants were constantly moving during the parcel sorting tasks, we manually extracted 200 eye tracking video clips recorded while the participants viewed the cardboards with MCUIs. These video clips, which lasted a couple of seconds on average, each comprised a number of images viewed from a static point of view. Since the participants in group A picked one parcel at a time from the desk, their heatmaps revealed how they looked at the target parcel and the others (Fig.14 a). In contrast, the heatmap of group B shows where the participants’ main visual attention fell when they were given multiple visual cues during parcel selection (Fig.14 b).
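A gaze heatmap of this kind is essentially a two-dimensional histogram of gaze samples over the world-camera image. The Python sketch below is a minimal illustration under stated assumptions: gaze points are given in pixel coordinates of a 1920 × 1080 frame, and the grid size and function names are hypothetical. It is not the tool used to produce Fig.14.

```python
def gaze_heatmap(points, width=1920, height=1080, cols=16, rows=9):
    """Count gaze samples per grid cell; points are (x, y) pixel coordinates."""
    grid = [[0] * cols for _ in range(rows)]
    for x, y in points:
        # Ignore samples that fall outside the camera frame.
        if 0 <= x < width and 0 <= y < height:
            col = int(x * cols / width)
            row = int(y * rows / height)
            grid[row][col] += 1
    return grid


def hottest_cell(grid):
    """Return (row, col, count) of the cell that received the most gaze samples."""
    return max(
        ((r, c, n) for r, row in enumerate(grid) for c, n in enumerate(row)),
        key=lambda cell: cell[2],
    )
```

Rendering the counts as colours, typically after smoothing, gives the familiar heatmap; the hottest cell approximates the main region of interest.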

The results showed that both groups exhibited similar observable eye gaze fixation areas and that the main eye gaze areas concentrated on the target cardboard, regardless of whether the MCUIs were displayed. We calculated the dispersion of the fixation data and conducted a variance analysis. The results showed that the two sets of data were significantly different (p = 0.044 < 0.05). The differences in eye gaze areas were also descriptively noticeable. For example, the group B (MCUI) participants showed more divergent eye gaze areas than the participants of group A, indicating a difference between the two groups in how they viewed the MCUIs. Furthermore, the eye gaze trajectories of the group B participants were more converged, regardless of the cardboard distributions. The same patterns of eye gaze fixations were also found when the participants positioned the cardboards in the target shelf slots (Fig.14 c and d), which indicated a consistent influence of the MCUIs on eye gazes.
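The dispersion measure is not specified in the text. One common choice, assumed here purely for illustration, is the root-mean-square distance of the fixation points from their centroid; a minimal Python sketch follows, with hypothetical function names.

```python
import math


def centroid(points):
    """Mean position of a set of (x, y) fixation points."""
    n = len(points)
    return (sum(x for x, _ in points) / n, sum(y for _, y in points) / n)


def rms_dispersion(points):
    """Root-mean-square distance of fixation points from their centroid."""
    cx, cy = centroid(points)
    squared = [(x - cx) ** 2 + (y - cy) ** 2 for x, y in points]
    return math.sqrt(sum(squared) / len(points))
```

Computing one dispersion value per clip for each group yields two samples that can then be compared with a variance analysis or a non-parametric test.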

4.1.4 NASA-TLX questionnaire

After the participants completed the tasks, they were asked to complete the NASA Task Load Index (NASA-TLX) questionnaire (Grier 2015) (see Appendix A). The questionnaire consists of six dimensions for a comprehensive assessment of cognitive load during repetitive tasks. These dimensions have been frequently used in studies assessing psychological stress (Virtanen et al. 2022) and cognitive load (Braarud 2021), and their correlations with cognitive load levels have been rigorously validated (Akyeampong et al. 2014). Table 1 summarises the mean levels of cognitive load of the participants in the two groups.

Table 1 Result of NASA-TLX questionnaire

Subscale	Weight	Rating (Group A)	Rating (Group B)	Adjusted score (Group A)	Adjusted score (Group B)
Mental Demand	3	26.25	25.38	78.75	76.14
Temporal Demand	1	43.50	40.19	43.50	40.19
Physical Demand	2	56.25	46.07	112.50	92.14
Performance	3	68.00	69.80	204.00	209.40
Effort	3	51.25	44.42	153.75	133.26
Frustration Level	3	28.75	26.35	86.25	79.05
Weighted sum				678.75	630.18

The analysis showed a significant difference in perceived cognitive load between the two groups: the overall NASA-TLX score of group B (the weighted sum in Table 1 divided by the total weight of 15) was about 7% lower than that of group A (M_{group A} = 45.25, M_{group B} = 42.01, Mann-Whitney test p = 0.002), indicating the effectiveness of the MCUIs in reducing perceived cognitive load.
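The arithmetic behind Table 1 and the reported group means can be retraced with a short sketch. It assumes the overall score is the weighted sum divided by the total weight (15), which reproduces the reported means. The variable and function names are illustrative.

```python
# Weights and mean ratings copied from Table 1.
WEIGHTS = {
    "Mental Demand": 3,
    "Temporal Demand": 1,
    "Physical Demand": 2,
    "Performance": 3,
    "Effort": 3,
    "Frustration Level": 3,
}
RATINGS_A = {
    "Mental Demand": 26.25,
    "Temporal Demand": 43.50,
    "Physical Demand": 56.25,
    "Performance": 68.00,
    "Effort": 51.25,
    "Frustration Level": 28.75,
}
RATINGS_B = {
    "Mental Demand": 25.38,
    "Temporal Demand": 40.19,
    "Physical Demand": 46.07,
    "Performance": 69.80,
    "Effort": 44.42,
    "Frustration Level": 26.35,
}


def weighted_tlx(ratings, weights):
    """Overall workload: sum of weight * rating divided by the total weight."""
    total_weight = sum(weights.values())
    weighted_sum = sum(ratings[name] * weights[name] for name in weights)
    return weighted_sum / total_weight


score_a = weighted_tlx(RATINGS_A, WEIGHTS)
score_b = weighted_tlx(RATINGS_B, WEIGHTS)
relative_drop = (score_a - score_b) / score_a  # share by which group B is lower

print(round(score_a, 2), round(score_b, 2), round(relative_drop * 100, 1))
```

The relative difference comes out slightly above 7%, in line with the figure quoted in the text.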

4.2 Perceived usability

We adopted the System Usability Scale (SUS) questionnaire to examine the perceived usability of the MCUIs in terms of effectiveness, ease of use, and overall satisfaction. The usability examination also checked whether the participants’ task performance was adversely influenced by any usability factors.

The results exhibited a significant difference in overall perceived usability scores between the two groups (M_{group A} = 78.750, SD_{group A} = 8.586, M_{group B} = 86.974, SD_{group B} = 6.378, Mann-Whitney test p = 0.004), indicating that group B (MCUIs) outperformed group A in terms of overall perceived usability. This unexpected result suggests that the display of multiple visual clues did not impair system usability; instead, it enhanced overall perceived usability by supplying assistive visual clues.
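For readers unfamiliar with how a 0 to 100 SUS score arises, the sketch below follows the standard SUS scoring scheme (Brooke 1996): ten items rated 1 to 5, odd items contributing the rating minus 1, even items contributing 5 minus the rating, and the sum multiplied by 2.5. It assumes the study scored the overall scale this way. The responses are hypothetical, and the mapping of items to the effectiveness, ease of use, and satisfaction aspects is not given in the text, so it is not modelled here.

```python
def sus_score(responses):
    """Standard SUS score (0 to 100) for one participant's ten 1-5 ratings."""
    if len(responses) != 10:
        raise ValueError("SUS needs exactly ten item ratings")
    if any(r not in (1, 2, 3, 4, 5) for r in responses):
        raise ValueError("each rating must be an integer from 1 to 5")
    total = 0
    for item, rating in enumerate(responses, start=1):
        # Odd items are positively worded, even items negatively worded.
        total += (rating - 1) if item % 2 == 1 else (5 - rating)
    return total * 2.5


# Hypothetical ratings from one participant (illustration only).
example = [4, 2, 5, 1, 4, 2, 4, 1, 5, 2]
print(sus_score(example))
```

Group means such as those reported above would then be averages of these per-participant scores.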

Furthermore, we analysed each aspect of the perceived usability of the MCUIs. The results showed a significant difference in effectiveness between the two groups (M_{group A} = 77.315, SD_{group A} = 13.650, median_{group A} = 75.000, M_{group B} = 90.351, SD_{group B} = 6.952, median_{group B} = 91.667, Mann-Whitney test p = 0.002). This result was consistent with the overall perceived usability result. In contrast, there was no significant difference in ease of use between the two groups (M_{group A} = 82.639, SD_{group A} = 13.985, median_{group A} = 87.500, M_{group B} = 88.487, SD_{group B} = 8.656, median_{group B} = 87.500, Mann-Whitney test p = 0.269). Nor was there a significant difference in overall satisfaction between the two groups (M_{group A} = 75.000, SD_{group A} = 14.852, median_{group A} = 75.000, M_{group B} = 81.579, SD_{group B} = 12.291, median_{group B} = 83.333, Mann-Whitney test p = 0.154).

Taking the above results together, we conclude that the group B (MCUIs) participants felt confident about using the MCUIs to improve the effectiveness of parcel sorting operations, e.g., by lowering operation mistakes. In contrast, the group B participants were less confident that the MCUIs significantly enhanced ease of use and overall satisfaction. Participants in both groups commented in the post-study interviews that they encountered few difficulties with or without the MCUIs. In this regard, we conservatively assume that the equivalent ease of use and overall satisfaction scores are attributable to the simple design of the MCUIs, both visually and interactively.

Table 2 Result of perceived usability

Group	Effectiveness	Ease of use	Overall satisfaction
User Interface of group A(n=20)	77.315	82.639	75.000
User Interface of group B(n=20)	90.351	88.487	81.579
The Mann-Whitney U statistic	76.500	135.500	125.000
The Mann-Whitney z statistic	-3.037	-1.105	-1.425
p	0.002**	0.269	0.154

*p < 0.05, **p < 0.01
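The U, z, and p statistics of the kind listed in Table 2 can be obtained as in the following sketch. It is a plain-Python Mann-Whitney U test using the normal approximation with a tie correction and no continuity correction. The exact procedure and software used in the study are not stated, so these choices are assumptions, and the scores below are hypothetical rather than the study data.

```python
import math


def mann_whitney_u(sample_a, sample_b):
    """Return (U, z, two-sided p) using the normal approximation."""
    n_a, n_b = len(sample_a), len(sample_b)
    n = n_a + n_b
    # Pool the observations, remembering which group each came from.
    pooled = sorted([(v, 0) for v in sample_a] + [(v, 1) for v in sample_b])
    ranks = [0.0] * n
    tie_term = 0.0
    i = 0
    while i < n:
        j = i
        while j + 1 < n and pooled[j + 1][0] == pooled[i][0]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # average of the 1-based ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[k] = avg_rank
        t = j - i + 1  # size of this tie group
        tie_term += t ** 3 - t
        i = j + 1
    rank_sum_a = sum(r for r, (_, g) in zip(ranks, pooled) if g == 0)
    u_a = rank_sum_a - n_a * (n_a + 1) / 2
    u = min(u_a, n_a * n_b - u_a)  # report the smaller U, so z is <= 0
    mean_u = n_a * n_b / 2
    var_u = n_a * n_b / 12 * ((n + 1) - tie_term / (n * (n - 1)))
    if var_u == 0:
        raise ValueError("all observations are identical")
    z = (u - mean_u) / math.sqrt(var_u)
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided p from the normal tail
    return u, z, p


# Hypothetical usability scores (illustration only).
group_a = [70.0, 75.0, 62.5, 80.0, 77.5, 67.5]
group_b = [85.0, 90.0, 87.5, 92.5, 82.5, 95.0]
u, z, p = mann_whitney_u(group_a, group_b)
print(u, round(z, 3), round(p, 4))
```

Reporting the smaller of the two U values yields a negative z, which is the sign convention seen in Table 2. For small samples an exact test may be preferable to the normal approximation.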

4.3 Summary of results

To present a complete picture of the study findings, we summarise all the results in Table 3. Based on the results, H1 is supported, as both the eye tracking analysis and the questionnaire results consistently showed lower physical fatigue in the MCUIs group. H2 is partially supported, as the NASA-TLX questionnaire results showed significantly lower levels of cognitive load in the MCUIs group, whereas the eye tracking analysis showed inconsistent results: the MCUIs group had significantly more convergent gaze areas, but their blink duration and fixation duration showed no difference from the other group. H3 is also partially supported, as the MCUIs group reported significantly higher scores for overall perceived usability and effectiveness, whereas their feedback showed no difference in ease of use and overall satisfaction.

Table 3 Summary of study results

Measures	Metrics	Results
Cognitive load	Blink duration	no significant difference, p = 0.587
	Fixation duration	no significant difference, p = 0.649
	Gaze area	significant difference, p = 0.044
	NASA-TLX	significant difference, p = 0.002
Perceived usability	Overall usability	significant difference, p = 0.004
	Effectiveness	significant difference, p = 0.002
	Ease of use	no significant difference, p = 0.269
	Satisfaction	no significant difference, p = 0.154

The study findings disclose the influence of MCUIs on cognitive load and perceived usability. Some findings are consistent with previous studies, e.g., the MCUIs reduce the participants’ physical fatigue by providing multiple visual clues for repetitive operations, and the MCUIs result in a high level of perceived usability. More importantly, the study reveals how the MCUIs influence cognitive load and perceived usability in the context of repetitive tasks in augmented reality. Specifically, the MCUIs are capable of redirecting the participants’ visual attention, as there are significant changes in gaze area during the repetitive tasks. Another advantage of MCUIs is that they reduce searching and navigating time during continuous tasks. Nevertheless, the MCUIs do not reduce task difficulty, as the participants’ blink duration and fixation duration show no significant changes.

The study adopts both qualitative and quantitative methods to measure the cognitive load and perceived usability of the MCUIs. The overall findings are consistent; however, several aspects of the results are inconsistent. For example, the eye gaze analysis shows no influence of MCUIs on blink duration and fixation duration, but a significant influence on gaze area. The results of the usability evaluation are similar: effectiveness and the overall usability score are significantly higher when using the MCUIs, but the other aspects, namely ease of use and satisfaction, show no differences. Such inconsistency also exists between the eye gaze analysis results and the NASA-TLX questionnaire results.

Despite the inconsistency of specific results, the overall findings remain coherent. Since the MCUIs cannot lower task difficulty, which has a strong impact on the level of cognitive load, the participants’ blink duration and fixation duration are not affected. However, because the MCUIs can redirect the participants’ visual attention during repetitive tasks, it is reasonable to observe significant gaze area changes. Because the MCUIs facilitate repetitive tasks, the participants feel that their cognitive load is lowered. Likewise, the usability questionnaires confirm the improvement in task effectiveness, but ease of use (or task difficulty) shows no difference. Regarding satisfaction, all the participants expressed a high level of satisfaction, although the MCUIs group has observably higher scores.

One of the study findings contradicts previous research, which has claimed that MCUIs reduce cognitive load. In contrast, our study indicates that the MCUIs do not significantly lower the level of cognitive load during repetitive tasks in augmented reality. The difference can be attributed to several reasons. One is the nature of repetitive tasks in augmented reality, which have a relatively stable level of cognitive load, whereas previous research often adopted one-off tasks with varying difficulties, e.g., reading a paragraph in a foreign language. Another reason is that the MCUIs help the participants complete the tasks more quickly, which consequently gives the participants an implicit impression that the task difficulty is tapering off. This is supported by the NASA-TLX questionnaire results, which qualitatively indicate a significant effect of MCUIs on reducing perceived cognitive load. Based on that, we tentatively claim that enhancing task operation efficiency leads to the feeling of lower cognitive load, although the actual task difficulty does not change.

The study adopted a fixed number of visual clues throughout the experiments. There are observable differences between the MCUIs group and the other group: the former maintains stable operation efficiency by following the visual prompts, whereas the latter has to search for the relevant clues even after multiple task repetitions. This again confirms the advantages of MCUIs. In addition, previous research has cautioned that MCUIs may run into information overload. Our study adds that the MCUIs’ overload problem can be effectively tackled by close integration with task contexts, e.g., spatially distributed MCUIs are relatively memorable. The influence of the number of visual clues is outside the study’s scope, and thus no solid evidence is provided on it.

There are several limitations of the current study. One is the participant pool: the participants were drawn from a university setting and may possess relatively higher cognitive capabilities than warehouse workers. This may inflate task performance in activities such as picking up a parcel and recognising the related information. Another limitation is the experimental environment, which mimics a warehouse with parcel sorting shelves and pseudo cardboard parcels. Real parcels vary in size and weight, which may quickly incur physical fatigue. Despite these constraints, the study adopts rigorous experimental methods and procedures to ensure its overall reliability and validity.

The study investigates how multi-cue user interfaces influence cognitive load and perceived usability for repetitive tasks in augmented reality. We developed a prototype of MCUIs and adopted eye tracking and NASA-TLX questionnaires to measure the level of cognitive load, and SUS questionnaires to score perceived usability. The results show that the MCUIs are effective in reducing the participants’ physical fatigue during the repetitive tasks. However, they do not eliminate cognitive load, although they improve the participants’ task efficiency and leave them with an impression of lower cognitive load. In addition, the MCUIs effectively improve overall perceived usability and the aspect of effectiveness, but do not affect ease of use and overall satisfaction. General implications of the study findings and future applications are discussed.

The authors indicate no financial relationships. Informed consent was obtained from all the participants in the study.

This study was funded by Zhejiang Provincial key R&D Programme (ref no: 2023C01045).

Acknowledgements

The research work is supported by the Information Technology Center and the State Key Lab of CAD&CG, Zhejiang University.

Akyeampong J, Udoka S, Caruso G, et al (2014) Evaluation of hydraulic excavator human–machine interface concepts using NASA TLX. International Journal of Industrial Ergonomics 44(3):374–382. https://doi.org/10.1016/j.ergon.2013.12.002
Al-Ameen MN, Wright M, Scielzo S (2015) Towards making random passwords memorable: Leveraging users’ cognitive ability through multiple cues. In: Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems. Association for Computing Machinery, New York, NY, USA, CHI ’15, pp 2315–2324, https://doi.org/10.1145/2702123.2702241
Albus, P., Vogt, A., Seufert, T (2021) Signaling in virtual reality influences learning outcome and cognitive load. Computers & Education 166, 104154. https://doi.org/10.1016/j.compedu.2021.104154
Alluisi EA, Muller Jr PF, Fitts PM (1957) An information analysis of verbal and motor responses in a forced-paced serial task. Journal of Experimental Psychology 53(3):153
Alves, J.B., Marques, B., Dias, P., Santos, B.S (2021) Using augmented reality for industrial quality assurance: a shop floor user study. Int J Adv Manuf Technol 115, 105–116. https://doi.org/10.1007/s00170-021-07049-8
Amin, Z., Ali, N.M., Smeaton, A.F (2021) Attention-Based Design and User Decisions on Information Sharing: A Thematic Literature Review. IEEE Access 9, 83285–83297. https://doi.org/10.1109/ACCESS.2021.3087740
Bai, H., Sasikumar, P., Yang, J., Billinghurst, M (2020) A User Study on Mixed Reality Remote Collaboration with Eye Gaze and Hand Gesture Sharing, in: Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems, CHI ’20. Association for Computing Machinery, New York, NY, USA, pp. 1–13. https://doi.org/10.1145/3313831.3376550
Behroozi M, Lui A, Moore I, et al (2018) Dazed: Measuring the cognitive load of solving technical interview problems at the whiteboard. In: Proceedings of the 40th International Conference on Software Engineering: New Ideas and Emerging Results. Association for Computing Machinery, New York, NY, USA, ICSE-NIER ’18, pp 93–96, https://doi.org/10.1145/3183399.3183415
Behzadan AH, Dong S, Kamat VR (2015) Augmented reality visualization: A review of civil infrastructure system applications. Advanced Engineering Informatics 29(2):252–267. https://doi.org/10.1016/j.aei.2015.03.005
Bertoni A, Bertoni M, Isaksson O (2013) Value visualization in product service systems preliminary design. Journal of cleaner production 53:103–117
Biondi, F.N., Cacanindin, A., Douglas, C., Cort, J (2021) Overloaded and at Work: Investigating the Effect of Cognitive Workload on Assembly Task Performance. Hum Factors 63, 813–820. https://doi.org/10.1177/0018720820929928
Blaga, A., Militaru, C., Mezei, A.-D., Tamas, L (2021) Augmented reality integration into MES for connected workers. Robotics and Computer-Integrated Manufacturing 68, 102057. https://doi.org/10.1016/j.rcim.2020.102057
Blanco, C.F., Sarasa, R.G., Sanclemente, C.O (2010) Effects of visual and textual information in online product presentations: looking for the best combination in website design. European Journal of Information Systems 19, 668–686. https://doi.org/10.1057/ejis.2010.42
Bottani E, Vignali G (2019) Augmented reality technology in the manufacturing industry: A review of the last decade. IISE Transactions 51(3):284–310. https://doi.org/10.1080/24725854.2018.1493244
Braarud, P.Ø (2021) Investigating the validity of subjective workload rating (NASA TLX) and subjective situation awareness rating (SART) for cognitively complex human–machine work. International Journal of Industrial Ergonomics 86, 103233. https://doi.org/10.1016/j.ergon.2021.103233
Brannon Barhorst, J., McLean, G., Shah, E., Mack, R (2021) Blending the real world and the virtual world: Exploring the role of flow in augmented reality experiences. Journal of Business Research 122, 423–436. https://doi.org/10.1016/j.jbusres.2020.08.041
Broadbent, D.P., D’Innocenzo, G., Ellmers, T.J., Parsler, J., Szameitat, A.J., Bishop, D.T (2023) Cognitive load, working memory capacity and driving performance: A preliminary fNIRS and eye tracking study. Transportation Research Part F: Traffic Psychology and Behaviour 92, 121–132. https://doi.org/10.1016/j.trf.2022.11.013
Brooke J, et al (1996) Sus-a quick and dirty usability scale. Usability evaluation in industry 189(194):4–7. https://doi.org/10.1201/9781498710411-35
Buchner, J., Buntins, K., Kerres, M (2022) The impact of augmented reality on cognitive load and performance: A systematic review. Journal of Computer Assisted Learning 38, 285–303. https://doi.org/10.1111/jcal.12617
Chandrika, K.R., Amudha, J., Sudarsan, S.D (2020) Identification and Classification of Expertise Using Eye Gaze—Industrial Use Case Study with Software Engineers, in: Bansal, J.C., Gupta, M.K., Sharma, H., Agarwal, B. (Eds.), Communication and Intelligent Systems, Lecture Notes in Networks and Systems. Springer, Singapore, pp. 391–405. https://doi.org/10.1007/978-981-15-3325-9_30
Chen W, Shan Y, Wu Y, et al (2021) Design and evaluation of a distance-driven user interface for asynchronous collaborative exhibit browsing in an augmented reality museum. IEEE Access 9:73948–73962. https://doi.org/10.1109/ACCESS.2021.3080286
Chen, Y., Li, Q., Kong, D., Kei, Y.L., Zhu, S.-C., Gao, T., Zhu, Y., Huang, S (2021) YouRefIt: Embodied Reference Understanding With Language and Gesture. Presented at the Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1385–1395.
Cirulis A, Ginters E (2013) Augmented reality in logistics. Procedia Computer Science 26:14–20. https://doi.org/10.1016/j.procs.2013.12.003
Devagiri, J.S., Paheding, S., Niyaz, Q., Yang, X., Smith, S (2022) Augmented Reality and Artificial Intelligence in industry: Trends, tools, and future challenges. Expert Systems with Applications 207, 118002. https://doi.org/10.1016/j.eswa.2022.118002
Devos, H., Gustafson, K., Ahmadnezhad, P., Liao, K., Mahnken, J.D., Brooks, W.M., Burns, J.M (2020) Psychometric Properties of NASA-TLX and Index of Cognitive Activity as Measures of Cognitive Workload in Older Adults. Brain Sciences 10, 994. https://doi.org/10.3390/brainsci10120994
Du W, Piater J (2008) A probabilistic approach to integrating multiple cues in visual tracking. In: Forsyth D, Torr P, Zisserman A (eds) Computer Vision–ECCV 2008. Springer, Berlin, Heidelberg, Lecture Notes in Computer Science, pp 225–238, https://doi.org/10.1007/978-3-540-88688-4_17
Dunkelberger N, Bradley J, Sullivan JL, et al (2018) Improving perception accuracy with multi-sensory haptic cue delivery. In: Prattichizzo D, Shinoda H, Tan HZ, et al (eds) Haptics: Science, Technology, and Applications, vol 10894. Springer International Publishing, Cham, p 289–301, https://doi.org/10.1007/978-3-319-93399-3_26
Dunphy P, Nicholson J, Olivier P (2008) Securing passfaces for description. In: Proceedings of the 4th Symposium on Usable Privacy and Security, pp 24–35
Egger, J., Masood, T (2020) Augmented reality in support of intelligent manufacturing – A systematic literature review. Computers & Industrial Engineering 140, 106195. https://doi.org/10.1016/j.cie.2019.106195
Eswaran, M., Gulivindala, A.K., Inkulu, A.K., Raju Bahubalendruni, M.V.A (2023) Augmented reality-based guidance in product assembly and maintenance/repair perspective: A state of the art review on challenges and opportunities. Expert Systems with Applications 213, 118983. https://doi.org/10.1016/j.eswa.2022.118983
Fani, S., Ciotti, S., Battaglia, E., Moscatelli, A., Bianchi, M (2018) W-FYD: A Wearable Fabric-Based Display for Haptic Multi-Cue Delivery and Tactile Augmented Reality. IEEE Transactions on Haptics 11, 304–316. https://doi.org/10.1109/TOH.2017.2708717
Fani, S., Ciotti, S., Bianchi, M (2022) Multi-Cue Haptic Guidance Through Wearables for Enhancing Human Ergonomics. IEEE Transactions on Haptics 15, 115–120. https://doi.org/10.1109/TOH.2021.3137899
Febiyani, A., Febriani, A., & Ma'Sum, J (2021) Calculation of mental load from e-learning student with NASA TLX and SOFI method. Jurnal Sistem Dan Manajemen Industri, 5(1), 35-42.
Gibson, A.E (2017) The Design, development, and analysis of a wearable, multi-modal information presentation device to aid astronauts in obstacle avoidance during surface exploration (Thesis). Massachusetts Institute of Technology.
Gigerenzer G, Goldstein DG (1996) Reasoning the fast and frugal way: Models of bounded rationality. Psychological Review 103(4):650–669. https://doi.org/10.1037/0033-295X.103.4.650
Gil, H., Son, H., Kim, J.R., Oakley, I (2018) Whiskers: Exploring the Use of Ultrasonic Haptic Cues on the Face, in: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems. Presented at the CHI ’18: CHI Conference on Human Factors in Computing Systems, ACM, Montreal QC Canada, pp. 1–13. https://doi.org/10.1145/3173574.3174232
Glöckner A (2008) Does intuition beat fast and frugal heuristics? A systematic empirical analysis. In: Intuition in Judgment and Decision Making. Lawrence Erlbaum Associates Publishers, Mahwah, NJ, US, p 309–325
Golash, R., Jain, Y.K., n.d. Low-cost Design of Vision-based Natural User Interface via Dynamic Hand Gestures.
Gong, Z., Wang, R., Xia, G (2022) Augmented Reality (AR) as a Tool for Engaging Museum Experience: A Case Study on Chinese Art Pieces. Digital 2, 33–45. https://doi.org/10.3390/digital2010002
Grier, R.A (2015) How High is High? A Meta-Analysis of NASA-TLX Global Workload Scores. Proceedings of the Human Factors and Ergonomics Society Annual Meeting 59, 1727–1731. https://doi.org/10.1177/1541931215591373
Hart SG, Staveland LE (1988) Development of NASA-TLX (task load index): Results of empirical and theoretical research. In: Hancock PA, Meshkati N (eds) Advances in Psychology, Human Mental Workload, vol 52. North-Holland, p 139–183, https://doi.org/10.1016/S0166-4115(08)62386-9
Heemsbergen, L., Bowtell, G., Vincent, J (2021) Conceptualising Augmented Reality: From virtual divides to mediated dynamics. Convergence 27, 830–846. https://doi.org/10.1177/1354856521989514
Henderson SJ, Feiner S (2009) Evaluating the benefits of augmented reality for task localization in maintenance of an armored personnel carrier turret. In: 2009 8th IEEE International Symposium on Mixed and Augmented Reality, pp 135–144, https://doi.org/10.1109/ISMAR.2009.5336486
Henderson SJ, Feiner SK (2011) Augmented reality in the psychomotor phase of a procedural task. In: 2011 10th IEEE International Symposium on Mixed and Augmented Reality, pp 191–200, https://doi.org/10.1109/ISMAR.2011.6092386
Hetherington, N.J., Croft, E.A., Van der Loos, H.F.M (2021) Hey Robot, Which Way Are You Going? Nonverbal Motion Legibility Cues for Human-Robot Spatial Interaction. IEEE Robotics and Automation Letters 6, 5010–5015. https://doi.org/10.1109/LRA.2021.3068708
Hoffmann, H.-J (2003) Jesse James Garrett: The elements of user experience - User-centered design for the Web. i-com 2, 44–44. https://doi.org/10.1524/icom.2.1.44.19040
Hou, L., Wang, X., Bernold, L (2013) Using Animated Augmented Reality to Cognitively Guide Assembly. Journal of Computing in Civil Engineering. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000184
Jin, Y., Ma, M., Zhu, Y (2022) A comparison of natural user interface and graphical user interface for narrative in HMD-based augmented reality. Multimed Tools Appl 81, 5795–5826. https://doi.org/10.1007/s11042-021-11723-0
Jyotsna, C., Amudha, J (2018) Eye Gaze as an Indicator for Stress Level Analysis in Students, in: 2018 International Conference on Advances in Computing, Communications and Informatics (ICACCI). Presented at the 2018 International Conference on Advances in Computing, Communications and Informatics (ICACCI), pp. 1588–1593. https://doi.org/10.1109/ICACCI.2018.8554715
Kalyuga S (2009) The expertise reversal effect. In: Managing cognitive load in adaptive multimedia learning. IGI Global, p 58–80
Karelaia, N (2006) Thirst for confirmation in multi-attribute choice: Does search for consistency impair decision performance? Organizational Behavior and Human Decision Processes 100, 128–143. https://doi.org/10.1016/j.obhdp.2005.09.003
Kassem, K., Ungerböck, T., Wintersberger, P., Michahelles, F (2022) What Is Happening Behind The Wall? Towards a Better Understanding of a Hidden Robot’s Intent By Multimodal Cues. Proc. ACM Hum.-Comput. Interact. 6, 196:1-196:19. https://doi.org/10.1145/3546731
Kim, S., Billinghurst, M., Kim, K (2020) Multimodal interfaces and communication cues for remote collaboration. J Multimodal User Interfaces 14, 313–319. https://doi.org/10.1007/s12193-020-00346-8
Kim, S., Nussbaum, M.A., Gabbard, J.L (2019) Influences of augmented reality head-worn display type and user interface design on performance and usability in simulated warehouse order picking. Applied Ergonomics 74, 186–193. https://doi.org/10.1016/j.apergo.2018.08.026
Knowlton BJ, Squire LR, Gluck MA (1994) Probabilistic classification learning in amnesia. Learning & Memory 1(2):106–120. https://doi.org/10.1101/lm.1.2.106
Le Meur, O., Coutrot, A., Liu, Z., Rämä, P., Le Roch, A., Helo, A (2017) Visual Attention Saccadic Models Learn to Emulate Gaze Patterns From Childhood to Adulthood. IEEE Transactions on Image Processing 26, 4777–4789. https://doi.org/10.1109/TIP.2017.2722238
Li, S., Zhu, B., Yu, Z (2023) The Impact of Cue-Interaction Stimulation on Impulse Buying Intention on Virtual Reality Tourism E-commerce Platforms. Journal of Travel Research 00472875231183163. https://doi.org/10.1177/00472875231183163
Lima, I.B., Jeong, Y., Lee, C., Suh, G., Hwang, W (2021) Severity of Usability Problems and System Usability Scale (SUS) Scores on Augmented Reality (AR) User Interfaces. https://doi.org/10.24507/icicelb.12.02.175
Lindsay, G.W (2020) Attention in Psychology, Neuroscience, and Machine Learning. Frontiers in Computational Neuroscience 14.
Loch, F., Quint, F., Brishtel, I (2016) Comparing Video and Augmented Reality Assistance in Manual Assembly, in: 2016 12th International Conference on Intelligent Environments (IE). Presented at the 2016 12th International Conference on Intelligent Environments (IE), pp. 147–150. https://doi.org/10.1109/IE.2016.31
Maiti A, Smith M, Maxwell AD, et al (2018) Augmented reality and natural user interface applications for remote laboratories. In: Auer ME, Azad AK, Edwards A, et al (eds) Cyber-Physical Laboratories in Engineering and Science Education. Springer International Publishing, Cham, p 79–109, https://doi.org/10.1007/978-3-319-76935-6_4
Malan J.J (2019) The influence of digital distraction on cognitive load, attention conflict and meeting productivity.
Mao, C.-C., Chen, C.-H (2021) Augmented Reality of 3D Content Application in Common Operational Picture Training System for Army. International Journal of Human–Computer Interaction 37, 1899–1915. https://doi.org/10.1080/10447318.2021.1917865
Mathiassen, S.E., Möller, T., Forsman, M (2003) Variability in mechanical exposure within and between individuals performing a highly constrained industrial work task. Ergonomics 46, 800–824. https://doi.org/10.1080/0014013031000090125
McNab AL, Hess T (2009) Designing interfaces for faster information processing: Examination of the effectiveness of using multiple information cues. AMCIS 2009 Proceedings
Merenda C, Kim H, Tanous K, et al (2018) Augmented reality interface design approaches for goal-directed and stimulus-driven driving tasks. IEEE Transactions on Visualization and Computer Graphics 24(11):2875–2885. https://doi.org/10.1109/TVCG.2018.2868531
Moon, J., Ryu, J (2021) The effects of social and cognitive cues on learning comprehension, eye-gaze pattern, and cognitive load in video instruction. J Comput High Educ 33, 39–63. https://doi.org/10.1007/s12528-020-09255-x
Murauer N, Müller F, Günther S, et al (2018) An analysis of language impact on augmented reality order picking training. In: Proceedings of the 11th PErvasive Technologies Related to Assistive Environments Conference. Association for Computing Machinery, New York, NY, USA, PETRA ’18, pp 351–357, https://doi.org/10.1145/3197768.3201570
Nilsen, M., Kongsvik, T (2023) Health, Safety, and Well-Being in Platform-Mediated Work – A Job Demands and Resources Perspective. Safety Science 163, 106130. https://doi.org/10.1016/j.ssci.2023.106130
Ocampo, R., Tavakoli, M (2019) Improving User Performance in Haptics-Based Rehabilitation Exercises by Colocation of User’s Visual and Motor Axes via a Three-Dimensional Augmented-Reality Display. IEEE Robotics and Automation Letters 4, 438–444. https://doi.org/10.1109/LRA.2019.2891283
Otermans, P.C.J., Parton, A., Szameitat, A.J (2022) The working memory costs of a central attentional bottleneck in multitasking. Psychological Research 86, 1774–1791. https://doi.org/10.1007/s00426-021-01615-1
Oviatt S (2006) Human-centered design meets cognitive load theory: Designing interfaces that help people think. In: Proceedings of the 14th Annual ACM International Conference on Multimedia - MULTIMEDIA ’06. ACM Press, Santa Barbara, CA, USA, p 871, https://doi.org/10.1145/1180639.1180831
Paivio A (2014) Mind and Its Evolution: A Dual Coding Theoretical Approach. Psychology Press, New York, https://doi.org/10.4324/9781315785233
Pan, L., Yu, C., Li, J., Huang, T., Bi, X., Shi, Y (2022) Automatically Generating and Improving Voice Command Interface from Operation Sequences on Smartphones, in: Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems, CHI ’22. Association for Computing Machinery, New York, NY, USA, pp. 1–21. https://doi.org/10.1145/3491102.3517459
Papadopoulos, T., Evangelidis, K., Kaskalis, T.H., Evangelidis, G., Sylaiou, S (2021) Interactions in Augmented and Mixed Reality: An Overview. Applied Sciences 11, 8752. https://doi.org/10.3390/app11188752
Papakostas, C., Troussas, C., Krouska, A., Sgouropoulou, C (2021) Measuring User Experience, Usability and Interactivity of a Personalized Mobile Augmented Reality Training System. Sensors 21, 3888. https://doi.org/10.3390/s21113888
Persike M, Meinhardt G (2008) Cue summation enables perceptual grouping. Journal of Experimental Psychology: Human Perception and Performance 34(1):1–26. https://doi.org/10.1037/0096-1523.34.1.1
Plessner H, Schweizer G, Brand R, et al (2009) A multiple-cue learning approach as the basis for understanding and improving soccer referees’ decision making. In: Raab M, Johnson JG, Heekeren HR (eds) Progress in Brain Research, Mind and Motion: The Bidirectional Link between Thought and Action, vol 174. Elsevier, p 151–158, https://doi.org/10.1016/S0079-6123(09)01313-2
Poupyrev I, Tan D, Billinghurst M, et al (2002) Developing a generic augmented-reality interface. Computer 35(3):44–50. https://doi.org/10.1109/2.989929
Pratticò, F.G., Lamberti, F., Cannavò, A., Morra, L., Montuschi, P (2021) Comparing State-of-the-Art and Emerging Augmented Reality Interfaces for Autonomous Vehicle-to-Pedestrian Communication. IEEE Transactions on Vehicular Technology 70, 1157–1168. https://doi.org/10.1109/TVT.2021.3054312
Pusch, A., Lécuyer, A (2011) Pseudo-haptics: from the theoretical foundations to practical system design guidelines, in: Proceedings of the 13th International Conference on Multimodal Interfaces. Presented at the ICMI’11: INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ACM, Alicante Spain, pp. 57–64. https://doi.org/10.1145/2070481.2070494
Radkowski R, Herrema J, Oliver J (2015) Augmented reality-based manual assembly support with visual features for different degrees of difficulty. International Journal of Human–Computer Interaction 31(5):337–349. https://doi.org/10.1080/10447318.2014.994194
Rudmann DS, McConkie GW, Zheng XS (2003) Eyetracking in cognitive state detection for hci. In: Proceedings of the 5th International Conference on Multimodal Interfaces. Association for Computing Machinery, New York, NY, USA, ICMI ’03, pp 159–163, https://doi.org/10.1145/958432.958464
Said, S., Gozdzik, M., Roche, T.R., Braun, J., Rössler, J., Kaserer, A., Spahn, D.R., Nöthiger, C.B., Tscholl, D.W (2020) Validation of the Raw National Aeronautics and Space Administration Task Load Index (NASA-TLX) Questionnaire to Assess Perceived Workload in Patient Monitoring Tasks: Pooled Analysis Study Using Mixed Models. Journal of Medical Internet Research 22, e19472. https://doi.org/10.2196/19472
Schnotz W, Kürschner C (2007) A reconsideration of cognitive load theory. Educational Psychology Review 19(4):469–508. https://doi.org/10.1007/s10648-007-9053-4
Serafini, E.J., Sanz, C (2016) EVIDENCE FOR THE DECREASING IMPACT OF COGNITIVE ABILITY ON SECOND LANGUAGE DEVELOPMENT AS PROFICIENCY INCREASES. Studies in Second Language Acquisition 38, 607–646. https://doi.org/10.1017/S0272263115000327
Severijnen, G.G.A., Di Dona, G., Bosker, H.R., McQueen, J.M (2023) Tracking talker-specific cues to lexical stress: Evidence from perceptual learning. Journal of Experimental Psychology: Human Perception and Performance 49, 549–565. https://doi.org/10.1037/xhp0001105
Shen, Z., Zhang, L., Li, R., Liang, R (2020) The effects of icon internal characteristics on complex cognition. International Journal of Industrial Ergonomics 79, 102990. https://doi.org/10.1016/j.ergon.2020.102990
Silva, S., Almeida, N., Pereira, C., Martins, A.I., Rosa, A.F., Oliveira E Silva, M., Teixeira, A (2015) Design and Development of Multimodal Applications: A Vision on Key Issues and Methods, in: Antona, M., Stephanidis, C (Eds.), Universal Access in Human-Computer Interaction. Access to Today’s Technologies, Lecture Notes in Computer Science. Springer International Publishing, Cham, pp. 109–120. https://doi.org/10.1007/978-3-319-20678-3_11
Skulmowski, A., Xu, K.M (2022) Understanding Cognitive Load in Digital and Online Learning: a New Perspective on Extraneous Cognitive Load. Educ Psychol Rev 34, 171–196. https://doi.org/10.1007/s10648-021-09624-7
Spittle, B., Frutos-Pascual, M., Creed, C., Williams, I (2023) A Review of Interaction Techniques for Immersive Environments. IEEE Transactions on Visualization and Computer Graphics 29, 3900–3921. https://doi.org/10.1109/TVCG.2022.3174805
Srihari RK (1995) Computational models for integrating linguistic and visual information: A survey. Artificial Intelligence Review 8(5-6):349–369. https://doi.org/10.1007/BF00849725
Steffen, J.H., Gaskin, J.E., Meservy, T.O., Jenkins, J.L., Wolman, I (2019) Framework of Affordances for Virtual Reality and Augmented Reality. Journal of Management Information Systems 36, 683–729. https://doi.org/10.1080/07421222.2019.1628877
Stoltz MH, Giannikas V, McFarlane D, et al (2017) Augmented reality in warehouse operations: Opportunities and barriers. IFAC-PapersOnLine 50(1):12979–12984. https://doi.org/10.1016/j.ifacol.2017.08.1807
Sujito, F., Arifudin, R., Arini, F.Y (2019) An Analysis of User Interface and User Experience Using System Usability Scale and GOMS Method. Journal of Advances in Information Systems and Technology 1, 65–73. https://doi.org/10.15294/jaist.v1i1.36503
Sumner, E.M., Hayes, R.A., Carr, C.T., Wohn, D.Y (2020) Assessing the cognitive and communicative properties of Facebook Reactions and Likes as lightweight feedback cues. FM. https://doi.org/10.5210/fm.v25i2.9621
Sweller J (1994) Cognitive load theory, learning difficulty, and instructional design. Learning and Instruction 4(4):295–312. https://doi.org/10.1016/0959-4752(94)90003-5
Sweller J, Chandler P (1994) Why some material is difficult to learn. Cognition and Instruction 12(3):185–233. https://doi.org/10.1207/s1532690xci1203_1
Sweller J, van Merrienboer JJG, Paas FGWC (1998) Cognitive architecture and instructional design. Educational Psychology Review 10(3):251–296. https://doi.org/10.1023/A:1022193728205
Tan HZ, Reed CM, Durlach NI (2010) Optimum information transfer rates for communication through haptic and other sensory modalities. IEEE Transactions on Haptics 3(2):98–108. https://doi.org/10.1109/TOH.2009.46
Tilton-Bolowsky, V., Vallila-Rohter, S., Arbel, Y (2021) Strategy Development and Feedback Processing During Complex Category Learning. Frontiers in Psychology 12.
Urakami, J., Seaborn, K (2023) Nonverbal Cues in Human–Robot Interaction: A Communication Studies Perspective. J. Hum.-Robot Interact. 12, 22:1-22:21. https://doi.org/10.1145/3570169
Van Krevelen DWF, Poelman R (2010) A survey of augmented reality technologies, applications and limitations. International journal of virtual reality 9(2):1–20
Venkatesan, M., Mohan, H., Ryan, J.R., Schürch, C.M., Nolan, G.P., Frakes, D.H., Coskun, A.F (2021) Virtual and augmented reality for biomedical applications. Cell Reports Medicine 2, 100348. https://doi.org/10.1016/j.xcrm.2021.100348
Virtanen, K., Mansikka, H., Kontio, H., Harris, D (2022) Weight watchers: NASA-TLX weights revisited. Theoretical Issues in Ergonomics Science 23, 725–748. https://doi.org/10.1080/1463922X.2021.2000667
Vlachogianni P, Tselios N (2022) Perceived usability evaluation of educational technology using the system usability scale (sus): A systematic review. Journal of Research on Technology in Education 54(3):392–409. https://doi.org/10.1080/15391523.2020.1867938
Wang, C., Zhang, F., Wang, J., Doyle, J.K., Hancock, P.A., Mak, C.M., Liu, S (2021) How indoor environmental quality affects occupants’ cognitive functions: A systematic review. Building and Environment 193, 107647. https://doi.org/10.1016/j.buildenv.2021.107647
Wang, T., Li, S., Lajoie, S (2023) The Interplay Between Cognitive Load and Self-Regulated Learning in a Technology-Rich Learning Environment. Educational Technology & Society 26, 50–62.
Wang, W., Wang, F., Song, W., Su, S (2020) Application of Augmented Reality (AR) Technologies in inhouse Logistics. E3S Web Conf. 145, 02018. https://doi.org/10.1051/e3sconf/202014502018
Wei, T., Wang, W., Yu, S (2022) Analysis of the Cognitive Load of Employees Working from Home and the Construction of the Telecommuting Experience Balance Model. Sustainability 14, 11722. https://doi.org/10.3390/su141811722
Wu, P.-H., Hwang, G.-J., Yang, M.-L., Chen, C.-H (2018) Impacts of integrating the repertory grid into an augmented reality-based learning design on students’ learning achievements, cognitive load and degree of satisfaction. Interactive Learning Environments 26, 221–234. https://doi.org/10.1080/10494820.2017.1294608
Xie, Y., Zhou, L., Dai, X., Yuan, L., Bach, N., Liu, C., Zeng, M (2022) Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning. Advances in Neural Information Processing Systems 35, 17287–17300.
Yan, Z., Wu, Y., Li, Y., Shan, Y., Li, X., Hansen, P (2022) Design Eye-Tracking Augmented Reality Headset to Reduce Cognitive Load in Repetitive Parcel Scanning Task. IEEE Transactions on Human-Machine Systems 52, 578–590. https://doi.org/10.1109/THMS.2022.3179954
Zagermann J, Pfeil U, Reiterer H (2016) Measuring cognitive load using eye tracking technology in visual computing. In: Proceedings of the Sixth Workshop on Beyond Time and Errors on Novel Evaluation Methods for Visualization. Association for Computing Machinery, New York, NY, USA, BELIV ’16, pp 78–85, https://doi.org/10.1145/2993901.2993908
Zhou, Y., Jia, N (2023) The Impact of Item Difficulty on Judgment of Confidence—A Cross-Level Moderated Mediation Model. Journal of Intelligence 11, 113. https://doi.org/10.3390/jintelligence1106011

No competing interests reported.
