Abstract

The purpose of this study was to examine the effect of navigation mode (passive versus active) on the virtual/real transfer of spatial learning, according to viewpoint displacement (ground: 1.75 m versus aerial: 4 m) and as a function of the recall tasks used. We hypothesized that active navigation during learning enhances performance when a route strategy is favored by an egocentric match between learning (ground-level viewpoint) and recall (egocentric frame-based tasks). Sixty-four subjects (32 men and 32 women) participated in the experiment. Spatial learning consisted of route learning in a virtual district (four conditions: passive/ground, passive/aerial, active/ground, or active/aerial), evaluated by three tasks: wayfinding, sketch-mapping, and picture-sorting. In the wayfinding task, subjects who were assigned the ground-level viewpoint in the virtual environment (VE) performed better than those with the aerial-level viewpoint, especially in combination with active navigation. In the sketch-mapping task, aerial-level learning in the VE resulted in better performance than the ground-level condition, while active navigation was only beneficial in the ground-level condition. The best performance in the picture-sorting task was obtained with the ground-level viewpoint, especially with active navigation. This study confirmed the expected result that the benefit of active navigation is linked to egocentric frame-based situations.

1. Introduction

Spatial cognition refers to the cognitive processes associated with the development of a comprehensive understanding of a spatial environment and the use of that knowledge for various purposes. Large-scale spatial cognition provides procedural and/or configurational knowledge about the environment, which results from the acquisition of successive levels of spatial knowledge [1, 2]: landmark, route, and survey knowledge of the environment. The third level would allow the development of a cognitive map that contains configurational information. This L-R-S model (landmarks, route, and survey) has subsequently been described in a “seri-parallel” form, in which the construction of configurational knowledge may start independently of the completion of the subordinate levels [3, 4].

In everyday navigation, individuals use two types of strategies: a route strategy and a survey strategy. Each strategy is based on the reference frame of the internal representations developed during navigation, that is, an egocentric (body-centered) representation for the route strategy and an allocentric (world-centered) representation for the survey strategy [5]. The frame of reference of these internal representations (i.e., egocentric versus allocentric) often depends on the learned viewpoint [6, 7]. In other words, different strategies are involved depending on the type of learning source used [8, 9]. For example, direct navigation involves egocentric representations and facilitates understanding of the route, while studying a map involves allocentric representations and promotes the acquisition of survey knowledge [8].

Research into spatial cognition has benefited from the introduction of Virtual Reality (VR) technology [10]. However, it is important to assess the similarities and differences between knowledge obtained in real life and that obtained in virtual environments (i.e., to verify that the best possible transfer of knowledge from the VE to the real situation is obtained) [11]. Transfer studies distinguish between transfer of skill (from one sensory modality to another) and transfer of spatial knowledge (conservation of knowledge from the learning to the test situation) [11]. Several studies have demonstrated an effective transfer of skill and/or spatial knowledge from virtual to real environments (virtual/real transfer), indicating that the spatial knowledge acquired in virtual environments is quite similar to that acquired in real environments (e.g., [11–20]). With respect to activities of daily living (ADL), the aim of these “transfer” studies is to measure the benefits of exposure to the VE on future navigation activities in real life. Such a task is classically called “wayfinding”: it consists of verifying that a subject can replicate in a real situation a route learned in the VE. Moreover, various additional factors can optimize the quality of this type of transfer (e.g., [18–20]).

Among the factors that have attracted researchers’ interest in the spatial cognition field is the user’s navigation mode. Evidence from experiments in real environments has indicated that spatial learning performance improves when subjects navigate actively (i.e., active condition) compared to passive navigation (i.e., when the subject is only an observer). Active navigation may therefore reinforce perception-action coupling (e.g., [21, 22]). In VR studies, according to the taxonomy presented by Wilson et al. [23], four navigation modes are distinguished: (i) physically passive navigation (the subject observes the VE without moving), (ii) physically active navigation (the subject moves using a motor interactor), (iii) psychologically passive navigation (the subject follows directions), and (iv) psychologically active navigation (the subject freely explores the VE). In the present study, we focus on the physical navigation mode. Some earlier VR studies have demonstrated that physically active navigation is more effective for spatial knowledge acquisition (e.g., [24–26]), whereas others have revealed a detrimental effect, or no significant difference at all (e.g., [23, 27–30]). Still other studies have obtained mixed results, depending on the type of recall task involved (e.g., [13, 18–20, 31, 32]). Thus, the beneficial effect of active navigation in VE studies has not yet been clearly established. This may be because these studies examined active navigation with different experimental designs, navigation modes, and recall tests (see [33] for a review).

Few studies have examined the impact of active navigation on the virtual/real transfer of skill using a wayfinding task (e.g., [18–20, 32]). These studies suggest a beneficial effect of active navigation. For example, Wallet et al. [20] studied the effect of active learning in a VE on performance in a wayfinding task. This task is considered to be based principally on a route strategy, since it involves path following after route learning [2]. The subjects had to learn a route in a virtual environment in passive or active mode (without or with a joystick) and then replicate this route in the real environment. The results indicated better performance for “active” participants. As with results obtained in real environments, this can be explained by a consolidation of perception-action coupling (e.g., [20, 24]), which underlies the route strategy in pedestrian navigation. Both strategies are probably used in everyday navigation, as individuals can “switch” between egocentric and allocentric representations while navigating [34]. However, these results [20] suggest that one strategy is privileged according to the situation. Consequently, we presume that activity during navigation is particularly beneficial in egocentric frame-based situations. To test this hypothesis, we propose to disrupt the frame of reference by manipulating the viewpoint during active learning in VR.

In VR-based studies, the viewpoint is easily manipulated through the locomotion metaphor implemented to display the VE: ground- and aerial-level metaphors are simply display parameters of the VE. Darken and Peterson [35] claimed that the viewpoint displacement metaphor is an important variable in VR systems, owing to its mediating effects on spatial representation and understanding of the explored VE. Learning in ground-level mode involves egocentric representations and facilitates understanding of the route, whereas the aerial-level mode can involve allocentric representations and promotes the acquisition of survey knowledge [36]. For example, Witmer et al. [36] proposed several navigation aids for the acquisition of configurational knowledge. Among these aids, the authors proposed adding an aerial-level mode (in this case, a map) to ground-level navigation in the VE. Their results indicate that adding a map during navigation improves performance in tasks concerning the configuration of the environment. In our experiment, the aerial-level condition elevated the viewpoint to 4 meters. It is difficult to envisage that a 4 m high viewpoint would lead to a “total” survey representation; however, we expected that this unusually elevated perspective would be sufficient to disturb the acquisition of egocentric representations during active navigation.

In this context, the main purpose of this study was to investigate the impact of navigation mode (physically passive versus active) on the virtual/real transfer of spatial learning according to viewpoint displacement (ground: 1.75 m versus aerial: 4 m). Our main task involved wayfinding: it consisted of verifying that a subject could replicate in a real situation a route learned in the VE. This task is probably the closest to everyday spatial activities and allowed us to examine the virtual/real transfer of skills. We added two laboratory tasks (a sketch-mapping task and a picture-sorting task) to study the transfer of spatial knowledge and the impact of the type of spatial recall task administered. Concerning this last point, several authors have noted that the outcome of active-passive comparisons may depend on the type of spatial recall task and the measure employed (e.g., [13, 33]). We hypothesized that an active mode of route learning may enhance perception-action coupling (which underlies the route strategy in pedestrian navigation) in an egocentric frame-based situation. In other words, we expected active navigation to be beneficial when the route strategy is favored by a matched viewpoint between learning (ground-level viewpoint) and recall (egocentric frame-based tasks). Thus, we expected the best performance in the wayfinding task to be obtained with active/ground-level navigation. Concerning the laboratory tasks, we hypothesized that active/ground-level navigation would facilitate performance in the picture-sorting task (egocentric frame-based) and passive/aerial-level navigation in the sketch-mapping task (allocentric frame-based). In these hypotheses, the “frame” does not refer to the internal representation but simply to the frame of reference proposed by the tasks.

2. Method

2.1. Participants

The participants were 64 student volunteers from Victor Segalen Bordeaux 2 University (32 men and 32 women, with an average age of 20). All subjects were unfamiliar with the area used in the experiment. All of the subjects were native French speakers and right-handed. Thirty-two subjects participated in the ground-level condition and 32 in the aerial-level condition (corresponding to the type of viewpoint displacement). In each condition, two groups of 16 subjects were formed according to the type of learning (i.e., passive virtual versus active virtual). The four groups were matched by age ( ) and number of years of education ( ). To match the groups in terms of spatial skills, the subjects performed paper-and-pencil tests before the experimental tasks. Spatial skills were assessed using the following: the Survey Aptitude Test (only parts GZ-5 and GZ-6; [37]) and the Mental Rotation Test (MRT, [38]). There were no significant differences between the groups in terms of performance on any of the spatial tests ( ; ; , resp.). In addition, there were no gender differences across the groups for the two parts of the Survey Aptitude Test ( ; , resp.) or the MRT ( ).

2.2. Material

Two environments were used in our experiment: a virtual one and a real one. The real environment was an area near the Bordeaux hospital, and the virtual environment (VE) was a 3D replica of that area created in 2006 by engineering students from Bordeaux 2 University using Virtools software. Significant landmarks (e.g., signposts, signs, and urban furniture) as well as an auditory environment (i.e., urban sounds) were included in the VE. The characteristics of the route were as follows: 1,457 meters, 14 streets, 18 intersections, and 18 direction changes. Navigation mode was manipulated as follows: in the passive condition, participants only visualized the route without any interaction, whereas in the active condition, they used a joystick to interact with the VE. In the present experiment, with the future aim of developing a retraining program for patients with orientation disorders, we selected a joystick because it is an interactor suitable for this population [39]; it can also be used by patients with a lower-limb motor disability during cognitive rehabilitation.

Viewpoint was manipulated as follows: in the ground-level condition, participants visualized the route at human height (i.e., 1.75 meters above the ground), whereas in the aerial-level condition, they visualized the route from an elevated viewpoint (i.e., 4 meters above the ground). In both viewpoint conditions (see Figure 1), the eye orientation was parallel to the ground plane.
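
For illustration, the four learning conditions can be summarized as two display/interaction parameters of the VE. The following minimal sketch (in Python) is not the original Virtools implementation; the class and attribute names are hypothetical and simply restate the 2 × 2 design described above.

```python
from dataclasses import dataclass
from itertools import product

@dataclass(frozen=True)
class LearningCondition:
    viewpoint: str   # "ground" or "aerial"
    navigation: str  # "passive" or "active"

    @property
    def eye_height_m(self) -> float:
        # Ground level: human eye height (1.75 m); aerial level: elevated viewpoint (4 m).
        return 1.75 if self.viewpoint == "ground" else 4.0

    @property
    def joystick_enabled(self) -> bool:
        # Active participants steer along the pre-established route with a joystick;
        # passive participants only watch a recorded traversal.
        return self.navigation == "active"

# The four learning groups of the 2 x 2 design (16 participants per group).
conditions = [LearningCondition(v, n)
              for v, n in product(("ground", "aerial"), ("passive", "active"))]
for c in conditions:
    print(c.viewpoint, c.navigation, c.eye_height_m, c.joystick_enabled)
```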

The apparatus used in the virtual reality room was a Dell personal computer (3 GHz, 5 GB RAM) with an nVidia Quadro FX 4400 graphics card, an F1+ projector, a 2 × 1.88 meter flat screen, and a Logitech Force 3D Pro joystick for the active navigation mode. The participants were seated two meters from the display screen.

2.3. Procedure

The experimental procedure consisted of the following three steps for each participant:

(1) A training phase (10 min.): the subject freely explored a virtual environment similar to the VE used in the experiment (created by the same process, but representing a different area of Bordeaux). The purpose of this phase was to allow the subjects to become comfortable with the VE technology.

(2) A route learning phase: participants were randomly allocated to groups. They visualized either the ground-level viewpoint or the aerial-level viewpoint. In each condition, the participants were divided into two groups according to the display mode: a passive virtual environment (with a recorded route and directions given by the researcher) or an active virtual environment with a joystick (the route was still pre-established by the researcher, who gave direction instructions).

(3) A test phase: three spatial knowledge recall tasks, presented in a counterbalanced order among the participants.

(i) The wayfinding task was a reproduction of the route in the real environment. Participants were required to replicate the learned route in real life. Direction errors were noted. In case of error, participants were stopped and asked to change direction. In some cases, participants made several directional errors at the same intersection. In this task, all errors in direction changes were counted.

(ii) The sketch-mapping task consisted of a free-hand reproduction of the visualized route in the form of a sketch (see Figure 2(a)). Participants were required to draw a simple outline sketch (connected segments), and the goal was to indicate and number the direction changes. In this task, all errors and omissions in direction changes on the sketches were counted.

(iii) The picture-sorting task consisted of sorting a series of pictures taken along the route in the real world into the order in which they were encountered (see Figure 2(b)). The experimenter counted the number of classification errors (i.e., incorrectly positioned pictures).
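
To make the scoring of the picture-sorting task concrete, the short sketch below counts incorrectly positioned pictures by comparing a participant's ordering with the order actually encountered along the route. It is a hypothetical helper written for illustration only, not the authors' scoring procedure; the function and variable names are ours.

```python
from typing import Sequence

def picture_sorting_errors(encountered_order: Sequence[str],
                           participant_order: Sequence[str]) -> int:
    """Count incorrectly positioned pictures (illustrative scoring helper)."""
    assert sorted(encountered_order) == sorted(participant_order), "same picture set expected"
    return sum(1 for correct, placed in zip(encountered_order, participant_order)
               if correct != placed)

# Example: five pictures A-E were encountered in alphabetical order;
# a participant who swaps C and D makes two positional errors.
print(picture_sorting_errors(list("ABCDE"), list("ABDCE")))  # -> 2
```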

3. Results

A two-way MANOVA (Navigation (passive virtual versus active virtual) × Viewpoint (ground versus aerial)) was applied to the three dependent measures from the wayfinding, sketch-mapping, and picture-sorting tasks. This multivariate analysis revealed significant effects for each main factor and their interaction as follows: Navigation, , , ; Viewpoint, , , ; and their interaction, , , .

An examination of the univariate effects produced the following results. All means and standard deviations are reported in Figure 3 (the score for each task was compared with the best potential score to obtain percentages).
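
For readers wishing to reproduce this type of analysis, the sketch below shows how a 2 × 2 MANOVA with univariate follow-up ANOVAs can be run in Python with statsmodels. The data file and column names are hypothetical; the paper's raw data and statistics are not reproduced here.

```python
import pandas as pd
import statsmodels.api as sm
import statsmodels.formula.api as smf
from statsmodels.multivariate.manova import MANOVA

# Hypothetical data file: one row per participant with the error count for each task
# and the two between-subject factors (navigation: passive/active; viewpoint: ground/aerial).
df = pd.read_csv("spatial_learning_errors.csv")

# 2 x 2 MANOVA on the three dependent measures.
manova = MANOVA.from_formula(
    "wayfinding_errors + sketch_errors + sorting_errors ~ navigation * viewpoint",
    data=df)
print(manova.mv_test())

# Univariate follow-up ANOVAs, one per recall task.
for dv in ("wayfinding_errors", "sketch_errors", "sorting_errors"):
    model = smf.ols(f"{dv} ~ navigation * viewpoint", data=df).fit()
    print(dv)
    print(sm.stats.anova_lm(model, typ=2))
```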

3.1. Direction Errors in the Wayfinding Task in Reality

The ANOVA revealed significant effects of the following:

(i) navigation factor [ ; ]: subjects performed less well in the active than in the passive navigation condition;

(ii) viewpoint factor [ ; ]: subjects performed better in the ground-level than in the aerial-level viewpoint condition;

(iii) “navigation × viewpoint” interaction [ ; ]: subjects performed better with active than with passive navigation in the ground-level viewpoint condition (i.e., a positive effect of the active condition), whereas the reverse was observed with the aerial-level viewpoint (i.e., a negative effect of the active condition).

3.2. Errors and Omissions in the Sketch-Mapping Task

The ANOVA revealed significant effects of the following:

(i) navigation factor [ ; ]: subjects performed better in the active than in the passive navigation condition;

(ii) viewpoint factor [ ; ]: subjects performed better in the aerial-level than in the ground-level viewpoint condition;

(iii) “navigation × viewpoint” interaction [ ; ]: subjects performed better with passive than with active navigation in the aerial-level viewpoint condition (i.e., a negative effect of the active condition), whereas the reverse was observed in the ground-level viewpoint condition (i.e., a positive effect of the active condition).

3.3. Errors in the Picture-Sorting Task

The ANOVA revealed significant effects of the following:

(i) navigation factor [ ; ]: subjects performed less well in the active than in the passive navigation condition;

(ii) viewpoint factor [ ; ]: subjects performed better in the ground-level than in the aerial-level viewpoint condition;

(iii) “navigation × viewpoint” interaction [ ; ]: in the passive navigation condition, subjects with a ground-level viewpoint performed as well as those with an aerial-level viewpoint, whereas in the active navigation condition, subjects with a ground-level viewpoint performed better than those with an aerial-level viewpoint (i.e., the active condition had a negative effect in the aerial-level viewpoint condition).

In summary, the ground-level viewpoint condition induced the best performance in the wayfinding and picture-sorting tasks, whereas the aerial-level viewpoint condition induced the best performance in the sketch-mapping task. The active navigation mode had a negative impact on all tasks in the aerial-level viewpoint condition. In contrast, active navigation had a significant positive effect on both the wayfinding and sketch-mapping scores in the ground-level viewpoint condition. Finally, active navigation did not significantly increase the picture-sorting score in the ground-level condition, although it produced the highest scores for this task.

4. Discussion

The purpose of this study was to evaluate the effect of passive versus active navigation and of ground-level versus aerial-level viewpoints on spatial knowledge in three different spatial recall tasks (wayfinding, sketch-mapping, and picture-sorting). We expected the best performance in the wayfinding task to be obtained with active/ground-level navigation. Concerning the laboratory tasks, we hypothesized that active/ground-level navigation would facilitate performance in the picture-sorting task (egocentric frame-based) and passive/aerial-level navigation in the sketch-mapping task (allocentric frame-based).

Firstly, in terms of viewpoint manipulation, the findings were as expected in the various tasks. Indeed, our study demonstrated that the ground-level viewpoint condition induced the best performance in both the wayfinding and picture-sorting tasks (egocentric frame-based tasks), whereas the aerial-level viewpoint condition induced the best performance in the sketch-mapping task (allocentric frame-based task). This may be due to the well-established principle of transfer-appropriate processing [40], whereby learning performance is optimized when the learning and recall conditions match closely. For the wayfinding and picture-sorting tasks, exploring the virtual district from the ground-level viewpoint matched the retrieval conditions reasonably well (i.e., reference points were perceived during learning). The egocentric representations developed by the subjects could therefore be used directly in the recall tasks. In the sketch-mapping task, the frame of reference provided by the aerial-level viewpoint (i.e., 4 meters above the ground) was closer to that required by the task (a bird’s-eye view). Participants in the aerial-level mode seem to have built up survey knowledge of the previously followed path more easily, which can explain the better performance in the sketch-mapping task of subjects who had trained with the aerial-level viewpoint.

These observations are consistent with studies in real environments, which demonstrated that different representations (i.e., egocentric versus allocentric) are involved depending on the learning source used and, more precisely, the learning viewpoint (e.g., direct navigation for an egocentric viewpoint and a map for an allocentric viewpoint) [6–9]. Our findings are also consistent with studies in VR systems, supporting the assertion that learning in ground-level mode involves principally egocentric representations and facilitates an understanding of the route, while, in contrast, learning in aerial-level mode involves principally allocentric representations and promotes the acquisition of survey knowledge [36]. In general, this first result confirmed that the viewpoint displacement metaphor is an important variable in VR systems, owing to its mediating effects on the spatial representation and understanding of the explored VE [35].

Secondly, regarding the interaction between the navigation and viewpoint factors, our results confirmed our hypothesis that active navigation is mainly beneficial in the ground-level mode for egocentric frame-based recall tasks. Indeed, the combination of active navigation and the ground-level viewpoint enhanced performance in the wayfinding task and, to a lesser extent, in the picture-sorting task. Active/ground-level learning markedly increased the subjects’ performance in these two egocentric frame-based tasks compared to the other combinations (i.e., active/aerial, passive/ground, or passive/aerial) and facilitated skill transfer from the virtual to the real environment in the wayfinding task. In contrast, combining the aerial-level viewpoint with the passive navigation mode led to better performance in the sketch-mapping task. As previously mentioned, the aerial-level viewpoint facilitates the creation of allocentric representations, and our findings indicated that this effect was enhanced in the passive condition. This is probably why the aerial-level/active combination appears to interfere with the development of a mental representation adequate for the sketch-mapping task. We conclude that, like the aerial-level mode, the passive navigation mode allows subjects to focus their attention on a global understanding of the environment, as previously asserted by Witmer et al. [36]. Another explanation for the interference of active navigation in the aerial perspective may be that the activity proposed in our experiment (i.e., with a joystick) does not provide complete idiothetic information (as a treadmill or a step-exercise device would), which seems to play a key role in the survey strategy [33]. In future studies, it will be interesting to explore the relation between active navigation and the cognitive processes involved in spatial learning. Indeed, spatial learning draws on many interacting cognitive processes, including attention, working memory, and the active mental manipulation of spatial information [33].

Overall, these results demonstrate that active navigation is mainly beneficial when associated with a ground-level viewpoint (i.e., an egocentric frame) in the learning phase and with egocentric frame-based recall tasks. This confirmed the hypothesis that the benefits of active navigation are linked to egocentric frame-based situations, in which active navigation facilitates the establishment of accurate perception-action coupling. In this way, sensory-motor activity potentiates the correlation between visual (i.e., egocentric views of landmarks) and motor (i.e., direction changes) inputs, which is likely to be particularly beneficial for route knowledge acquisition (i.e., the second level in the L-R-S framework). This confirms the need to match viewpoints between learning (ground-level viewpoint) and recall (egocentric frame-based tasks) for active navigation to be beneficial.

Our study also confirms that the outcome of active-passive comparisons may depend on the type of spatial recall task and the measure employed (e.g., [13, 33]). These results provide an explanation for the discrepancies in the literature. Indeed, the beneficial effect of active navigation in VE studies has not been clearly established: some authors have reported a positive impact on spatial learning performance [18–20, 24–26], while others have failed to find one [23, 27–30]. All of these studies used a ground-level viewpoint during learning but different frame-based tasks (i.e., egocentric or allocentric). For example, Carassa et al. [32], who reported a positive effect of activity on route learning, used a wayfinding task (i.e., an egocentric frame-based task). In contrast, Gaunet et al. [28] and Wilson and Péruch [29], who found that active navigation did not have a positive effect, used a sketch-mapping task (an allocentric frame-based task).

5. Conclusion

To conclude, our results contribute relevant information to the passive/active debate. These findings offer a possible explanation for discrepancies in the literature by confirming that active learning is particularly beneficial in egocentric frame-based situations (learning with a ground-level viewpoint and recall through a wayfinding task) compared to allocentric frame-based situations. However, further investigation is required to clarify the strategies and spatial knowledge brought into play by the navigation mode (active/passive) and the role of sensory-motor activity in potentiating the coupling between visual and motor information, and so promoting the transfer of spatial abilities. Particularly relevant applications may be envisaged in the field of cognitive assessment and neuropsychological rehabilitation for the many medical conditions involving orientation and spatial memory difficulties [41]. In this context, it is highly relevant to identify the situations most likely to optimize the transfer of learning from virtual to real environments and to clarify the level of representation and the spatial skills altered or preserved, depending on the type of pathology.

Conflict of Interests

The authors report no conflict of interests. The authors alone are responsible for the content and writing of the paper.