Presence 1999

Learning and Building Together
in an Immersive Virtual World

Maria Roussos, Andrew Johnson, Thomas Moher
Jason Leigh, Christina Vasilakis, Craig Barnes

Electronic Visualization Laboratory (EVL) and
Interactive Computing Environments Laboratory (ICE)
University of Illinois at Chicago, 851 S. Morgan St., Room 1120,
Chicago, IL 60607, USA
(312) 996-3002 voice, (312) 413-7585 fax
nice@ice.eecs.uic.edu, www.ice.eecs.uic.edu/~nice

Abstract

This paper describes the design, evaluation, and lessons learned from a project involving the implementation of an immersive virtual environment for children called NICE (Narrative-based, Immersive, Constructionist / Collaborative Environments). The goal of the NICE project was to construct a testbed for the exploration of virtual reality as a learning medium within the context of the primary educational reform themes of the past three decades. With a focus on informal education and domains with social content, NICE embraces the constructivist approach to learning, collaboration, and narrative development, and is designed to utilize the strengths of virtual reality: a combination of immersion, telepresence, immediate visual feedback, and interactivity. Based on our experiences with a broad range of users, the paper discusses both the successes and limitations of NICE, and concludes with recommendations for research directions in the application of immersive VR technologies to children's learning.

1 Introduction

There are good reasons to presume that the application of virtual reality (VR) technologies to children's conceptual learning is, in the words of Fred Brooks, "rank foolishness" (Brooks, 1998). To date, there exists no clear evidence that VR brings "added value" to learning in children; historical experience with other media offers scant hope for powerful effects (Clark, 1983; Cuban, 1986). Even if overwhelming evidence of effectiveness were available, the prohibitive costs of VR technologies and concomitant staff development, operations, and maintenance would find no place in dwindling school budgets overwhelmingly dominated by human resource costs. Price/performance issues aside, there remain strong objections among educators and developmental psychologists regarding the appropriateness of "virtual" experiences for children (Cuban, 1986).

Yet, in spite of these concerns, there remain compelling reasons for believing that VR learning environments for children warrant serious investigation. There is general agreement that VR can have strong motivational impact (Bricken, 1991); ongoing efforts at characterizing phenomena such as immersion and presence are beginning to clarify these effects (Winn, 1993; Slater and Wilbur, 1997). VR affords opportunities to experience environments which, for reasons of time, distance, scale, and safety, would not otherwise be available to many young children, especially those with disabilities (Cromby et al., 1995). Early exposure to virtual environments may both leverage the well-known efficiency and capacity of children's learning and provide advance organizers for later learning experiences (Dede, 1998). Usability issues which plague adult VR users may prove less problematic among children, who both easily adapt to graphic and conceptual abstraction (in cartoons and comics) and who often have extensive experience in navigation 3-D spaces and discovering and exercising interface affordances (Provenzo, 1991).

In this paper we describe our experience in the development and assessment of the distributed virtual reality environment NICE (Narrative, Immersive, Collaborative/Constructivist Environment), designed to support children's learning of simple relationships between plant growth, sunlight, and water. NICE implements a persistent virtual garden in which children may collaboratively plant and harvest fruits and vegetables, cull weeds, and position light and water sources to differentially affect the growth rate of plants. NICE has been operational since July, 1996, and has now been "visited" by well over 300 users from around the world.

We begin with a brief survey of the use of virtual reality technologies in support of learning. Next, we describe the NICE world, the learning and pedagogical themes which informed its design, and briefly discuss its implementation. A major portion of the paper is devoted to user experience and formal assessment of NICE as a learning environment. Finally, we close with a discussion of lessons learned from the NICE project, and how our experience with NICE is shaping our future research directions.

2 Children's Learning in Immersive Virtual Environments

Research in conceptual learning and virtual reality is a relatively young field, but growing rapidly. In a recent report by the Institute for Defense Analysis, Christine Youngblut comprehensively surveys work over the past few years in the area, citing approximately 50 VR-based learning applications which include desktop but exclude text-based virtual environments (Youngblut, 1998). We restrict our focus here to those projects involving immersive VR technologies applied specifically to elementary and middle school children's learning.

The Human Interface Technology Laboratory (HITL) at the University of Washington has been one of the early educational seedbeds for VR, with projects such as the Virtual Reality Roving Vehicle (VRRV) (Rose, 1995; Winn, 1993) and summer camp programs in VR for students (Bricken and Byrne, 1993). The VRRV project was experienced by a large number of students, while the summer camps focused on "world-building" activities, where students conceived and created the objects of their own virtual worlds, using 3D modeling software on desktop computers. Although this gave the opportunity for students to understand the process involved in creating a virtual setting, the actual immersive experience was limited to a short visit of the pre-designed virtual worlds (10-minute VR experiences), making it difficult to come to conclusions on the value of a virtual experience itself for conceptual learning.

The NewtonWorld and MaxwellWorld ScienceSpace projects developed by researchers at George Mason University and the University of Houston (Dede et al, 1996) provide immersive learning environments in which students may explore the kinematics and dynamics of motion, electrostatic forces, and other physics concepts. Formative evaluation studies of these virtual worlds have been conducted with respect to their usability and learnability. These studies report on learners' engagement, surprise and understanding of the alternative representations of the concepts provided in the ScienceSpace worlds (Dede, et al., 1996). Limitations and discomfort caused by the current VR head-mounted displays hindered usability and learning. On the other hand, multisensory cues, multimodal interaction, and the introduction of multiple new representations is believed to have helped students develop correct mental models of the abstract material.

Researchers at The Computer Museum developed an immersive VR application designed to teach children about the structure and function of cells (Gay and Greschler, 1994). In the application, users were asked to construct cells from component parts, with successful completion indicated by an animation of internal cell function. In a comparison between immersive and non-immersive treatment groups, immersive subjects (children and adults) demonstrated better retention of symbolic information (remembering the names and functions of the organelles), and indicated more interest in taking a free biology class as a result of the experience.

Another exhibit-based research project, the Virtual Gorilla project (Allison, et al., 1997) recreates the Gorilla Exhibit at Zoo Atlanta, allowing users to adopt the role of an adolescent gorilla, navigating the environment and observing other gorillas' reactions to their approach. While no formal assessment has been reported, interviews with users elicited favorable responses in the sense of immersion, enjoyment, and successful communication of learning goals.

The above virtual worlds have been implemented to support only one (physically present or remote) student at a time. To our knowledge, the NICE project is the first immersive, multi-user learning environment designed specifically for children.

3 The NICE project

The Narrative Immersive Constructionist / Collaborative Environments (NICE) project is an exploratory learning environment for children between the ages of 6 and 10 (Roussos et al., 1997; Roussos et al., 1997b, Johnson et al., 1998.) The children's main activity in NICE is to collaboratively construct, cultivate, and tend a healthy virtual garden (see Figure 1.) This activity takes place in a highly graphical immersive virtual reality system called the CAVE (TM) (Cruz-Neira et al., 1993.)

The CAVE is a multi-person, room-sized virtual reality system consisting of three walls and a floor. All users entering the CAVE wear special lightweight stereoglasses, which allow them to see both the virtual and the physical world unobtrusively, and use a light-weight hand-held device, called a wand, for interaction (see Figure 2.) As the CAVE supports multiple simultaneous physical users, 5-6 children can participate in the learning activities at the same time. A similar but smaller VR system, the ImmersaDesk(tm), consists of one back-projected panel tilted at a 45-degree angle and resembles a drafting table.

The NICE garden was originally designed as an environment for young children to learn about the effects of sunlight and rainfall on plants, the "spontaneous" growth of weeds, the ability to recycle dead vegetation, and similar simple biological concepts that are a part of the life cycle of a garden. Since these concepts can be experienced by most children in a real garden, the NICE garden provides its users with tools that allow its exploration from multiple different perspectives. In addition to planting, growing, and picking vegetables and flowers, the children have the ability to shrink down and walk beneath the surface of the soil to observe the roots of their plants or to meet other underground dwellers. They can also leap high up in the air, climb over objects, factor time, and experience firsthand the effects of sunlight and rainfall by controlling the environmental variables that cause them.

Familiar methods of interaction are employed, which eliminate the use of menus and instead use simple visual metaphors. The wand has a joystick for navigation and three buttons: one for picking and planting, one for changing size, and one for leaping. In the garden there are several crates of seeds for the children to choose from. Using the wand, a child can pick a seed from a crate and drop it onto the soil. The corresponding vegetable will then begin to grow. The children must make sure the plants are not too close together, and that they get enough water and sunlight. Using the same pick-and-place action, they can water their plants by pulling a raincloud over them, provide sunlight with the use of the sun, or clear the garden weeds by recycling them in the compost heap. The symbolic representations of the various environmental elements as well as instant feedback are used to facilitate the learner's understanding of the biological relationships which take place in the garden. Thus, when the raincloud has been over a plant for too long, the plant holds an umbrella; when it's too sunny, it wears sunglasses, and so on.

The garden is persistent in that it continues to evolve, so the participants can return and check on its progress at a later time; the current garden has been growing for 2 years. In addition to the garden, the children have a whole island to explore: they can climb down a dormant volcano to access the catacombs beneath the island, look for fish in the sea, or see their own reflection in the water.

NICE supports real-time distributed collaboration. Multiple children can interact with the garden and each other from remote sites. Each remote user's presence in the virtual space is established using an avatar - a graphical representation of the person's body in the virtual world (see Figure 2.) The avatars have a separate head, body, and hand, which correspond to the user's actual tracked head and hand motions. This allows the environment to record and transmit sufficiently detailed gestures between the participants, such as the nodding of their heads, the waving of their hand, and the exchange of objects. Additionally, voice communication is enabled by a real-time audio connection.

NICE represents an explicit attempt to blend several learning and pedagogical themes within a single application. These themes: constructionism, exploratory learning, collaboration, and the primacy of narrative, reflect several of the most important educational reform themes of the past three decades.

Figure 1. A child (represented by an avatar) planting in the NICE garden

3.1 Constructionism and Exploration

The design of NICE supports the constructivist view that learners assimilate knowledge by engaging in self-directed learning activities which are accomplished through constructive tasks (Dewey, 1966; Papert, 1980.) The approach to constructionism taken by NICE echoes Papert's ideas in two ways: first, the learners can craft the environment within the virtual world. The activities of planting and tending of the garden entail making, manipulating, and exploring objects, systems, and ideas. The plants are simple agents with common rules of behavior based on simplified ecological models. They contain a common set of characteristics that contribute to their growth, such as their age, the amount of water and sunlight they need, and their proximity to other plants. The combination of these attributes determines the health of each plant and its size. The children gradually discover these relationships aided by the direct feedback provided.

Figure 2. Eddie interacting with the NICE garden in the CAVE

Second, the learners can construct something meaningful to them, such as the narrative.

3.2 Narrative

Papert believes that learning takes place when engaged in the construction of a personally meaningful artifact, such as a piece of art, a story, or an interactive computerized object (Papert, 1980.) The constructive artifact in NICE is in many ways the garden itself, as well as the stories formed by the kids that participate. Our original intentions for the narrative development in NICE stemmed from an earlier project, the Graphical Storywriter (Steiner and Moher, 1992), a shared workspace where young children can develop and create structurally complete stories.

The stories developed in NICE differ in that they do not achieve closure, rather they continue to evolve along with the garden. Every action in the environment adds to the story that is being continuously formed. The narrative revolves around tending the garden and the reactions or decisions taken while interacting with the other characters. These interactions are captured by the system in the form of simple sentences such as: "Amy pulls a cloud over the carrot patch and waters it. The tomatoes complain that they are not getting enough water." This story sequence goes through a simple parser, which replaces some of the words with their iconic representations and publishes it on a web page (see Figure 3.) This gives the story a picturebook look that the child can print to take home. As a tangible product of the virtual experience, this visual output is intended as a way to strengthen the interest and motivation of a student and not so much to challenge reading and writing skills. It is, however, possible that with further development, a predominantly visual medium such as immersive VR can provide a valuable environment for literary experiences.

Figure 3. a NICE story on the web

3.3 Collaboration

One of the most important purposes of an educational environment is to promote the social interaction among children located in the same physical space. Theories that emphasize the importance of social interaction in cognitive growth (Vygotskii, 1978) suggest that successful collaborative learning involves more than the final creation of a learning product. Learning that is contextualized in a social setting may involve verbal interaction, collective decision making, conflict resolutions, peer teaching, and other group learning situations characteristic of a classroom setting. With the use of VR technology that supports multiple users in the same physical space, as well as appropriate interaction techniques, a number of kids can participate in the learning activity at the same time, without having to take turns or wear heavy and intrusive hardware devices.

In NICE, the construction of the environment may foster collaboration. The power of the user to modify the environment is manifested on multiple levels, covering the spectrum of available interface options, from bodily to visual to textual representation. Through the use of avatars, geographically separated learners are simultaneously present in the virtual environment. The ability to connect with learners at distant locations, enhanced by visual, gestural, and verbal interaction can be important to the development of unique collaborative experiences for both the students and the educators. Teachers or parents can participate, either as members of the groups, or disguised as characters in the environment. This allows teachers to mentor the children in person or to guide parts of the activity from "behind the scenes", acting as simulated virtual characters. They can also determine the pace at which the world evolves; they may choose to see the plants grow very quickly, or, in the case of a school project, extend their growth over the period of a semester.

We explored this notion of a teacher-avatar in the studies we performed with students, as mentioned later.

3.4 Implementation

The growth of the plants is handled by a central garden server which can be run on any centrally located computer chosen to maintain the virtual world. The NICE garden is persistent, as the garden server is constantly running and will attempt to reestablish lost connections in the event of failure. This allows users to casually join in the collaboration whenever they wish. The garden continues to evolve even if no users are present: the plants continue to grow; animals may try to eat the vegetables; and weeds slowly take over the garden crowding out the other plants.

Since the CAVE library can support heterogeneous VR display devices (ImmersaDesk(TM), InfinityWall(TM), BOOM(TM), fish-tank VR systems) a large number of participants can join in the collaboration from a number of different VR hardware platforms. Multiple distributed NICE applications running on separate VR systems are connected via the central garden server which guarantees consistency within the shared virtual environment. In practice this has been tested successfully with as many as 16 simultaneous participants on three continents (Johnson, Leigh, and Costigan, 1998.)

The networking architecture for the NICE application was based on previous experience with CALVIN, a networked immersive collaborative environment for designing architectural spaces (Leigh and Johnson, 1996.) The networking protocols selected were tailored towards the characteristics of VR data, and the ability to enter and leave the environment easily from anywhere on the Internet. The networking component also allows clients other than virtual reality interfaces participate. For example, a recorder client can be connected to the network that records all of the movements and interactions in the virtual environment. This allows the session to be replayed later during evaluation studies. Monitoring clients can be connected to monitor the state of the NICE island and the state of the network. Web-based clients can also connect and cooperate with the VR clients, as explained in the next section.

4 Extending the Virtual Environment

Interactivity in NICE is augmented by providing possibilities to interact with the virtual world without being inside it. The children can check on the progress of the garden from a desktop computer with a web-browser and an Internet connection, seeing who is currently working in the garden and how the various plants are growing (see Figure 4.) They can converse with the other virtual and remote participants by typing in the provided text window - a feature that resembles text-based virtual environments. This feature is currently being enhanced with audio so the desktop users can talk to the immersive VR users.

Figure 4. A view of the NICE garden and chat window on the web

Children without access to the virtual reality system but with a personal computer and access to the web, may even create their own objects and characters to populate the virtual world. These models are downloaded by the NICE system automatically and in real-time when the users join in to the collaboration. We are currently working on a VRML interface to the garden itself allowing desktop VRML users to interact with the immersive VR users (see Figure 5.)

Using a Java applet written by Robert Stevenson, students interacting with a two-dimensional version of NICE on the Internet can simultaneously share and manipulate the same virtual space as the children in the CAVE. The users of the 2D environment use a `traditional' mouse and icon interface to interact with the garden, but have the same ability to pick and plant as the VR users do. These desktop users see the virtual reality users as 2D icons on their screen, while the VR users see the desktop users as 3D avatars in the space (see Figure 6.)

Finally, a current prototype is a two-dimensional interface where the child, by clicking and dragging icons, can manipulate the ecological model and observe immediate effect on the growth of the plants and vegetables in the three-dimensional VR environment. We envision these two-dimensional interfaces as a kind of visual language, allowing the virtual reality worlds to be easily programmed by children. Visual languages have shown to be ideal programming languages for young learners because they can map abstract concepts to pictorial elements that they are more familiar with (Soloway, 1996) and can be learned quickly.

Figure 5. An interface to the NICE
garden using a VRML browser

Allowing web-based and desktop participants access to the virtual environment provides several advantages for both the users as well as the researchers and educators. Virtual reality hardware is expensive and inaccessible to the public. Even when the technology becomes cheaper and more accessible, the time that a child can spend in an immersive virtual environment will still be limited. The web-based component allows children to sustain their interaction with the virtual world beyond the limited time they can spend in the virtual environment itself. It also allows educators and researchers to participate and evaluate the experience easier. Additionally, this approach holds promise for social interaction by students that are either geographically isolated or have special needs.

5 Evaluation

It is important to investigate the educational efficacy of VR in specific learning situations and broader learning domains, and to develop new rubrics of educational efficacy that compare it to other approaches. In practice, however, the assessment of VR technology has been focused primarily on its usefulness for training rather than its efficacy for supporting learning in domains with a high conceptual and social content (Dede et al., 1996; Whitelock et al., 1996.)

The education world would argue that using paper and pencil, in the form of standardized tests, is not an effective way to evaluate a virtual learning experience. As VR is a dynamic learning tool, evaluation should be tightly coupled with the actual learning process. Following the authentic assessment model, learning in constructivist environments is directly related to its evaluation (Reeves and Okey, 1996.) Moreover, considering the immature nature of the field at this time, it is important to apply multiple measures of learning and performance (Rose, 1995.)

Figure 6. A two-dimensional interface to the NICE garden using a Java applet

Virtual reality itself has great potential as a tool for assessment. Networked virtual reality systems can embed methods for facilitating learner's discourse while in the environment. Mentors, disguised as virtual characters, serve as guides and evaluators: to answer questions, direct action, ask for clarification, prompt for interpretation. In addition to recording data such as video and audio while in the virtual environment, it is also straightforward to have one of the networked clients act as a recorder, allowing the entire virtual reality session to be played back in 3D for further reflection and interpretation (Johnson et al., 1998). This form of assessment, embedded in the learning process, can provide meaningful reflections on learners' skills and knowledge.

5.1 Conceptual Framework

Of particular interest to us was the exploration of the effects of the NICE virtual environment as well as the overall educational efficacy of virtual reality learning experiences. As a first step, we developed an evaluation framework meant to serve as a prototype for a general evaluation framework. The exploratory nature of this study required a sound conceptual framework that would encompass, rather than restrict, the multiple dimensions of the issues that need to be examined in a virtual learning environment. Taking into account the multidimensionality of learning as well as virtual reality as a field, a number of technical, orientational, affective, cognitive, pedagogical, and other aspects were included (Lewin, 1995.)

The technical aspect examines usability issues, with respect to interface, physical problems, and system hardware and software.

The orientation aspect examines the relationship of the user to the virtual environment, including navigation, spatial orientation, presence and immersion, and feedback issues.

The affective parameter looks at the user's engagement, likes and dislikes, and confidence in the virtual environment.

The cognitive aspect identifies any improvement of the subject's internal concepts through this learning experience. We tried to evaluate the cognitive parameter in part from within the environment, with the given learning task built into the experience. In NICE, for example, the teacher-avatar can give goals to the users or ask them questions (e.g., plant and harvest a row of tomatoes). The responses to these activities may reveal what the user understands about the environment while inside it.

Finally, the pedagogical aspect includes the teaching approach: how to gain knowledge effectively about the environment and the concepts that are being taught - in this case, ecology or earth science. With respect to NICE, this aspect is examined in the context of collaboration between students or between teacher-avatars and students. The evaluation framework is summarized in Table 1.

Framework Category	Issue	Measurement
Technical	Usability	Time to learn an interface, comprehension of instructions, physical and emotional comfort
Orientation	Navigation, spatial orientation, presence and immersion, and feedback	Time to become immersed and comfortable in the environment
Affective	Engagement, preference, and confidence	Length of engagement, time to reach fatigue, reported and perceived enjoyment
Cognitive	Conceptual change, new skill	Performance within and outside the environment, think-aloud and stimulated recall techniques, oral and written surveys, video documentation
Pedagogical	Content general and specific teaching techniques	Collaboration (e.g., turn-taking, conflict, interaction), avatar acceptance, comparison of techniques
Collaborative VR	The added value of collaborative VR to instruction and learning	Comparisons of instruction and learning within and outside of collaborative VR environments

Table 1: Summary of Evaluation Framework

5.2 Methodology

The main study sessions were conducted with a total of 52 children: 44 second-grade children from an urban elementary school with an ethnically mixed student population; another 8 children from other schools participated in case studies after the classroom studies were completed. The gender distribution was equal: 26 boys and 26 girls. The activities at each evaluation session of NICE took approximately one to three hours to complete, depending upon whether the tests were conducted with groups or pairs of children. This included time to introduce the activity and organize the students, give them time to plan the activity beforehand, perform the activity inside the VR environment, and have some time for post-activity questions and discussion. The VR setting in all studies included the CAVE and one or two Immersadesks, all linked by an audio connection. The teachers were asked to evaluate the students in their class according to their reading and writing skills, leadership skills, and shyness. The children were then assigned to groups. We tried to keep the groups as equally distributed as possible by selectively matching and assigning the children with strong leadership skills or strong reading and writing skills to different groups. Each class of 22 students was divided into three teams of 7 to 8 students each.

Before beginning the VR experience, the children were asked to complete pretest question sheets. These initial questions attempted to identify each child's relationship to technology, familiarity with gardening, and understanding of simple ecological concepts. We wanted to establish what knowledge and understanding of the concepts displayed in the environment the children brought with them before the study.

After completing the questionnaires, each group of students was asked to generate ideas for planning their garden. A large piece of paper containing a top-down view of the garden was given to each group. Four rows of differently colored stickers, each one representing one of the four available vegetables, were provided. The children in each group had to plan where they would plant their vegetables by placing the stickers on the soil area of the garden. A total of forty vegetables were allowed (10 stickers for each kind). After the planning stage, the first team continued onto the CAVE and ImmersaDesk part, while the other teams remained in the room to continue their concept maps. Each team was split into two groups, one for the CAVE and the other for the Immersadesk. The two groups collaborated remotely, represented by the avatar of the leader of each group. The leader was assigned randomly by the researchers, to avoid conflicts during the experience in VR. The leaders were instructed in the use of the wand and were allowed a 10-minute period to practice navigation. Each session lasted for an average of 30 minutes. In addition to the two avatars sharing the same virtual space, an adult acting as teacher was disguised as a girl avatar and was guiding the groups from another Immersadesk. This teacher-avatar was also responsible for keeping the time, keeping the children focused on their planting task, helping them accomplish the garden planned on paper, and encouraging the two groups to think aloud. An audio connection between the three VR sites was established through the use of hidden ambient microphones. Out of a total of 8 groups for each classroom, 4 groups were of single gender (2 all-girls teams, 2 all-boys teams), and the remaining 2 were of mixed gender.

Following the virtual experience, an open-ended set of interviews was conducted with the children during which they answered an additional set of questions that related to their impressions and understanding of the environmental relationships in the NICE garden. The questions included space for open-ended responses and discussion with the researcher, regarding what the children did while in the environment, what they liked or disliked, and what they thought they learned.

After the interview, the groups returned to the room they started out from. Large pieces of white paper were placed on the tables, upon which the students could draw. They were asked to draw the gardens they just created in NICE. Similar activities also continued in their classrooms after the experiments. The teachers assigned homework to the students where they would describe the virtual reality experience and propose their own virtual worlds. Some of the children from the case studies returned a few more times to participate in NICE at a later time.

5.3 Observations

The observed results from the case and classroom studies have been grouped based on the theoretical framework defined previously. These observations have been collected by converging the multiple pieces of data gathered through observation, interviews and questionnaires, and are presented below.

Technical issues. The children exhibited diversity in their use of the interaction device, the wand. The instructions given depended on the situation and the environmental or personal distractions, so they were not exactly the same for everyone. Generally, these instructions started with showing the representation of the virtual hand to the leader, then the use of the joystick for navigation, and finally, once the child was able to move comfortably, the function of the buttons. Learning the functions of the wand lasted from 2 minutes, for the children that learned quickly, to 7 minutes.

After learning how to use the wand, the children's effort was focused on orientation, as noted in the following section. Limitations of the physical design of the wand caused discomfort to young users, as both hands were needed to reach the buttons and press the joystick at the same time. It was expected that the boys would generally be better at using the wand, partly because of their familiarity with similar input devices from playing video and computer games. According to both parents' and kids' reports, 92% of the boys play electronic games weekly, as opposed to 42% of the girls. The majority of these games have joystick-based interface devices. We did not notice, however, any gender differences in learning to use the wand.

A larger problem was the size of the stereo glasses. Despite the glass-ties used to tighten the glasses on the children's heads, the glasses would still fall off. Most children had to hold the glasses with their free hand and, when tired of holding them, would just take them off. Not only did this contribute to the subjects' fatigue, but also to their level of motivation and excitement. Since the stereo glasses and the wand are an integral part of the virtual experience, these limitations are a current hindrance not only to usability but also to learning.

The children's susceptibility to simulator sickness was not as large as expected. Less than 5% of the subjects complained about getting a headache or being dizzy during or after the experience, and for most it was so slight that they had not noticed until asked.

Evaluation of the system with respect to its robustness and cost effectiveness for broader use must be put off until the system is in a public locale. The NICE software is flexible enough to eventually expand into a user-authoring system. To be effective, however, it needs to be used by a small number of learners for an extended period of time.

Orientation. After learning how to use the wand, the children focused on trying to navigate and orient themselves in the virtual environment. With respect to the classroom groups, this proved to be the effort of the leader and not of the other children in the group, although their mission was to help the leader. The drivers were the only ones focused on the orientation task at hand, as they were the ones navigating, while the other children were distracted by the movement and the three-dimensional graphics. The girls seemed slightly better at orienting themselves in the environment, possibly because they were generally more focused and reserved compared to the boys. Even with the case studies, although not nearly to the same extent, there were times when the other child (the one not using the wand) would wander around, instead of observing or directing the driver's actions. While it was not expected that all children's full attention would be given at orientation, the result in these studies was that each child came up with their own version of the right direction, voiced them at the same time as the other children and confused the leader, who then individually decided which was the right path to take. As a result, apart from the difficulty in using the joystick for navigation, the leaders exhibited noticeable individual differences in their abilities to interact with the 3-D environment. These differences seemed to relate to their level of "independence": the ones pursuing their own goals did well, while the ones that attempted to listen to the others in their group ended up confused and disoriented.

A test for spatial orientation was the ability to find areas in the space, such as a hole that leads to the area under the garden. This was a relatively difficult task, although there were spatial clues: the passage was located near the only set of trees behind one of the garden fences. These were some of the instances where verbal interaction between the children and teacher seemed to work well, largely because the goal was very specific and required the kids' complete attention.

Another test for orientation was the concept map - the plan of the garden on paper. In the planning stage, students developed different strategies for planting. We wanted to see how they were able to implement this plan in VR. The case studies were more focused and, therefore, the children attempted to stick to their plan. With the exception of one boy in the initial study, the children were not successful at completing the task. Most children began planting as planned, but then changed their plans when running into difficulty. A younger girl who tried following the plan, commented that it was very hard to be precise in separating the vegetables. The teacher-avatar helped her with directions, but that "wasn't enough". The classrooms, on the other hand, hardly even tried to implement the plan, although constantly reminded by the teacher-avatar. Their entire experience was consumed by dealing with the group's behavior. None of the children admitted that they did not try; rather they stated that implementing the plan was a difficult task. One boy, after seeing the look of the group's final version of the garden asked his group: "how come we didn't get it right?" to receive the overwhelming response "because it was very hard!".

As perceived through observation, most kids felt immersed. This was indicated by their motion and excitement. Almost all children attempted to "touch" the virtual objects by moving and clasping their hands in the air. This was particularly noticeable in the case of the virtual beam that extended from the user's hand to help point to and select objects. As the beam was always attached to the hand and close to the user, it felt very "three-dimensional" to the children. Many leaders waved at the other avatars with the hand that was holding the wand, indicating that they understood the relationship between the wand, their real hand, and the virtual hand.

Affective. Measuring motivation is difficult, as it is indirect. Moreover, in the case of virtual reality, motivation is highly driven by other factors, such as the novelty effect, media hyperboly, and social issues. It is significant to look through these factors and try to identify whether the content taught within this medium is motivating for children, what it is that motivates them, and most importantly, for how long. This was difficult, as all of the children were excited before starting, just by the fact that they would experience virtual reality. Therefore, we had to look at their level of extended engagement during the actual experience.

The amount of time the children spent in VR ranged from 30 minutes to 1 hour and 30 minutes. Each classroom group, due to time constraints, remained in the experience for about 30 minutes. The case-study subjects, on the other hand, were allowed to stay until they displayed noticeable fatigue, at which point they were asked if they wished to continue. Most cases wished to remain in NICE for at least 45 minutes and started getting tired after one to one and a half hours.

Interactive activities ranked high amongst the preferences of the children, as shown by their responses in the post study questions. Planting was a favorite. An equal number of responses were in favor of the area under the garden. The fantasy was another fundamental driving force for many of the children. Many liked the water (or "swimming"), the rain, sun, umbrellas and sunglasses, and the vegetables. The three things that were most disliked by the children included "the stuff that we had to move with", the "glasses falling off", and the fact that some did not get to drive. Most (73%) of the children answered "nothing" to the question "what did you dislike the most?"

The most important issue related to motivation is control. As mentioned in the discussion of orientation, the children that were leading were more on-task and engaged, while all others were distracted and unfocused. This was also perceived, to a lesser degree, with the pairs of children in the case studies: the driver was focused on the task even if that meant only navigation, and was consequently more engaged, while the second child seemed less engaged. The post-experience questions verify these observations: Children that were leaders listed that what they enjoyed the most was being the leader, while most others that did not get that chance were very disappointed. Many of these observations are consistent with findings by other researchers in computer-based literature (Malone and Lepper, 1987).

Cognitive. Examining the cognitive value of a virtual learning environment is very difficult, as there are many other factors that correlate to learning, such as the ones described above. Particularly, distraction, fatigue, and cognitive overhead in mastering the interface influence the outcome. The classroom studies provide good examples of a situation in which all the above took place, and where one cannot derive any conclusions about conceptual learning. The results from the case studies are more promising, as the studies were more focused, prolonged, and with less noise and disorder.

However, even in the case studies, little can be concluded as far as learning is concerned. Confidence in using the interface does not necessarily signify understanding of the subject matter. One of the boys, for example, who reported playing many hours of video games per week, learned the interface very quickly and easily and had very good navigation and picking skills. After interacting with VR for about 40 minutes he was interviewed. During the interview and his post-study questions it was revealed that he had not perceived the effects of the sun and the rain on the plants, nor the function of the umbrellas and sunglasses. This was consistent with his pre-study test, which showed little knowledge of gardening concepts.

To simplify the understanding of the children's knowledge before and after the virtual experience, their responses were grouped into categories. For the pre-study test, three categories were devised according to the children's understanding of simple ecological relationships. The first category included the responses that displayed a very good understanding of gardening concepts: the plants need water and sunlight (i.e. good temperature), and good soil to grow, they wilt or look brown when they are sick, they wilt if they get too much water and dry out when they get too much sun, and the weeds need to be pulled out. About 12% of the subjects answered in this way. They were also the ones ranked high in reading/writing skills by the teachers. The second category included most of the above answers except for a few misconceptions (e.g. water is good but sunlight is bad for plants). 42% of the children's answers fit into this category. The third category included 44% of the responses, where more than one question included a "don't know" response or a wrong answer (such as "the plants grow down" when they get sunlight, or that weeds need to be planted and watered). Finally, one child could not answer most of the questions.

The answers to the post-study questions were grouped into categories based on the children's understanding of the NICE model: the plants display umbrellas when they receive too much water and sunglasses when there's too much sun, while the weeds are recycled in the compost heap. The responses here were more difficult to categorize, as many children had trouble synthesizing their learning during post-testing, due to fatigue or excitement, while others misunderstood the questions and answered in the same way as in the pre-test, not understanding that the post-questions pertained to the NICE garden in particular.

Approximately 17 children (35%) understood, for the most part, the NICE model. Of these 17, 13 were drivers, and all had done well in their pre-study questions. This shows that most of the leaders, children that were actively engaged in the task, understood the model of the NICE garden, whereas only a few of the other children perceived it. Approximately 45% of the children simply answered "they grew" to the questions "what happenned when you put the rain over the plants" and "what happened when you put the sun over the plants". Five kids answered that they did not know or see what happened while six kids were tired and did not answer at all.

Pedagogical. The children acted naturally while in NICE, just as they would have at a playground. They played, argued, listened, spoke loudly, and even rested. Very few were curious about the technology, excepting a girl asking if the screens were made of paper. The presence of "the computer" was not generally perceived by the children throughout the sessions. As one child put it, "I thought we were going to play with a computer, but this was different". This indicates that perhaps virtual reality may come closer to a "natural" medium for teaching, once technical and technology-specific problems are resolved.

Although children in these studies participated in the VR session longer than in any other educational VR study, it appears that this was not an important factor in the facilitation of learning. We do agree, however, with Dede (Dede et al., 1996) who reports that spreading lessons over multiple VR sessions appears to be more effective than covering many topics in a single session, as we attempted to do in our studies. Reviews and post-tests from their studies demonstrated that students were better able to retain and integrate information over multiple lessons. This is usually the case in school-based learning as well as being the main concept of life-long learning.

With respect to their pedagogical function in the NICE studies, collaboration and the narrative are explored further in the following sections.

Collaboration. The classroom studies were set up to encourage intra-group collaboration and inter-group competition, to ensure that each group had an incentive to focus on the task of creating a tended garden. However, none of these forms of cooperation occurred. After each group was split, one subgroup to go to one VR system and the other to the other, the children had to be continuously reminded by the teacher-avatar that they were still one group working on a common goal in the same garden. Most children, however, continued not to perceive this and regarded the other (remote) half of their group as their competitors. There were multiple instances of the two drivers fighting over who would grab the raincloud, and children from one location yelling at the ones in the other location to step out of "their" garden. As far as the classrooms were concerned, competition contributed to the excitement of the children in the group, but kept them
off-task and distracted them for nearly the entirety of the experience. Some of the groups even displayed a form of intra-group competition between the leader and other members. This related mainly to the control of the wand. Notable is the case of one girl who caused constant conflict because she was not the one chosen to be in control. The intent during these studies was to have only one child in each group control the wand. Our rationale for this was efficiency: it is easier and quicker to teach one subject than all, it is more efficient for one to control while others direct the activity, and it avoids fighting over who will do it.

On the other hand, this efficiency gain might not be helpful in terms of advancing all the students' learning. In the case of the other students, it was evident that the control over their learning and their experience was in the hands of the leader of the group. It was hoped that, in this way, the students would be able to pay more attention to the subject matter by leaving the control of the learning situation to the leader. For the child controlling, we supposed that this would not be an advantage, as it could lead to less attention to the subject matter and more to the task of controlling. As noted previously, the opposite was observed in these studies: the leader paid more attention to the subject than the other, less active members of the group.

Contrary to the classroom's behavior, the pairs of children in the case studies displayed excellent collaboration and no competition. In most cases, on-task communication was observed and there was general agreement on actions. Based on these observations, issues regarding the selection and number of members in a group of 2nd graders must be taken into account for a successful collaborative combination.

For both the classroom as well as the case studies, the teacher-avatar seemed to serve a helpful purpose, especially for giving the kids tips and keeping them on task. In terms of the classroom children, of course, the teacher-avatar consumed most of her time attempting to keep order - not unlike a real classroom.

The system's visual output (a printout of the narrative WWW page) was shown to each group during the interview to help the children reflect on their virtual experience. Each group was represented in the story by the avatar of the leader. Some children did not understand this until it was explained to them while showing them the narrative. Most were fascinated by the pictorial representations of the characters and vegetables and remembered what they were doing by looking at the story. It is believed that the iconic representation was helpful in giving the groups a general overview of their actions and is worthy of further exploration. An unanticipated function of the story was its use as a spelling aid by two children from different groups. When completing their questions, they consulted the story to find the spelling of certain object names.

6 Conclusions

In our view, the NICE project had a number of highly positive outcomes. There was ample evidence that the environment provided a strong sense of presence and immersion; one adult visitor to NICE commented that it was "the closest I've ever come to the feeling of being inside one of the cartoons I used to watch on televisions when I was a kid." NICE appeared to be a highly successful distributed virtual social space, particularly for those "drivers" who had full access to the input affordances. On the technical front, the NICE project provided a driving application for the development of the CAVERNSoft distributed virtual environment architecture (Leigh, Johnson, and DeFanti, 1998).

In retrospect, the most serious shortcoming of NICE is the inadequacy of its science model. In an attempt to engage children, we introduced elements (umbrellas, sunglasses, facial expressions) without natural analogs, and misrepresented naturally occurring features (e.g., root systems). These artifacts, deployed in a setting decontextualized from supporting discussion and instruction, may themselves have become the source of misconceptions regarding the underlying growth model we were attempting to teach. The balance among reality, abstraction, and engagement is particularly difficult to achieve; in this case, we likely veered sufficiently from reality to endanger the raison d'etre behind the project.

A second source of difficulty, in our opinion, drew from the open-ended exploratory nature of the environment itself. Instead of directing activity toward (and providing affordances for) the discovery of the underlying scientific knowledge, we assumed that the desired learning would take place naturally through exploration and discovery. This lack of directedness, both within the environment and in our task charges to users, combined with the novelty of the environment and usability issues associated with the learning of novel control affordances, appeared to obscure the intended learning goals in the eyes of the users.

Finally, collaboration itself proved a double-edged sword. The presence of avatars representing remote users was a strong spur to social interaction, again at the expense of the intended science learning. NICE supported collaboration through the provision of a shared virtual space, but did little to structure cooperative learning (Slavin, 1980; Johnson and Johnson, 1984) in a way that fostered positive interdependence among learners, or supported reflection and planning. Social interaction became an end unto itself, rather than a mechanism to support learning.

Researchers interested in learning in immersive virtual environments face a difficult challenge. On the one hand, there is a strong need for demonstrable "added value" to learning associated with the use of virtual reality technologies. In spite of our optimism regarding the ultimate broad availability of these technologies, there is little reason to bring VR technology to bear on learning goals that are already well met by conventional pedagogy. At the same time, however, it is difficult to conceive, much less conduct, an experiment whose results would be sufficiently generalizable to sway skeptics. Certainly there can be no experiment which ascribes specific learning value to the technology itself; the failure of an experiment to demonstrate added learning value would be due at least as much to the application as to the underlying technology.

There is a place for controlled experimental studies of learning in immersive virtual environments; we need more objective success stories. But the primary focus of this research domain, particularly in the case of younger children, should be directed toward the development and informal empirical evaluation of novel learning applications. In both cases, we believe that researchers should focus their attention on learning problems that meet four criteria:

1 The learning goal must be important. That is, it must be identified as a component of adult scientific (or other) "literacy," as reflected in national learning goals, standards, or benchmarks, such as those published by the National Council of Teachers of Mathematics or the American Association for the Advancement of Science (NCTM, 1989; AAAS, 1992.)

2 The learning goal must be hard. We probably don't need virtual reality to teach simple addition; in hindsight, we probably don't need it to teach simple facts about plant growth. Instead, we should focus on deep learning problems: learning which requires the rejection of inadequate and misleading models based on everyday experience, which have proven resistant to conventional pedagogy, and which are the source of persistent adult misconceptions.

3 The learning goal must be plausibly enhanced by the introduction of immersive virtual reality technologies. The most obvious plausible domains are those involving solid models; on its face, VR technologies would seem to offer more to learning about molecular models than the capitals of the U.S. states. But three-dimensional representation may not be the most important quality that VR brings to bear; there is reason to believe that the ability of VR to situate its users in an alternative cognitive frame of reference may be its most valuable contribution to learning.

4 Finally, VR-based learning environments must be informed by contemporary research in the learning sciences, by contemporary practice in education, and by the practical realities of school organization and funding. Research conducted outside these contexts runs the risk of irrelevancy.

Acknowledgements

We wish to thank all the teachers, students and their parents for participating in the user studies, the members of the original 'yet another world' group for their valuable discussions, and the all of the members of the Electronic Visualization Laboratory and Interactive Computing Environments Laboratory for their patience and support. We would especially like to thank Jim Costigan for his help.

The virtual reality research, collaborations, and outreach programs at EVL are made possible through major funding from the National Science Foundation, the Defense Advanced Research Projects Agency, and the US Department of Energy; specifically NSF awards CDA-9303433, CDA-9512272, NCR-9712283, CDA-9720351, and the NSF ASC Partnerships for Advanced Computational Infrastructure program. The CAVE and ImmersaDesk are trademarks of the Board of Trustees of the University of Illinois.

The continuation of this research is funded by an NSF Learning & Intelligent Systems grant, investigating how VR can be used to help teach concepts that are counter-intuitive given the learner's current mental models.

References

Allison, D., Wills, B., Bowman, D., Wineman, J., & Hodges, L. (1997). The Virtual Reality Gorilla Exhibit. IEEE Computer Graphics and Applications, November/December 1997, 30-38.

AAAS: American Association for the Advancement of Science (1992). Science for all Americans: A Project 2061 Report on Literary Goals in Science, mathematics, and Technology, Technical Report, AAAS Publication, Washington DC.

Bricken, M. (1991). Virtual Reality Learning Environments: Potentials and Challenges. Computer Graphics 25(3), 178-184.

Bricken, M. and Byrne, C. (1993). Summer students in VR: a pilot study. Virtual Reality : Applications and Explorations, Academic Publishers Professional, 178-184.

Brooks, F. (1998). Virtual Reality in Education: Promise and Reality panel statement. Proceedings IEEE Virtual Reality Annual International Symposium (VRAIS '98). 208.

Clark, R.E. (1983). Reconsidering research on learning from media. Review of Educational Research, 53, 445-460.

Cromby, J., Standen, P. and Brown, D. (1995). Using Virtual Environments in Special Education. VR in the Schools 1(3), 1-4.

Cruz-Neira, C., Sandin, D. J., and DeFanti, T.A (1993). Surround-Screen Projection-Based Virtual Reality: The Design and Implementation of the CAVE. Proceedings of ACM SIGGRAPH '93, 135-142.

Cuban, L (1986). Teachers and Machines: The Classroom Use of Technology Since 1920. New York: Teachers College Press.

Dede, C., Salzman, M., and Loftin, B. (1996). ScienceSpace: Virtual Realities for Learning Complex and Abstract Scientific Concepts. Proceedings IEEE Virtual Reality Annual International Symposium (VRAIS '96). 246-253.

Dede, C. (1998). Virtual Reality in Education: Promise and Reality panel statement. Proceedings IEEE Virtual Reality Annual International Symposium (VRAIS '98). 208.

Dewey, J. (1966). Democracy and Education. Free Press, New York.

Gay, E. and Greschler, D. (1994). Is Virtual Reality a Good Teaching Tool? Boston Computer Museum.

Johnson, D.W. and Johnson, R.T. (1984). Cooperative Learning. New Brighton, MN: Interaction Book Co.

Johnson, A., Roussos, M., Leigh, J., Vasilakis, C., Barnes, C., and Moher, T. (1998). The NICE Project: Learning Together in a Virtual World, Proceedings IEEE Virtual Reality Annual International Symposium (VRAIS '98), 176-183.

Johnson, A., Leigh, J., and Costigan, C. (1998). Multiway Tele-Immersion at Supercomputing â97. IEEE Computer Graphics and Applications, 18(4).

Leigh, J., and Johnson, A. (1996). Supporting Transcontinental Collaborative Work in Persistent Virtual Environments. IEEE Computer Graphics and Applications, 16(4), 47-51.

Leigh, J., Johnson, A. and DeFanti, T. (1998). Issues in the Design of a Flexible Distributed Architecture for Supporting Persistence and Interoperability in Collaborative Virtual Environments. Proceedings of Supercomputing â97.

Lewin, C. (1995). Test Driving CARS: Addressing the Issues in the Evaluation of Computer Assisted Reading Software. Proceedings of International Conference on Computers in Education, 452-459.

Malone, T.W and Lepper, M.R. (1987). Making Learning Fun: a Taxonomy of Intrinsic Motivations for Learning. Aptitude, Learning, and Instruction: Cognitive and Affective Process Analyses, Lawrence Erlbaum Associates,Hillsdale,NJ.

NCTM: National Council of Teachers of Mathematics (1989). Curriculum and Evaluation Standards for School Mathematics.

Papert, S. (1980). Mindstorms: Children, Computers, and Powerful Ideas. Basic Books, Inc., New York.

Provenzo, E.F. (1991). Video Kids: Making Sense of Nintendo. Cambridge, MA, Harvard University Press.

Reeves, T.C. and Okey, J.R. (1996). Alternative Assessment for Constructivist Learning Environments. Constructivist Learning Environments: Case Studies in Instructional Design, Educational Technology Publications.

Rose, H. (1995). Assessing Learning in VR: Towards Developing a Paradigm Virtual Reality Roving Vehicles (VRRV) Project. Technical Report TR-95-1, Human Interface Technology Laboratory - University of Washington.

Roussos, M. (1997). Issues in the Design and Evaluation of a Virtual Reality Learning Environment. Masterâs thesis, University of Illinois at Chicago.

Roussos, M., Johnson, A., Leigh, J., Barnes, C., Vasilakis, C., and Moher, T. (1997). The NICE Project: Narrative, Immersive, Constructionist / Collaborative Environments for Learning in Virtual Reality. Proceedings of ED-MEDIA/ED-TELECOM '97, AACE. 917-922.

Roussos, M., Johnson, A., Leigh, J., Vasilakis, C., Barnes, C., and Moher, T. (1997b). NICE: Combining Constructionism, Narrative, and Collaboration in a Virtual Learning Environment. Computer Graphics,31(3), New York. ACM SIGGRAPH. 62-63.

Slater, M., & Wilbur S. (1997). A Framework for Immersive Virtual Environments (FIVE): Speculations on the Role of Presence in Virtual Environments. Presence, 6(6), 603-616.

Slavin, R.E. (1980). Cooperative Learning. Review of Educational Research, 50(2), 315-342.

Soloway, E., Jackson, S., Klein, J., Quintana, C., Reed, J., Spitulnik, J., Stratford, S., Studer, S., Eng, J., and Scala, N. (1996). Learning Theory in Practice: Case Studies of Learner-Centered Design. Proceedings of ACM Conference on Human Factors in Computing Systems (CHI '96), New York. ACM Press. 189-196.

Steiner, K., and Moher, T. (1992). Graphic StoryWriter: An Interactive Environment for Emergent Storytelling}. Proceedings of ACM Conference on Human Factors in Computing Systems (CHI â92), New York, ACM Press, 357-364.

Vygotskii, L. (1978). Mind in Society: The Development of Higher Psychological Processes. Harvard University Press, Cambridge, MA.

Whitelock, D., Brna, P., and Holland S. (1996). What is the Value of Virtual Reality for Conceptual Learning? Towards a Theoretical Framework. Proceedings of European Conference on Artificial Intelligence in Education, 136-141.

Winn, W. (1993). A Conceptual Basis for Educational Applications of Virtual Reality. Technical Report TR-93-9, Human Interface Technology Laboratory - University of Washington.

Youngblut, C. (1998). Educational Uses of Virtual Reality Technology. Technical Report IDA Document D-2128, Institute for Defense Analyses, Alexandria, VA.