Introduction

A field trip is a school excursion that extends student learning beyond the classroom by exploring new environments. It is a long-standing tradition that has been acknowledged as an integral student-centered, interactive instructional method (Behrendt & Franklin, 2014). Under the auspices of the school, teachers and students embark on a journey outside the school building to develop an experiential connection between the concepts and ideas presented in various subject matters. Such on-site field experience offers students an authentic encounter with a natural environment, thereby becoming aware of the real world and its association with their classroom learning. This practical activity helps them in decoding the complex knowledge they are being taught in school (Bowker & Tearle, 2007; Kola‐Olusanya, 2005; Lebak, 2007; Skop, 2009). Moreover, field trips allow students to develop a greater vocabulary, an increased perception of learning, and a heightened interest in the outdoors (Hoisington et al., 2010). Researchers such as Knapp and Barrie, (2001), Hudak, (2003), Kisiel, (2006), and Nadelson and Jordan, (2012) have investigated knowledge acquisition and the permanent change in attitudes and behaviors of students that occurred during field trips. Their studies emphasized that well-organized field trips aligned with the school curriculum and designed to meet specific educational objectives resulted in successful cognitive, affective, and psychomotor learning outcomes.

Despite its many potentials, field trips are steadily declining and many educational institutions do not organize such experiential activities as much as before anymore. The common reasons for the disappearance of field trips include limited funding, time constraints, safety concerns, logistical issues, and difficulty controlling student behavior (Behrendt & Franklin, 2014; DeWitt & Storksdieck, 2008; Higginsa et al., 2012). As a cost-free alternative, Behrendt and Franklin, (2014) recommended campus field trips to maintain the benefits of a physical field trip (PFT). Although this type of field trip may be a viable alternative, it may be curtailed due to situations like COVID-19 in which schools are shuttered as a safety measure (Garcia, 2022; Pokhrel & Chhetri, 2021). To continue the tradition and ensure students acquire experiences that cannot be duplicated in the classroom, many schools are resorting to its virtual version (e.g., Evelpidou et al., 2021; Han, 2020; Seifan et al., 2020; Springer et al., 2020). Also titled virtual field guide or virtual excursion, a virtual field trip (VFT) is the exploration of digital worlds with the same educational intent as the PFT. It can take place in a range of digital platforms, such as single site exploration, curated collections (e.g., museum websites), or general exploration of the internet, which may be less structured and more self-determined by the student. Additionally, the technology used may also vary, including the use of augmented reality (AR) and virtual reality (VR). However, the potential benefit of immersing students in augmented or virtual worlds may be limited due to issues of access, cost, and other barriers. In the absence of a VR headset, Rupp et al., (2019) reported that it is better to use a non-VR device (e.g., smartphones) than a low-fidelity consumer VR (e.g., Google Cardboard). The latter increases simulator sickness, particularly on the disorientation and oculomotor subscales, making the user experience uncomfortable and thereby limiting the consideration of low-cost options.

Fortunately, the recent advancements in video technologies offer exciting new modalities to implement a VFT. An example of new technology is 360-degree (360°) video. The educational research on 360° videos is still in its infancy, as shown by a systematic literature review (Ranieri et al., 2020). Thus, the cognitive impact and student experience with a VFT need to be documented. In addition, most research on 360° videos focused on higher education, retaining a dearth of research on 360° videos used by students in primary education. Our research addresses the gap in the literature by exploring the possibility of using 360° videos as an alternative to PFTs in primary education. To measure the applicability of the platform, we adopted a switching replications experimental design to compare 360° videos (360V) with regular videos (REGV) as a mode of VFT across four-time points. We also examined the students’ content recall (assessment tests) and their perceptions of their VFT experience (attitude, perceived usefulness, involvement, inquiry, video engagement, and virtual guide). Our design addressed another literature gap by assessing the longer-term impact of school field trips (DeWitt & Storksdieck, 2008). Further, most of the VFT research we reviewed combined 360° videos with VR technology. Thus, it is unclear whether existing findings translate to using 360° videos alone, which are less immersive than 360° videos powered by VR platforms (Rupp et al., 2019).

Background of the Study

VFT: A Replacement for Physical Field Trips?

In the early experimentation with student engagement in a VFT, Spicer and Stratford, (2001) found students considered the virtual option as an enjoyable way to learn but did not think the virtual experience could be a substitute for a real field experience. Rather, they regarded VFTs as an enhancement to their fieldwork that offers a valuable indirect field experience and a way to empower physically or financially disadvantaged students (Stainfield et al., 2000). Prior VFT research involved using computers and digital visuality, such as hypertexts, videos, sound clips, and photographs, whereby students were passively browsing, watching, listening, and observing. An example is a VFT to Cumberland Island National Seashore (Hosticka et al., 2002) in which students passively observed someone else's actual field experience. Technology advances have led to increased interest in VFT usage in education, especially with the new opportunity for students to actively interact with the virtual world (Springer et al., 2020). Additionally, situations like the COVID-19 pandemic have prompted the reimagination of traditional field trips, making the VFT experience a more viable and acceptable alternative (Thönnessen & Budke, 2021).

Recent studies demonstrate that more technological advancements may be needed for VFTs to be widely accepted as equivalent to PFTs (Evelpidou et al., 2021; Seifan et al., 2020). Instead of serving as a replacement, VFTs are currently most commonly considered a supplement to PFTs that provide necessary pre-information before students visit an actual location (Seifan et al., 2019) or after an in-person field trip for better recall of experience (Harron et al., 2019). Çaliskan, (2011) also asserted that VFTs are useful when certain locations cannot be visited due to time, safety, weather, or other constraints. For instance, Evelpidou et al., (2021) designed a VFT to Corinthian Gulf with five stops (i.e., Cenchreae, Lechaeum, Lake Vouliagmeni, Diolkos, and Heraeon), a trip that would be prohibitive for most students. The authors also used the VFT to disseminate geoarchaeological and geomorphological information. A similar VFT was conducted by Jitmahantakul and Chenrai, (2019) where 360° VR environments were created to showcase landscapes and geological features in three dimensions. The current restrictions on public mass gatherings due to public health concerns (e.g., the spread of COVID-19), VFTs are positioned as viable replacements rather than its previous supplementary role to PFTs. Until such time that there are no more threats for the education sector to pursue returning to normal operations, VFT will serve as the primary means of conducting field trips across educational levels.

The Prevalence of 360° Videos in Education

Over the years, the use of video materials for teaching and learning has gained acceptance by teachers and students (Noetel et al., 2021). Many facets of education delivery (i.e., traditional, online, and hybrid) often include video integration for instruction. The wide array of instructional uses of videos include live streaming of lessons (Huang & Hong, 2017), playing video clips as part of the lecture (Kosterelioglu, 2016), and uploading recorded video lectures as a methodology for asynchronous education (Garcia & Yousef, 2022). With the continued advancements in video technologies, the extent of its pedagogical implementation is steadily increasing — with 360° video as one of its latest innovative forms. Also referred to as immersive videos, 360° video is a type of video content recorded in an omnidirectional form allowing viewers to control viewing direction while watching. Video recordings are captured using special camera equipment (e.g., Samsung Gear 360, Nikon KeyMission 360, and Kodak PIXPRO SP360) and edited (or stitched together) in post-production using standard video-editing software. The videos are still viewable using regular video players, which also have additional drag and drop functionality for panning the viewing perspective. Because viewers of 360° videos can use a mouse or keyboard to roll, pitch, and yaw to explore different parts of the scene, these videos are more immersive than traditional 2D videos albeit less immersive than 360° videos in a VR platform (Rupp et al., 2019).

Quite reasonably, there has been an increasing prevalence of 360° videos in education (most often with a combination of VR technology). Some pedagogical applications of 360° videos are cultural heritage virtual tours (Argyriou et al., 2020), supplemental materials in laboratory experiments (Ardisara & Fung, 2018), viewing modality of medical procedures (Arents et al., 2021), and safety skills teaching tool (Araiza-Alba et al., 2021). Most of the research on using 360° videos for instruction report no significant difference in student learning outcomes compared to traditional instructional approaches. For instance, comparing students’ knowledge recall between a conventional education group and the 360° VR video group in an Obstetrics and Gynecology internship curriculum revealed no significant difference (Arents et al., 2021). Similarly, teaching water-safety skills to children did not significantly differ whether using either traditional teaching mediums or 360° (Araiza-Alba et al., 2021). Both studies suggest that 360° videos are more effective in the affective rather than cognitive domain. Meanwhile, the 360° immersive video virtual tour developed by Argyriou et al., (2020) for the historical city of Rethymno, Greece elicited a high level of user engagement and a satisfying immersion experience. This result can be attributed to the design of the virtual tour, which was a strategic combination of experience (e.g., the flow of the story) and interaction (e.g., navigation in the virtual world). The results reinforce the importance of being mindful of the different facets of 360° videos that can influence learner engagement and knowledge acquisition.

Methodology

Research Design

We structured our research using a switching-replications experimental design in which an experimental (360V) and a control group (REGV) switched roles throughout the four rounds of experimentation. Customarily, this research design features only two rounds of the experiment (Trochim & Donnelly, 2008). We adjusted our experiment setup into four rounds to match the number of quarters in an academic year. In each quarter, we designated themes to follow the concept of a traditional field trip where multiple locations are visited. We consulted with the schools and they approved the selected themes before we carried out the interventions. The variation of themes and locations was part of our strategy to ensure that each group not only has more than one but also a unique VFT experience per intervention. Following this format, we performed an experiment before the end of each quarter of the academic year 2020-2021. See Table 1 for the sequence of the experiment themes and interventions.

Quarter VFT themes/locations Group 1 (G1) Group 2 (G2)
First (Q1) Farms and factories 360V REGV
Second (Q2) Museums and galleries REGV 360V
Third (Q3) Zoos and wildlife parks 360V REGV
Fourth (Q4) Historical sites and landmarks REGV 360V

Our design is not directly aligned with the structure of traditional field trips, which customarily happen once a year. We deviated from the traditional structure to strengthen the statistical power and control for threats to internal validity (Edmonds & Kennedy, 2017). This design modification allowed us to address the loss of control over essential resources (e.g., physical access to experimental settings) caused by the COVID-19 pandemic, which posed risks to internal validity and generalizability (Alsiri et al., 2021). Furthermore, we adopted this research to eliminate the necessity of denying participants (control group) a possibly beneficial treatment caused by random assignment. During these trying times when schoolchildren are confined within their homes, we believe that everyone deserves access to any intervention strategy. Moreover, while the random assignment is considered to be the most robust method for determining the impact of a particular treatment (Alferes, 2012; Shadish & Ragsdale, 1996), it is rarely possible in educational research (Davies et al., 2008). Fortunately, Edmonds and Kennedy, (2017) asserted that a switching replications experimental design works well for researching educational interventions where learning events are repeated at standard intervals throughout the year.

Participants and Procedures

We invited third-grade pupils (aged 8 to 9) from two private elementary schools in a metropolitan area in the Philippines to participate in our study. To have an equal participant distribution, we only recruited students from three sections in third-grade classes per school. The eligibility criteria of our study included students who voluntarily accepted to participate, have parental permission (informed consent was required), and with access to an Internet-connected device at home. The average class size of both schools was 30 pupils per section. All participants (n = 180) were eligible and participated. Before commencing the study, we oriented all teachers and parents on their specific roles. We performed the intervention assignment by school level, with all participating students engaging in either the 360V or REGV intervention.

Before joining a VFT, we asked students to complete a ten-item content recall test (pre-test). We administered the same assessment after the VFT (post-test) as well as a VFT experience questionnaire (VFT-XP). In the first round of the experiment (Q1), both groups attended a VFT, with G1 engaging in the 360V (experimental group) and G2 engaging in the REGV (control group). We replicated this experiment in Q2 but the groups switched roles, with G1 serving as a control group (by using REGV) and G2 as the experimental group (by using 360V). We administered a similar set of content recall tests aligned with the contents of this quarter. The experiments were replicated again for Q3 and Q4, switching the group roles accordingly and adjusting the assessments to align with the VFT content (see Table 1).

Interventions and Videos

The foci of the VFTs in our study are common locations of PFTs, including farms and factories (Q1), museums and galleries (Q2), zoos and wildlife parks (Q3), and historical sites and landmarks (Q4). Because it was impossible to shoot videos during the study, we relied on a combination of 360° videos from various online video sharing platforms for indoor shots and custom tour recordings using Google Earth for outdoor shots (see Figure 1). We edited and assembled all video materials in post-production using Adobe Premiere Pro. Like the discussions in PFTs, we added narration to share meaningful stories, interesting facts, and the VFT lessons. Although it is possible to transform the 360° videos and create VR videos, we purposely chose not to because the converted videos require a wearable VR headset to view. The mean runtime of video clips was one hour and two minutes. We uploaded all videos to Google Drive and shared the links with the cooperating teachers and parents during the orientation. We encouraged downloading the videos ahead of time, especially for those with slow internet connection.

Instrument Development

We developed and used two instruments to evaluate the interventions: multiple choice tests (with four possible answers) and the VFT-XP questionnaire. The purpose of the assessment tests was to evaluate the content recall of pupils before and after each VFT. As a team of two academic heads and six primary teachers, we engaged in a Modified Delphi method to develop eight content recall tests with ten items each (one per quarter with two variations each for pre- and post-test). The team determined the salient content, verified it was aligned with the videos, and ensured the language used in the tests was age appropriate for third-grade level students. On the other hand, the VFT-XP was intended to measure the virtual experience of pupils in a VFT. Because VFT is still in its infancy stage and there is no thorough investigation yet on factors that influence the success of a VFT, we created a custom instrument based on a variety of related studies. The final VFT-XP questionnaire (See Appendix A) has six subscales: attitude (Orion & Hofstein, 1994), perceived usefulness (Arents et al., 2021), inquiry (Behrendt & Franklin, 2014), involvement (Han, 2020), video engagement (Dobrian et al., 2011), and virtual guide (Lavie Alon & Tal, 2015). The Cronbach’s alpha values for the internal consistency of all constructs (attitude = 0.73, perceived usefulness = 0.71, inquiry = 0.81, involvement = 0.87, video engagement = 0.85, and virtual guide = 0.83) were greater than 0.70, indicating acceptable reliability (Taber, 2018).

Statistical Analyses

We analyzed the collected data using IBM SPSS Statistics 26.0. Customarily, a two-way mixed analysis of variance (ANOVA) should be used to determine if there are interactions between the groups over time, e.g., whether pupils’ content recall scores changed over time depending on their intervention (REGV or 360V). However, since the participants switched roles per quarter, it violates one of the basic assumptions of this statistical test: there should be no relationship between the observations in each category of the between-subjects factor (Seltman, 2018). Consequently, we employed a combination of parametric and non-parametric tests (for data that violates the normality assumption) for our analysis. We used paired-sample t-tests for our within-group analysis per quarter to determine whether there was a significant difference in content recall before and after VFTs (pre- vs. post-test). In addition, we used independent-samples t-tests and Mann-Whitney U tests for between-group analyses (G1 vs. G2) of both content recall and VFT-XP per each quarter and group. Lastly, we used one-way repeated measures ANOVA and Friedman Tests to determine whether there was a significant difference in content recall and VFT-XP per group across four-time points (Q1 vs. Q2 vs. Q3 vs. Q4).

Results

Content Recall

We conducted a series of paired-sample t-tests to determine whether there was a statistically significant mean difference in content recall scores before and after VFTs per each quarter. No outliers were detected through a visual inspection of boxplots, and the assumption of normality was not violated as assessed by Shapiro-Wilk's test (p > .05). As shown in Table 2, the results of our paired-samples t-tests revealed only one significant increase in content recall (Q3: Zoos and Wildlife Parks; p = 0.043). However, there was an interesting pattern in the results: REGV groups had higher mean content recall scores during pre-tests but were surpassed by 360V groups during post-tests for all quarters. This finding indicates that the increase in content recall scores of 360V groups after VFTs were consistently higher compared to REGV groups. On the first VFT (Q1), the mean content recall scores of G1 had increased by 1.20 points (95% CI, 0.910 to 1.490; t = 8.227) while G2 had increased only by 0.82 points (95% CI, 0.485 to 1.137; t = 4.942). After switching their roles (Q2), G2 had increased by 1.12 points (95% CI, 0.530 to 1.114; t = 5.592) while G1 had only increased by 0.82 points (95% CI, 0.821 to 1.423; t = 7.411). We observed similar findings on the succeeding quarters where where G1 (2.20 points; 95% CI, 1.822 to 2.578; t = 11.554) had a higher increase on mean content recall score compared to G2 (0.97 points; 95% CI, 0.705 to 1.228; t = 7.341) during Q3, and G2 (1.76 points; 95% CI, 0.466 to 1.023; t = 7.815) had a higher increase on mean content recall score compared to G1 (0.93 points; 95% CI, 0.671 to 1.196; t = 7.070) during Q4. The group with a 360V intervention had always a higher increase in mean content recall score after each VFT.

We also conducted a series of independent-sample t-tests to determine whether there was a statistically significant mean difference in post-test scores between the groups (see Figure 2). There were no outliers in the data as assessed by visual inspection, the assumption of normality was not violated as assessed by Shapiro-Wilk's test (p > .05), and variances were homogeneous as assessed by Levene's test (p values: Q1 = .405, Q2 = .885, Q3 = .309, Q4 = .097). The difference in post-test scores between the groups was only significant during Q4 (Historical Sites and Landmarks, p = .039). We continued our analysis by examining the results of a one-way repeated measures ANOVA to determine whether there was a statistically significant difference in post-test scores of each group over the course of four VFTs. While there were no outliers in the data and the assumption of normality was not violated, the assumption of sphericity was only met by G1 (χ2[5] = 8.58, p = .127) but not met in G2: (χ2[5] = 21.02, p = .000), as assessed by Mauchly's test of sphericity. Therefore, we applied a Greenhouse–Geisser correction for G2 (ε = .699). We found that VFTs elicited statistically significant changes in content recall over time for G1 (F[3, 267] = 50.84, p < .05, partial η2 = 0.36) and G2 (F[3, 267] = 135.70, p < .05, partial η2 = 0.60). This finding indicates that switching the intervention affects the assessment scores and this outcome is likely caused by the specific intervention assigned to the group.

VFT Experience

For the VFT-XP questionnaire, we conducted a series of Mann-Whitney U tests to determine if there were differences between the two groups in terms of attitude, perceived usefulness, inquiry, involvement, video engagement, and virtual guide per quarter. During Q1, median scores were only statistically significantly different in terms of attitude (U = 2,917, z = -3.365, p = .001), video engagement (U = 5,613, z = 4.638, p = .000), and involvement (U = 2,123, z = -2.292, p = .001). Similar findings were found on Q2 where median scores were statistically significantly different in terms of attitude (U = 3,423, z = -1.955, p = .024), video engagement (U = 6,199, z = 5.215, p = .000), and involvement (U = 3,339, z = 2.118, p = .019). Meanwhile, median scores during Q3 were statistically significantly different in terms of attitude (U = 3,432, z = -4.174, p = .000), perceived usefulness (U = 4,784, z = 3.967, p = .004), video engagement (U = 4,348, z = -3.929, p = .000), and involvement (U = 4,189, z = 3.378, p = .000). Lastly, median scores during Q4 were statistically significantly different in terms of attitude (U = 3,243, z = -4.238, p = .000), inquiry (U = 2,547, z = -3.216, p = .001), video engagement (U = 2,567, z = 2.583, p = .000), and involvement (U = 3,624, z = -2.368, p = .028). In Figure 3, we used the mean scores of VFT-XP across four quarters to show the between-group comparison.

To summarize, only three constructs (attitude, video engagement, and involvement) from the VFT-XP were consistently rated significantly differently by pupils in all quarters. Conversely, the virtual guide was the only factor that was consistently rated not significantly different in all quarters. We also found that pupils’ perceptions of perceived usefulness and inquiry were mixed and rated significantly different during Q3 and Q4, respectively. We continued our analysis by conducting a Friedman test to determine if there were significant differences in VFT-XP constructs over the course of four VFTs. Our results show that engaging in VFTs elicited statistically significant changes over time for G1 in terms of attitude (χ2(2) = 17.684, p = .001), involvement (χ2(2) = 24.149, p = .000), and video engagement (χ2(2) = 29.110, p = .000), and for G2 in terms of attitude (χ2(2) = 16.256, p = .000), involvement (χ2(2) = 18.299, p = .002), and video engagement (χ2(2) = 21.926, p = .042). Other constructs did not have significant changes.

Discussion

Cognitive Effects of Going into a VFT

The result of within-group analyses indicated that the increase in content recall scores of 360V groups after VFTs was consistently higher compared to REGV groups at all time points. Although, pupils from REGV learned as much except during a VFT on Zoos and Wildlife Parks (Q3), which echoes existing studies where no significant differences were found in the learning outcomes (Araiza-Alba et al., 2021; Arents et al., 2021). We partially attribute the consistent increase in content recall scores to 360° videos because of its capability to create authentic learning opportunities and engage pupils more realistically. The inclusion of such realism requires less active imagination of the world during the knowledge acquisition process, thereby reducing cognitive demands. Further, the engagement provided by a VFT allows learners to take more active self-responsibility for their learning process. Although not to the full extent, 360° videos provided an opportunity for pupils to make their own discoveries. Unlike existing studies (e.g., the VFT to Corinthian Gulf; Evelpidou et al., 2021) that only used Google Earth (outdoor scenes), one advantage of our study is that it incorporated 360° videos to show indoor shots of museums, farms, zoos, and others. We speculate that by allowing pupils to virtually travel from any starting point (e.g., school) to the actual field trip locations (e.g., zoo) including its indoor views, the experience becomes almost similar to traditional field trips. Despite 360° videos being less immersive than 360° videos in a VR platform, our experiment implies that 360° videos can still elevate the engagement and sense of immersion and authenticity while pupils are exploring in the virtual realm, which reiterates the findings of Argyriou et al., (2020).

The Quality of Experience in a VFT

Our results also show that VFTs elicited statistically significant changes in VFT experience for both groups over time. Meaning, that at all VFT sessions, the experience of each group varies depending on the allocated treatment (REGV or 360V). Nevertheless, only attitude, video engagement, and involvement from the VFT-XP were consistently rated significantly different by pupils in all quarters, while the virtual guide was the only factor that was consistently rated not significantly different. To elaborate, pupils have a more positive attitude towards VFT and are more likely to attend another session willingly when 360V is the mode of virtual excursion. As demonstrated by Orion and Hofstein, (1994), the fun factor is a determinant of a positive attitude towards field trips, and the immersive view functionality offered by 360V may have contributed to the amusement of pupils. This affective state could also explain why they were more engaged and involved since they can navigate the real world even in the comfort of their homes as compared to simply watching static videos. Although pupils may have been overwhelmed by the novelty effect of a VFT as mentioned in other studies (e.g., Zhao et al., 2021), the longitudinal nature of our experiment settles this uncertainty. Upon examination, these positive affective outcomes seem to be in a tug of war with the cognitive outcomes. That is, although not significant, the virtual guide factor was rated higher when REGV is the VFT mode. This finding suggests that pupils cannot focus on teachers’ narration on 360° videos because they were too engaged with navigating the videos. Because they were not attentive to the information being narrated throughout the videos, it may have affected their content recall scores where the assessments were based on. This deviation is further supported by our mixed findings on the inquiry construct, which measure the effect of convincing pupils to want to study and learn more.

Implications, Limitations, and Future Research

From a theoretical standpoint, our study contributes to the literature on exploiting 360° videos for educational purposes and implementing VFTs as a viable alternative to PFTs. Previous studies have emphasized that VFTs are not a replacement but only a supplement to PFTs that provide necessary pre-information before students visit an actual location (Seifan et al., 2019) or after an in-person field trip for better recall of experience (Harron et al., 2019). Although we share the same beliefs, the restrictions posed by the COVID-19 pandemic provided us with an opportunity to be the first study to completely replace PFTs with VFTs. Thus, our findings offer new insights and evidence on VFTs that may be valuable in educational policy formation. For instance, schools may finally address potential equity issues by offering a VFT option to students who could not afford to go on field trips. Rather than a once-a-year-only event, teachers could maximize VFTs as an instructional strategy to support teaching and learning both inside and outside the classroom. Parents would not have to worry about their children getting separated, loss, or hurt, especially in field trips in which they cannot chaperone. Finally, students could benefit by having the opportunity to go in any VFT from anywhere at any time.

Despite the positive results of our study, we do not claim that VFTs could be used as a substitute for in-person field trips. A critical factor that is missing from this field trip version is the opportunity for social interaction. According to Behrendt and Franklin, (2014), field trips promote social growth by encouraging positive interactions between students, parents, and teachers. While we assumed pupils may have been very interested in the VFT experience, we did not anticipate the potential for internet fatigue due to school shuttering and being overwhelmed with online learning experiences (Garcia, 2022). In addition, one of our concerns was how pupils interacted with the VFT experience. Although we recognized their engagement was independent of the teacher, we were not able to determine how they interacted with the 360° videos, the duration of time on specific content, or the return to a particular timestamp for further exploration. Gathering these information requires the integration of an appropriate framework for video viewing behaviors (Kleftodimos & Evangelidis, 2014), educational data mining (El Aouifi et al., 2021) and video analytics (Zhou et al., 2021). Another concern was pupils' actual perceptions of the experience. While we assume our instrument was aligned with their VFT journey, their experience may have varied based on internet access, technology capacity, and parental guidance. Prior research suggests that although 360° videos are more immersive than traditional 2D videos, they are still less immersive than 360° videos in a VR platform (Rupp et al., 2019). Future researchers should compare standard 360° videos and 360° videos in VR environments to determine whether the level of immersiveness is translatable to the quality of a VFT. We also encourage researchers to replicate our experiment at other levels of education. Prior works mainly used VR for conducting a VFT (e.g., Harron et al., 2019).

Although our study addressed the potential obstacle of needing to acquire high-fidelity VR headsets for a wide-scale VFT implementation, it also introduces potential constraints for future adopters. For instance, we relied on available 360° videos since it was impossible to shoot our own because of the COVID-19 pandemic. We found that the dependency on the existing 360° videos is not ideal for a personalized VFT (e.g., selecting the field trip locations and how they are presented in the videos). More importantly, these videos were not deliberately captured for VFT purposes and may need to be reshot to meet the curriculum needs. Therefore, special camera equipment capable of capturing 360° videos is required. The expertise of instructional designers is also recommended to ensure that the content area is covered, and the instruction is developmentally appropriate for the intended audience. VFT also demands ample time to design the experience, create video materials, and ensure that everything is compliant with specific educational objectives. As echoed by existing studies (Hudak, 2003; Kisiel, 2006; Knapp & Barrie, 2001; Nadelson & Jordan, 2012), a well-organized field trip yields successful cognitive, affective, and psychomotor outcomes. On the positive side, the design and development of VFT using 360° videos can become a professional development experience (e.g., learning content design, videography, video editing, etc.) for all school administrators and teachers involved.

Conclusion

Our research addressed the gap in the literature related to the use of 360° videos for VFTs as an alternative to PFTs in primary education. Through four rounds of experimentation, we found that students assigned to the 360V group consistently scored higher than the REGV group, although not significantly. Further, the VFT experience of pupils varies depending on the allocated treatment (i.e., REGV or 360V). By exploring the possibility of substituting PFTs with 360° videos, our study has provided insights for schools, policymakers, administrators, and teachers to design, develop, and deploy VFTs that allow students to gain experience that cannot be duplicated in traditional school experiences. These results call attention to a possible implementation of VFT and continue the long-standing tradition that has been acknowledged as an integral student-centered, interactive educational method.