"Hey IDE, Display Hello World": Integrating a Voice Coding Approach in Hands-on Computer Programming Activities

Abstract

Following recent advancements in automatic speech recognition (ASR) technologies, we replicated an experiment four decades ago that utilized voice as an input modality for computer programming. We also extended this experiment by investigating the pedagogical effectiveness of ‘programming by voice’ in terms of attitude, self-efficacy, code correctness, and coding speed. A total of 96 students from an institute of technology in the capital region of the Philippines were randomly selected to participate in a quasi-experimental study using a one-group pretest-posttest design. We subjected students to programming activities with different levels of difficulty to compare voice and keyboard. Our results show that although voice decreases negativity, it likewise decreases control, which means that both attitude and self-efficacy are positively and negatively affected, respectively. Using voice as an input modality also allows students to code faster when the activities are easy but not when they are moderate or difficult. Code correctness analysis shows that voice is only preferable for easy and moderate machine problems. With the deviation of our findings from an experiment four decades ago, we can now conclude that ASR technologies and voice as input modality provide substantial implications and new opportunities for teaching and learning computer programming.

Keywords: Automatic Speech Recognition, Human-Computer Interaction, Computer Programming, Coding

Introduction

In the interdisciplinary subfield of computational linguistics and computer science, Automatic Speech Recognition (ASR) is defined as the automated process of converting voice input into its corresponding transcript. It is an important research domain in human-computer interaction (HCI) due to its vast real-world applications that unceasingly alter the way people live (Fendji et al., 2022; Hu et al., 2011; Proksch et al., 2019). The implications of ASR technologies led to classifications based on how they strengthen human-human communication (HHC) and human-machine communication (HMC) (Yu & Deng, 2015). One example is the obstacle brought by monolingualism in HHC, particularly when people communicate with people from another language group. ASR alleviates this difficulty and eliminates the language barrier through automated speech-to-speech translation (e.g., Asian and English languages (Nakamura et al., 2006)). In terms of HMC, there have been many exciting advancements and voice assistants (e.g., Siri, Cortana, and Alexa) have become mainstream technologies because they come inbuilt into numerous devices, including smartphones and computers. The ubiquitousness of ASR is most apparent in the persistent proliferation of voice assistants in many sectors, such as health (Miner et al., 2020; Schulte et al., 2020; Seródio Figueiredo et al., 2022), business (Chang, 2000; Hegdepatil & Davuluri, 2021; Rabassa et al., 2022), education (Al Shamsi et al., 2022; Fox Carly et al., 2021; Russell et al., 1996), and others.

The reputation of computer programming as an intellectually challenging course has conceived a new phenomenon known as programming anxiety. Sometimes referred to as fear of coding, many researchers have studied the reasons for its occurrence and prospective solutions due to its negative effects on the learning process. It has been posited that this psychological state occurs because students mistakenly assess their programming learning ability compounded by the underdevelopment of requisite skills (Connolly et al., 2009). With the shortage of self-efficacy and a sense of control, it is consequently crucial to formulate strategies that foster student motivation and confidence. Meanwhile, fear was examined as a descriptor that symbolizes a lack of appreciation of or interest in computer programming as a discipline (Rogerson & Scott, 2010). Accordingly, there is a prevalence of feelings of discomfort or apprehension among tertiary students. In addition to internal factors (e.g., attitude and motivation), external factors (e.g., teachers and their strategies) were also emphasized as catalysts for the inhibiting force of fear. In an attempt to reduce, if not eliminate, programming anxiety, different strategies have been proposed, such as utilizing online interactive coding platforms (Figueroa Jr. & Amoloza, 2015), gamified online courses (Garcia & Revano, 2021), technology-supported interactive strategies (Jiang et al., 2020), group learning approaches (Garcia, 2021), educational programming languages (Demir, 2022), and others. A common denominator among these pedagogies is the provision for a more engaging and active learning experience.

Almost four decades ago, one study conducted a controlled experiment to compare voice and keyboard as input modalities in writing computer programs (Leggett & Williams, 1984). The basis of this study leans on the promising enrichments offered by voice inputs in crafting effective user interfaces that minimize keyboard operations and maximize real-time naturalistic HCI. Through the measures of accuracy, speed, and efficiency, the experiment discovered that voice competes reasonably well with the keyboard. Nonetheless, it was hypothesized further that voice could have outperformed the keyboard in all aspects had the computer processing power been sufficient and participants were more experienced in voice-enabled devices. This assumption warrants further investigation. Following recent advancements in ASR technologies, this study replicates the experiment and likewise extends the evaluation by assessing the pedagogical effectiveness of a voice-driven coding approach. Understanding the effectiveness of voice inputs using ASR technologies may present new and exciting opportunities for teaching and learning computer programming. Accordingly, the findings of this experiment will benefit teachers and students of computer programming in their intended academic outcomes.

In this study, we explicitly referred to the use of voice instead of a keyboard to write source code as voice programming. This programming by voice approach is a relatively underdeveloped research area and deserves further investigation in light of recent developments in ASR technologies and HCI concepts. There are also significant health concerns about computer-related injuries caused by excessive typing. In addition, some motor disabilities prevent computer users from using a keyboard and mouse. Thus, the significance of our study is not constrained to programming pedagogies but also implications that extend to health disability inclusion. These research questions (RQs) guided our study:

Is there a significant difference in terms of attitude and self-efficacy before and after the activities?
Is there a significant difference in the code correctness between voice and typed input?
Is there a significant difference in the coding speed in terms of modality and difficulty of machine problems?

Background of the Study

Automatic Speech Recognition

For human beings, verbal communication is the most natural form of communication. Therefore, teaching computers to learn and understand natural human languages (e.g., natural language processing) is an unsurprising idea. Supported by existing voice and speech technologies, a naturalistic interaction between users and computers and other digital devices using verbal commands are no longer a fictional scenario (Alharbi et al., 2021). This interaction is largely attributed to the continuous advancements in the field of speech signal processing, particularly in ASR technologies.

Voice User Interface

There has been a growing number of devices that integrate a voice-user interface (VUI) making ASR more convenient. VUI is a technology that provides ASR capabilities to assist users in interacting with their electronic devices using voice commands. Some applications include home automation systems (Roy et al., 2021), smart speakers (Baimirov et al., 2022), service robots (Stavropoulou et al., 2020), and others. In education, VUI can also support the teaching and learning process. For instance, schools can enhance their student support services through VUI-enabled applications. One example is the chat system that assists computer science students in their academic concerns, including referencing, academic writing, and programming (Seeroo & Bekaroo, 2021).

Computer-Related Injuries

Computer-related injuries (e.g., carpal tunnel syndrome) are a prevalent health issue among computer users. Daily computer use of at least four hours significantly increased frequent health complaints, especially on hands, fingers, and wrists (Hakala et al., 2010). There is also extensive evidence that keyboard users are susceptible to Repetitive Strain Injury (also known as overuse syndrome) (Keller et al., 1998). For computer programmers who are prolonged keyboard users, this health issue is a serious problem. Ergonomic keyboards are a solution that has been extensively used by typists (Ripat et al., 2010). Albeit this product is also applicable for coders, eliminating keyboards and replacing them with VUI devices is worth exploring. Some studies have already examined a hands-free computer interface approach in voice-assisted software modeling (Black et al., 2019) and speech-based integrated programming environments (Begel & Graham, 2006; Elmaghraby, 1989).

Figure 1: Voice-Driven Integrated Development Environment Using Serenade

Students with Disabilities

VUIs are also appealing to users who experience difficulties in the conventional graphical user interface (Vacher et al., 2015). For instance, a series of co-design workshops were conducted to develop VUIs with and for visually-impaired students (Metatla et al., 2019). The inspiration for this study was the proliferation of voice-based personal assistant devices. It was learned that VUIs possess a significant potential for creating accessible and inclusive interactions in an academic setting. Another proof supporting this claim is the utilization of a “programming by voice” approach for motorically challenged children (Cordero et al., 2021; Okafor, 2022; Wagner et al., 2012). The empirical findings from prior works have propelled ASR technologies and VUI devices into the education sector as a potential instructional technology intervention.

Methodology

Research Design

This cross-sectional experimental research with a one-group pretest-posttest design is an empirical investigation of ASR and voice programming. In recent years, ASR technologies through VUI devices have been catapulted into many sectors of society, making them a subject of interest among researchers. Instead of an exploratory nature of research like in previous investigations, we purposely selected an experimental methodology to examine cause-effect relationships. Driven by the same notion that voice is the most natural form of human communication, we replicated a controlled experiment conducted almost four decades ago (Leggett & Williams, 1984). This experiment evaluated voice versus keyboard as modalities for writing computer programs. We extended this experiment by assessing the efficacy of voice programming in terms of attitude, self-efficacy, code correctness, and coding speed.

Setting and Sample

Students from an institute of technology in the capital region of the Philippines were randomly selected to join the study. This non-sectarian, private higher educational institution offers four-year information technology and computer science programs. At the time of our research, the Bachelor of Science in Information Technology (BSIT) has four specializations, such as Animation and Game Development (BSIT-AGD), Digital Arts (BSIT-DA), Business Analytics and/or Service Management (BSIT-SMBA), and Web and Mobile Applications (BSIT-WMA). Regardless of multiple specializations, the BSIT degree program positions the computer programming courses as a potent foundation not only for academic development but also for a future computing career and employability prospects. This standpoint is noticeable in the numerous programming courses in the curriculums. The sample size was computed using Slovin’s formula n = N ÷ (1+Ne2) and the inclusion criteria were (1) a passing grade in the introductory programming course and (2) enrollment in any of the subsequent courses (e.g., Object-Oriented Programming). With a population of 126 programming students, the preferred sample size was 96. All students agreed and submitted an informed consent form.

Measurement and Data Collection

In our data collection, we utilized a survey questionnaire that has been similarly used in another programming study (Garcia, 2021). This questionnaire contains demographic information and measures of programming attitude and self-efficacy using validated scales namely the Attitude Scale of Computer Programming Learning (ASCOPL) and the Computer Programming Self-Efficacy Scale (CPSES), respectively. The demographic information section is composed of students’ age, gender, program specialization, and prior programming grade. The ASCOPL is a five-point Likert with three constructs (i.e., willingness, negativity, and necessity) measuring attitude toward learning computer programming. The CPSES is an evaluation tool with five constructs (i.e., algorithm, logical thinking, debug, control, and cooperation) that measure students’ beliefs in their capability to perform well in computer programming courses. The questionnaire was distributed before and after the experiment for the pretest and posttest comparison. The experiment was comprised of programming activities with three difficulty levels. The easy level includes Input and Output, Arithmetic Operation, and Variable Manipulation. The average level includes Conditional Statements, Looping Structures, and Standard Library Functions. Finally, the difficult level includes Array Data Structures and User-Defined Functions. Some of the programming activities we used for each level are as follows:

Easy: Write a program that will prompt users to input their name, section, and other custom information about them as well as class schedules (at least five courses with complete details such as the time, room, teacher, etc.) After students encoded all their details, clear the whole screen and then display the output in an organized, COR-inspired layout.

Moderate: Write a program that reads a date from the user and computes its immediate successor. For example, if the user enters values that represent 2022-11-18 then your program should display a message indicating that the day immediately after 2022-11-18 is 2022-11-19. If the user enters values that represent 2022-11-30 then the program should indicate that the next day is 2022-12-01. If the user enters values that represent 2022-12-31 then the program should indicate that the next day is 2023-01-01. The date will be entered in numeric form with three separate input statements: one for the year, one for the month, and one for the day. Never mind the leap year for now.

Difficult: Write a program that can simulate an ATM. In this program, you should be able to integrate different programming techniques such as conditional statements, looping constructs, and functions. It is also important to use a multidimensional array for the card and PIN (login).

Data Analysis

The collected data were analyzed using IBM SPSS Statistics 26.0. We utilized descriptive statistics to report the demographic information. Since ordinal data were produced by the ASCOPL and CPSES instruments, we used the Wilcoxon signed-rank test for the comparison of pretest and posttest scores of attitude and self-efficacy (RQ1). Rather than the efficiency of the algorithm, we graded the programming activities based on code correctness (i.e., code is running and achieves the correct output) making it a dichotomous variable. Thus, McNemar's test was utilized via a cross-over design where programming students write solutions to machine problems using both keyboard and voice (RQ2). For RQ3, we employed MANOVA to determine whether the coding speed varies in terms of difficulty levels of activities. All levels have ten pre-made activities and each activity was written in three different formats (n = 30) for randomization purposes.

Results

Demographic Profile

Table 1 presents the profile of the respondents. A total of 96 programming students joined the experimental study. The mean age was 18.78 ± 0.81 years, and the majority were male students enrolled in a BSIT-WMA program. In the earlier programming course, their average grade ranged from 81-85% (mean = 83%).

Table 1: Demographic Profile of Programming Students

Profile	Classification	f	%
Age	Less than 18 years old 18 years old and above	13 83	18.75 81.25
Gender	Male Female	66 30	68.75 31.25
Specialization	BSIT-DA BSIT-WMA BSIT-AGD BSIT-SMBA	13 39 25 19	13.54 40.63 26.04 19.79
Previous Programming Grade	70-75 76-80 81-85 86-90 91-95 96-100	7 19 37 18 10 5	7.29 19.79 38.54 18.75 10.42 5.21

Attitude and Self-Efficacy

The evaluation of programming input modalities in terms of attitude and self-efficacy exhibited mixed findings according to Wilcoxon signed-rank test (Table 2). In terms of attitude, only negativity was the significant construct decreasing from 3.85 ± 1.44 to 3.55 ± 1.15 (p = 0.174). Although the scores decreased, it is still a positive result as it implies that negative perceptions of computer programming were reduced after the experiment. Both willingness (3.11 ± 1.50 to 3.49 ± 1.15) and necessity (from 3.22 ± 1.42 to 3.40 ± 1.18) increased but not significantly (p = 1.000). When it comes to self-efficacy, control was the only significant construct decreasing from 4.01 ± 0.81 to 3.36 ± 1.12 (p = 0.11). Unfortunately, this finding indicates that students were more in control of using the keyboard than voice inputs.

Table 2: Attitude and Self-Efficacy Scores

Factor	Constructs	Pretest (M ± SD)	Posttest (M ± SD)	p-value
Attitude	Willingness Negativity Necessity	3.11 ± 1.50 3.85 ± 1.44 3.22 ± 1.42	3.49 ± 1.15 3.35 ± 1.15 3.40 ± 1.18	1.000 0.044 1.000
Self-Efficacy	Logical Thinking Algorithm Debug Control Cooperation	3.43 ± 1.50 2.53 ± 1.17 3.58 ± 1.11 4.01 ± 0.81 3.65 ± 1.16	3.10 ± 1.40 2.69 ± 1.15 3.98 ± 0.85 3.36 ± 1.12 4.05 ± 0.83	1.000 1.000 0.823 0.011 1.000

Code Correctness

The evaluation of programming input modalities in terms of code correctness likewise revealed mixed findings according to McNemar's test. Both easy (p = 0.031) and moderate (p = 0.008) activities yielded significant changes when students switched to voice as the input modality. For the difficult level, switching the modalities did not prompt significant changes (p = 0.250). In the crosstabulation (Table 3), it was shown that 29 students initially got incorrect solutions but correctly answered the easy problems after switching to voice programming. The same results can be observed in average problems, where 20 students originally got incorrect answers but made correct solutions after using voice as the input modality. Finally, there were more negative changes in the difficult level (correct to incorrect) albeit not significantly.

Table 3: Code Correctness Using Keyboard and Voice

Difficulty Levels	Keyboard	Voice		p-value
Difficulty Levels	Keyboard	Incorrect	Correct	p-value
Level 1 – Easy	Incorrect Correct Total	9 6 15	29 52 81	0.031
Level 2 – Average	Incorrect Correct Total	15 8 23	20 53 73	0.008
Level 3 – Difficult	Incorrect Correct Total	15 13 28	8 60 68	0.250

Coding Speed

According to the one-way MANOVA analysis (Table 4), the coding speed between keyboard and voice as the input modality was significant in terms of difficulty levels (p = 0.039; Wilk's Λ = 0.542, partial η² = 0.434). Interestingly, voice is faster (167.03 ± 81.06 seconds) than keyboard (173.30 ± 76.45 seconds) when the difficulty level of the machine problem is easy. On the other hand, students finish their moderate and difficult activities faster when they use a keyboard (538.76 ± 249.16 and 886.80 ± 172.24 seconds) instead of the voice programming technique (545.07 ± 253.09 and 900.80 ± 176.68 seconds). This finding indicates that voice programming may only be useful for easy activities.

Table 4: Comparison of Coding Speed

Coding Speed	Keyboard	Voice
Level 1 – Easy M ± SD Min – Max	173.30 ± 76.45 43 – 292	167.03 ± 81.06 30 – 294
Level 2 – Average M ± SD Min – Max	538.76 ± 249.16 107 – 994	545.07 ± 253.09 112 – 997
Level 3 – Difficult M ± SD Min – Max	886.80 ± 172.24 608 – 1196	900.80 ± 176.68 631 – 1314

Discussion

In this experimental study, we examined voice programming as a coding approach in terms of attitude, self-efficacy, coding speed, and code correctness among programming students. This is a replication and extension of an experiment on using voice as an input modality in computer programming (Leggett & Williams, 1984). A total of 96 random students from an institute of technology in the capital region of the Philippines participated in a series of programming activities with three difficulty levels for both voice and keyboard modalities. Overall, our study showed mixed findings.

According to our findings, the negativity towards computer programming as a discipline significantly decreased after using the voice modality. Computer programming has a reputation for being a difficult course and many students are afraid of learning it because of this perception (Garcia, 2021). More importantly, students are more likely to believe computer programming is difficult when they have negative impressions of the subject. The fear factor is also detrimental because it diminishes intrinsic motivation and negatively influences student attitudes (Rogerson & Scott, 2010). Therefore, the effect of voice programming opens positive opportunities for teachers to engage their students who have a negative attitude toward this subject. One possible explanation for the reduction of negativity could be attributed to the enjoyment when using voice inputs. In a coding workshop, it was found that fun programming activities have a positive effect on student attitudes towards coding (Tisza & Markopoulos, 2021).

Despite the positive effect of voice programming on attitude, the opposite is evident in the self-efficacy construct. According to students, using a voice interface significantly decreased their perceived control. This finding is consistent with what has been discovered in the original experiment (Leggett & Williams, 1984), where less input task completion rate through the voice editor was recorded compared to the key editor. One possible explanation is that the keyboard has been the primary text input device since the introduction of computers and so people are more accustomed to it than to voice and speech recognition technologies. The positive and negative impact of employing a voice programming approach to attitude and self-efficacy, respectively, requires teachers to balance their implementation of this programming pedagogy. Importantly, it was found that perceived academic control is a crucial factor to predict dropout intention through the mediation of anxiety (Respondek et al., 2017).

In terms of code correctness, it appears that voice modality benefits students only when solving easy and moderate machine problems. This finding could be attributed to the small cognitive load demanded by these activities, allowing students to enjoy the voice interface more. However, when the activity is difficult and requires strict concentration, using a voice modality did not elicit significantly positive changes. Thus, it would be more beneficial to introduce the voice programming approach in the early stage of students’ coding journey. This tactic is also compatible with the necessity to teach basic concepts first (e.g., syntax, variables, expressions, and operators) before complex algorithms (Garcia et al., 2022).

The coding speed also varied depending on the difficulty of the programming activities, which is expected because easy ones require less time to finish and vice versa. However, it was clear that students were able to maximize the voice interface to finish easy activities but not moderate and difficult ones. This finding is similar to previous studies that found the speech input method was faster than keyboard input (Hauptmann & Rudnicky, 1990; Ruan et al., 2018). However, it contradicts the original experiment (Leggett & Williams, 1984), but the authors claimed that voice input as the mode of coding would have been more competitive with the keyboard had the technologies been sufficient. With the readiness of more advanced ASR technologies, it explains why voice inputs can be faster than keyboard inputs. This finding also verifies our result regarding code correctness rate, indicating the benefits of voice programming to easy machine problems.

Our results offer considerable implications for programming teachers and students given the widespread belief that computer programming is a challenging subject (Connolly et al., 2009; Garcia, 2021). With the positive influence of a voice programming approach on student attitudes, teachers may incorporate the voice interface into their laboratory activities. When students do not have a negative attitude toward the course, they are more likely to develop their self-efficacy and academic performance (Tisza & Markopoulos, 2021). However, caution must be taken in terms of the timing of implementation. According to our results, the voice programming approach is only advantageous when the activities are easy and short. Thus, implementing it in activities that impose a higher cognitive load may result in negative effects academically. Finally, previous studies underscored the need for ASR technologies with adequate technical capabilities to ensure the success of a voice-driven programming approach (Leggett & Williams, 1984; Ruan et al., 2018). It emphasizes the necessity for schools to invest in equipment and training to ensure the success of its implementation.

There are limitations to our study that present future avenues for research. First, our study was limited to students enrolled in a computing degree, which indicates that we did not incorporate other non-computing degrees (e.g., engineering) that also offer computer programming courses. In coding-based events, it was highlighted to involve students from different programs as they may have different perceptions of the activity (Garcia, 2022). In terms of attitude and self-efficacy, the instrument was self-administered, which may result in bias and social desirability. The experiment was also reliant on the employment of voice technologies, which indicates that schools must modify their classroom and ensure a voice-enabled development environment and a microphone are available. It may also be challenging to have all students talking at the same time – a potential issue that we have avoided because of the mandatory online education during the pandemic. Finally, future researchers may consider replicating the experiment with the participation of programmers with different levels of ability. It is possible that more advanced programmers who are used to writing codes and able to formulate logic correctly may find the voice programming approach more beneficial.

Conclusion

In this paper, we investigated the pedagogical potential of a voice programming approach concerning attitude, self-efficacy, code correctness, and coding speed. Our results demonstrate that although voice as an input modality decreases negativity, it also decreases control. This opposite effect reveals that both attitude and self-efficacy factors are positively and negatively affected, respectively, by the voice programming approach. Using a voice interface also allows students to code faster when the activities are easy but not when they are moderate or difficult. In our code correctness analysis, we found that utilizing voice input is only desirable for easy and moderate machine problems. Overall, our study upholds the pedagogical potential of utilizing voice as an input modality in writing computer programs. Considering these results, future researchers may explore the best way to integrate voice technologies either to replace or supplement keyboards.

Related Research

Tomayto, Tomahto: A Machine Learning Approach for Tomato Ripening Stage Identification Using Pixel-Based Color Image Classification

2019 IEEE 11th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM)

Read Paper

References

01
Al Shamsi, J. H., Al-Emran, M., & Shaalan, K. (2022). Understanding Key Drivers Affecting Students’ Use of Artificial Intelligence-Based Voice Assistants. Education and Information Technologies, 27, 8071-8091. https://doi.org/10.1007/s10639-022-10947-3
Search at Google Scholar
02
Alharbi, S., Alrazgan, M., Alrashed, A., Alnomasi, T., Almojel, R., Alharbi, R., Alharbi, S., Alturki, S., Alshehri, F., & Almojil, M. (2021). Automatic Speech Recognition: Systematic Literature Review. IEEE Access, 9, 131858-131876. https://doi.org/10.1109/ACCESS.2021.3112535
Search at Google Scholar
03
Baimirov, K., Mergengali, E., & Baimirov, B. (2022). Overview of the Latest Research Related to Smart Speakers. 2022 IEEE 7th International Energy Conference (ENERGYCON). doi: 10.1109/ENERGYCON53164.2022.9830196
Search at Google Scholar
04
Begel, A., & Graham, S. L. (2006). An Assessment of a Speech-Based Programming Environment. Visual Languages and Human-Centric Computing (VL/HCC'06). doi: 10.1109/VLHCC.2006.9
Search at Google Scholar
05
Black, D., Rapos, E. J., & Stephan, M. (2019). Voice-Driven Modeling: Software Modeling Using Automated Speech Recognition. 2019 ACM/IEEE 22nd International Conference on Model Driven Engineering Languages and Systems Companion (MODELS-C). doi: 10.1109/MODELS-C.2019.00040
Search at Google Scholar
06
Chang, H. M. (2000). Is ASR Ready for Wireless Primetime: Measuring the Core Technology for Selected Applications. Speech Communication, 31(4), 293-307. https://doi.org/10.1016/S0167-6393(99)00063-1
Search at Google Scholar
07
Connolly, C., Murphy, E., & Moore, S. (2009). Programming Anxiety Amongst Computing Students—A Key in the Retention Debate? IEEE Transactions on Education, 52(1), 52-56. https://doi.org/10.1109/TE.2008.917193
Search at Google Scholar
08
Cordero, D. D. L., Ayala, C., & Ordóñez, P. (2021). Kavita Project: Voice Programming for People with Motor Disabilities. 23rd International ACM SIGACCESS Conference on Computers and Accessibility. doi: 10.1145/3441852.3476516
Search at Google Scholar
09
Demir, F. (2022). The Effect of Different Usage of the Educational Programming Language in Programming Education on the Programming Anxiety and Achievement. Education and Information Technologies, 27(3), 4171-4194. https://doi.org/10.1007/s10639-021-10750-6
Search at Google Scholar
10
Elmaghraby, A. S. (1989). Voice Recognition Applications for Programming Environments. IEEE Energy and Information Technologies in the Southeast. doi: 10.1109/SECON.1989.132471
Search at Google Scholar
11
Fendji, J. L. K. E., Tala, D. C. M., Yenke, B. O., & Atemkeng, M. (2022). Automatic Speech Recognition Using Limited Vocabulary: A Survey. Applied Artificial Intelligence, 36(1), 1-35. https://doi.org/10.1080/08839514.2022.2095039
Search at Google Scholar
12
Figueroa Jr., R. B., & Amoloza, E. M. (2015). Addressing Programming Anxiety among Non-Computer Science Distance Learners: A UPOU Case Study. International Journal for Educational Media and Technology, 9(1), 56-67. https://jaems.jp/contents/icomej/vol9/7_Figuroa.pdf
Search at Google Scholar
13
Fox Carly, B., Israelsen-Augenstein, M., Jones, S., & Gillam Sandra, L. (2021). An Evaluation of Expedited Transcription Methods for School-Age Children's Narrative Language: Automatic Speech Recognition and Real-Time Transcription. Journal of Speech, Language, and Hearing Research, 64(9), 3533-3548. https://doi.org/10.1044/2021_JSLHR-21-00096
Search at Google Scholar
14
Garcia, M. B. (2021). Cooperative Learning in Computer Programming: A Quasi-Experimental Evaluation of Jigsaw Teaching Strategy with Novice Programmers. Education and Information Technologies, 26(4), 4839-4856. https://doi.org/10.1007/s10639-021-10502-6
Search at Google Scholar View at author's website
15
Garcia, M. B. (2022). Hackathons as extracurricular activities: Unraveling the motivational orientation behind student participation. Computer Applications in Engineering Education, 1-16. https://doi.org/10.1002/cae.22564
Search at Google Scholar View at author's website
16
Garcia, M. B., Juanatas, I. C., & Juanatas, R. A. (2022). TikTok as a Knowledge Source for Programming Learners: a New Form of Nanolearning? 2022 10th International Conference on Information and Education Technology (ICIET). doi: 10.1109/ICIET55102.2022.9779004
Search at Google Scholar View at author's website
17
Garcia, M. B., & Revano, T. F. (2021). Assessing the Role of Python Programming Gamified Course on Students’ Knowledge, Skills Performance, Attitude, and Self-Efficacy. 2021 IEEE 13th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management (HNICEM). doi: 10.1109/HNICEM54116.2021.9731935
Search at Google Scholar View at author's website
18
Hakala, P. T., Saarni, L. A., Ketola, R. L., Rahkola, E. T., Salminen, J. J., & Rimpelä, A. H. (2010). Computer-Associated Health Complaints and Sources of Ergonomic Instructions in Computer-Related Issues Among Finnish Adolescents: A Cross-Sectional Study. BMC Public Health, 10(11), 1-8. https://doi.org/10.1186/1471-2458-10-11
Search at Google Scholar
19
Hauptmann, A. G., & Rudnicky, A. I. (1990). A Comparison of Speech and Typed Input. Workshop on Speech and Natural Language. doi: 10.3115/116580.116652
Search at Google Scholar
20
Hegdepatil, P., & Davuluri, K. (2021). Business Intelligence Based Novel Marketing Strategy Approach using Automatic Speech Recognition and Text Summarization. 2021 2nd International Conference on Computing and Data Science (CDS). doi: 10.1109/CDS52072.2021.00108
Search at Google Scholar
21
Hu, R., Zhu, S., Feng, J., & Sears, A. (2011). Use of Speech Technology in Real Life Environment. Universal Access in Human-Computer Interaction. Applications and Services. doi: 10.1007/978-3-642-21657-2_7
Search at Google Scholar
22
Jiang, Y., Zhao, Z., Wang, L., & Hu, S. (2020). Research on the Influence of Technology-Enhanced Interactive Strategies on Programming Learning. 2020 15th International Conference on Computer Science & Education (ICCSE). doi: 10.1109/ICCSE49874.2020.9201627
Search at Google Scholar
23
Keller, K., Corbett, J., & Nichols, D. (1998). Repetitive Strain Injury in Computer Keyboard Users: Pathomechanics and Treatment Principles in Individual and Group Intervention. Journal of Hand Therapy, 11(1), 9-26. https://doi.org/10.1016/S0894-1130(98)80056-2
Search at Google Scholar
24
Leggett, J., & Williams, G. (1984). An Empirical Investigation of Voice as an Input Modality for Computer Programming. International Journal of Man-Machine Studies, 21(6), 493-520. https://doi.org/10.1016/S0020-7373(84)80057-7
Search at Google Scholar
25
Metatla, O., Oldfield, A., Ahmed, T., Vafeas, A., & Miglani, S. (2019). Voice User Interfaces in Schools: Co-designing for Inclusion with Visually-Impaired and Sighted Pupils. 2019 CHI Conference on Human Factors in Computing Systems. doi: 10.1145/3290605.3300608
Search at Google Scholar
26
Miner, A. S., Haque, A., Fries, J. A., Fleming, S. L., Wilfley, D. E., Terence Wilson, G., Milstein, A., Jurafsky, D., Arnow, B. A., Stewart Agras, W., Fei-Fei, L., & Shah, N. H. (2020). Assessing the Accuracy of Automatic Speech Recognition for Psychotherapy. npj Digital Medicine, 3(82), 1-8. https://doi.org/10.1038/s41746-020-0285-8
Search at Google Scholar
27
Nakamura, S., Markov, K., Nakaiwa, H., Kikui, G., Kawai, H., Jitsuhiro, T., Zhang, J. S., Yamamoto, H., Sumita, E., & Yamamoto, S. (2006). The ATR Multilingual Speech-to-Speech Translation System. IEEE Transactions on Audio, Speech, and Language Processing, 14(2), 365-376. https://doi.org/10.1109/TSA.2005.860774
Search at Google Scholar
28
Okafor, O. (2022). Helping Students with Cerebral Palsy Program via Voice-Enabled Block-Based Programming. ACM SIGACCESS Accessibility and Computing. https://doi.org/10.1145/3523265.3523267
Search at Google Scholar
29
Proksch, S.-O., Wratil, C., & Wäckerle, J. (2019). Testing the Validity of Automatic Speech Recognition for Political Text Analysis. Political Analysis, 27(3), 339-359. https://doi.org/10.1017/pan.2018.62
Search at Google Scholar
30
Rabassa, V., Sabri, O., & Spaletta, C. (2022). Conversational Commerce: Do Biased Choices Offered by Voice Assistants’ Technology Constrain its Appropriation? Technological Forecasting and Social Change, 174, 1-12. https://doi.org/10.1016/j.techfore.2021.121292
Search at Google Scholar
31
Respondek, L., Seufert, T., Stupnisky, R., & Nett, U. E. (2017). Perceived Academic Control and Academic Emotions Predict Undergraduate University Student Success: Examining Effects on Dropout Intention and Achievement. Frontiers in Psychology, 8(243). https://doi.org/10.3389/fpsyg.2017.00243
Search at Google Scholar
32
Ripat, J., Giesbrecht, E., Quanbury, A., & Kelso, S. (2010). Effectiveness of an Ergonomic Keyboard for Typists with Work Related Upper Extremity Disorders: A Follow-up Study. Work, 37, 275-283. https://doi.org/10.3233/WOR-2010-1079
Search at Google Scholar
33
Rogerson, C., & Scott, E. (2010). The Fear Factor: How It Affects Students Learning to Program in a Tertiary Environment. Journal of Information Technology Education: Research, 9, 147-171. https://doi.org/10.28945/1183
Search at Google Scholar
34
Roy, R. M., Sabu, B., N, A., & A. R, P. (2021). Voice Controlled Home Automation System. 2021 7th International conference on Bio Signals, Images, and Instrumentation (ICBSII). doi: 10.1109/ICBSII51839.2021.9445150
Search at Google Scholar
35
Ruan, S., Wobbrock, J. O., Liou, K., Ng, A., & Landay, J. A. (2018). Comparing Speech and Keyboard Text Entry for Short Messages in Two Languages on Touchscreen Phones. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 1(4), 1-23. https://doi.org/10.1145/3161187
Search at Google Scholar
36
Russell, M., Brown, C., Skilling, A., Series, R., Wallace, J., Bonham, B., & Barker, P. (1996). Applications of Automatic Speech Recognition to Speech and Language Development in Young Children. 4th International Conference on Spoken Language Processing (ICSLP). doi: 10.1109/ICSLP.1996.607069
Search at Google Scholar
37
Schulte, A., Suarez-Ibarrola, R., Wegen, D., Pohlmann, P.-F., Petersen, E., & Miernik, A. (2020). Automatic Speech Recognition in the Operating Room – An Essential Contemporary tool or a Redundant Gadget? A Survey Evaluation Among Physicians in Form of a Qualitative Study. Annals of Medicine and Surgery, 59, 81-85. https://doi.org/10.1016/j.amsu.2020.09.015
Search at Google Scholar
38
Seeroo, O., & Bekaroo, G. (2021). Enhancing Student Support via the Application of a Voice User Interface System: Insights on User Experience. International Conference on Artificial Intelligence and its Applications (icARTi). doi: 10.1145/3487923.3487936
Search at Google Scholar
39
Seródio Figueiredo, C. M., de Melo, T., & Goes, R. (2022). Evaluating Voice Assistants' Responses to COVID-19 Vaccination in Portuguese: Quality Assessment. JMIR Human Factors, 9(1), 1-8. https://doi.org/10.2196/34674
Search at Google Scholar
40
Stavropoulou, P., Spiliotopoulos, D., & Kouroupetroglou, G. (2020). Voice User Interfaces for Service Robots: Design Principles and Methodology. Universal Access in Human-Computer Interaction. Design Approaches and Supporting Technologies. doi: 10.1007/978-3-030-49282-3_35
Search at Google Scholar
41
Tisza, G., & Markopoulos, P. (2021). Understanding the Role of Fun in Learning to Code. International Journal of Child-Computer Interaction, 28, 1-10. https://doi.org/10.1016/j.ijcci.2021.100270
Search at Google Scholar
42
Vacher, M., Caffiau, S., Portet, F., Meillon, B., Roux, C., Elias, E., Lecouteux, B., & Chahuara, P. (2015). Evaluation of a Context-Aware Voice Interface for Ambient Assisted Living: Qualitative User Study vs. Quantitative System Evaluation. ACM Transactions on Accessible Computing, 7(2), 1-36. https://doi.org/10.1145/2738047
Search at Google Scholar
43
Wagner, A., Rudraraju, R., Datla, S., Banerjee, A., Sudame, M., & Gray, J. (2012). Programming by Voice: A Hands-Free Approach for Motorically Challenged Children. CHI '12 Extended Abstracts on Human Factors in Computing Systems. doi: 10.1145/2212776.2223757
Search at Google Scholar
44
Yu, D., & Deng, L. (2015). Automatic Speech Recognition: A Deep Learning Approach. Springer. https://doi.org/10.1007/978-1-4471-5779-3
Search at Google Scholar
Show References

Cite this paper

Garcia, M. B., Enriquez, J. B. R., Adao, R. T., & Happonen, A. (2022). "Hey IDE, Display Hello World": Integrating a Voice Coding Approach in Hands-on Computer Programming Activities. 2022 IEEE 14th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM), 1-6. https://doi.org/10.1109/HNICEM57413.2022.10109412.

Download Citation

Keywords

Authors

Garcia, Manuel B.

College of Computer Studies

FEU Institute of Technology, Philippines

iD 0000-0003-2615-422X

Enriquez, John Benedic R.

College of Computer Studies

FEU Institute of Technology, Philippines

Adao, Rossana T.

College of Computer Studies

FEU Institute of Technology, Philippines

Happonen, Ari

School of Engineering Science

LUT University, Finland

iD 0000-0003-0744-1776

Contact Info

Follow Me

"Hey IDE, Display Hello World": Integrating a Voice Coding Approach in Hands-on Computer Programming Activities

Abstract

Introduction

Background of the Study

Automatic Speech Recognition

Voice User Interface

Computer-Related Injuries

Students with Disabilities

Methodology

Research Design

Setting and Sample

Measurement and Data Collection

Data Analysis

Results

Demographic Profile

Attitude and Self-Efficacy

Code Correctness

Coding Speed

Discussion

Conclusion

Tomayto, Tomahto: A Machine Learning Approach for Tomato Ripening Stage Identification Using Pixel-Based Color Image Classification

References

Cite this paper

Keywords

Authors

Garcia, Manuel B.

Enriquez, John Benedic R.

Adao, Rossana T.

Happonen, Ari

"Hey IDE, Display Hello World": Integrating a Voice Coding Approach in Hands-on Computer Programming Activities

Garcia, Manuel B.

Enriquez, John Benedic R.

Adao, Rossana T.

Happonen, Ari

Abstract

Delivering Effective Game-Based Learning: A Comparative Study Between Computer and Mobile as the Learning Framework for Preschoolers

Introduction

Background of the Study

Automatic Speech Recognition

Voice User Interface

Computer-Related Injuries

Students with Disabilities

Methodology

Research Design

Setting and Sample

Measurement and Data Collection

Data Analysis

Results

Demographic Profile

Attitude and Self-Efficacy

Code Correctness

Coding Speed

Discussion

Conclusion

Tomayto, Tomahto: A Machine Learning Approach for Tomato Ripening Stage Identification Using Pixel-Based Color Image Classification

References

Cite this paper

Keywords

Authors

Garcia, Manuel B.

Enriquez, John Benedic R.

Adao, Rossana T.

Happonen, Ari