This section presents and discusses the results of the quantitative and qualitative analysis conducted to explore the impact of Intelligent Tutoring Systems on students with varying levels of programming experience. Through chi-square and ANOVA tests, the quantitative analysis examines associations between programming proficiency levels and the effectiveness of ITS. The qualitative analysis highlights themes related to the positive impact of ITS, challenges faced by students, and suggestions for improvement. By integrating these findings, this section offers valuable insights into optimizing ITS interventions for students with diverse programming backgrounds.
5.1. Quantitative analysis
This study performed chi-square tests to identify potential associations between students' level in programming and the increase in their confidence and motivation in programming since using ITS on the one hand. On the other hand, chi-square tests were performed to check potential associations between students' level of programming and the usefulness of ITS in completing programming tasks. Results are displayed in Tables 2 and 3 below.
Table 2
| Value | df | Asymptotic Significance (2-sided) |
Pearson Chi-Square | 4.983a | 2 | .083 |
Likelihood Ratio | 8.515 | 2 | .014 |
Linear-by-Linear Association | 4.317 | 1 | .038 |
N of Valid Cases | 160 | | |
a. 2 cells (33.3%) have expected count less than 5. The minimum expected count is 1.20.
Table 3
| Value | df | Asymptotic Significance (2-sided) |
Pearson Chi-Square | 1.486a | 2 | .476 |
Likelihood Ratio | 2.674 | 2 | .263 |
Linear-by-Linear Association | 1.287 | 1 | .257 |
N of Valid Cases | 160 | | |
a. 2 cells (33.3%) have expected count less than 5. The minimum expected count is .40.
The chi-square is 4.983, with p-values of 0.083. Given that the p-value is higher than 0.05, there is a non-significant association between students' level in programming and the increase in their confidence and motivation in programming since using ITS. In other words, there is insufficient evidence to claim a significant association between students' level in programming and the increase in their confidence and motivation in programming since using ITS. The higher p-value indicates that the difference in confidence and motivation levels between different levels of programming students may be due to random chance rather than an actual relationship.
Similarly, the following chi-square is 1.486, with p-values of 0.476. Given that the p-value is higher than 0.05, there is a non-significant association between students' level of programming and the usefulness of ITS in completing programming tasks. Put differently, findings suggest that there is not enough evidence to suggest a significant association between students' level of programming and the usefulness of ITS in completing programming tasks. The higher p-value indicates that the difference in the perceived usefulness of ITS in completing programming tasks among different levels of programming students may be due to random chance rather than an actual relationship.
We conducted three ANOVA tests regarding the influence of ITS in improving programming skills, the usefulness of ITS feedback on programming tasks, and the satisfaction with using the interface and functionality of the ITS. Tables 4 and 5 show the result of the first ANOVA test.
Table 4
ANOVA 1: Influence of ITS in improving programming skills
| Sum of Squares | df | Mean Square | F | Sig. |
Between Groups | 5.365 | 2 | 2.682 | 5.524 | .005 |
Within Groups | 76.235 | 157 | .486 | | |
Total | 81.600 | 159 | | | |
Table 5
Multiple Comparisons 1: Post Hoc Tests
(I) Level in programming | (J) Level in programming | Mean Difference (I-J) | Std. Error | Sig. | 95% Confidence Interval |
Lower Bound | Upper Bound |
Beginner | Intermediate | .294 | .184 | .250 | − .14 | .73 |
Advanced | − .706* | .254 | .016 | -1.31 | − .11 |
Intermediate | Beginner | − .294 | .184 | .250 | − .73 | .14 |
Advanced | -1.000* | .302 | .003 | -1.71 | − .29 |
Advanced | Beginner | .706* | .254 | .016 | .11 | 1.31 |
Intermediate | 1.000* | .302 | .003 | .29 | 1.71 |
*. The mean difference is significant at the 0.05 level.
The significance level of 0.005 in Table 4 is lower than the set level of 0.05. Therefore, there is a statistically significant difference in the influence of ITS in improving programming skills among students of different levels of programming experience, with the advanced students showing the most significant improvement compared to beginner and intermediate students. The post hoc tests offer ample details about differences, notably between "Beginner" and "Advanced" and between "Intermediate" and "Advanced," with significance levels of 0.16 and 0.003, respectively. Students' proficiency in programming may impact the influence of ITS in improving their programming skills.
Tables 6 and 7 display the results of the second ANOVA test.
Table 6
ANOVA 2: Usefulness of ITS feedback on programming tasks
| Sum of Squares | df | Mean Square | F | Sig. |
Between Groups | 5.835 | 2 | 2.918 | 3.476 | .033 |
Within Groups | 131.765 | 157 | .839 | | |
Total | 137.600 | 159 | | | |
Table 7
Multiple Comparisons 2: Post Hoc Tests
(I) Level in programming | (J) Level in programming | Mean Difference (I-J) | Std. Error | Sig. | 95% Confidence Interval |
Lower Bound | Upper Bound |
Beginner | Intermediate | .176 | .242 | .747 | − .40 | .75 |
Advanced | − .824* | .333 | .038 | -1.61 | − .03 |
Intermediate | Beginner | − .176 | .242 | .747 | − .75 | .40 |
Advanced | -1.000* | .397 | .034 | -1.94 | − .06 |
Advanced | Beginner | .824* | .333 | .038 | .03 | 1.61 |
Intermediate | 1.000* | .397 | .034 | .06 | 1.94 |
*. The mean difference is significant at the 0.05 level.
The ANOVA test results for the usefulness of ITS feedback on programming tasks show a significant difference in the effectiveness of feedback based on the student's level of programming (F(2, 157) = 3.476, p = .033). The between-groups variance is 5.835, while the within-groups variance is 131.765, indicating a significant difference in mean scores between at least two of the three levels of programming.
The post-hoc tests further reveal the mean differences between the different levels of programming. The results indicate significant differences in mean scores between the beginner and advanced groups (Mean Difference = -0.824, p = .038) and intermediate and advanced groups (Mean Difference = -1.000, p = .034). At the same time, there are no significant differences between the beginner and intermediate groups (Mean Difference = 0.176, p = .747).
In summary, these results suggest significant differences in the usefulness of ITS feedback on programming tasks among students of different levels of programming experience. The feedback is more effective for advanced students than for beginner and intermediate students. This information can be valuable for educators and instructional designers aiming to enhance the effectiveness of ITS feedback in programming education.
Tables 8 and 9 present the results of the third ANOVA test.
Table 8
ANOVA 3: Satisfaction with using the interface and functionality of the ITS
| Sum of Squares | df | Mean Square | F | Sig. |
Between Groups | 5.718 | 2 | 2.859 | 6.813 | .001 |
Within Groups | 65.882 | 157 | .420 | | |
Total | 71.600 | 159 | | | |
Table 9
Multiple Comparisons 3: Post Hoc Tests
(I) Level in programming | (J) Level in programming | Mean Difference (I-J) | Std. Error | Sig. | 95% Confidence Interval |
Lower Bound | Upper Bound |
Beginner | Intermediate | .471* | .171 | .018 | .07 | .88 |
Advanced | − .529 | .236 | .067 | -1.09 | .03 |
Intermediate | Beginner | − .471* | .171 | .018 | − .88 | − .07 |
Advanced | -1.000* | .281 | .001 | -1.66 | − .34 |
Advanced | Beginner | .529 | .236 | .067 | − .03 | 1.09 |
Intermediate | 1.000* | .281 | .001 | .34 | 1.66 |
*. The mean difference is significant at the 0.05 level.
The ANOVA test results for satisfaction with using the interface and functionality of the ITS show a significant difference in satisfaction levels based on the student's level of programming (F(2, 157) = 6.813, p = .001). The between-groups variance is 5.718, while the within-groups variance is 65.882, indicating a significant difference in mean scores between at least two of the three levels of programming.
The post-hoc tests (multiple comparisons) reveal the mean differences between the different levels of programming. The results indicate significant differences in mean scores between the beginner and intermediate groups (Mean Difference = 0.471, p = .018) and intermediate and advanced groups (Mean Difference = 1.000, p = .001). At the same time, there was no significant difference between the beginner and advanced groups (Mean Difference = -0.529, p = .067).
In conclusion, these findings suggest significant differences in satisfaction levels with using the interface and functionality of the ITS among students of different levels of programming experience. Intermediate and advanced students show higher levels of satisfaction compared to beginner students. This information can be helpful for developers and educators looking to tailor the ITS interface and functionality to different levels of programming expertise to improve user satisfaction.
5.2. Qualitative analysis: Thematic analysis
Three themes emerged from the thematic analysis: positive impact on the learning process, challenges faced, and suggestions for improvement. This analysis offers a comprehensive understanding of the potential benefits and limitations of ITS in supporting students with varying levels of programming experience.
Many respondents highlighted the positive influence of the Intelligent Tutoring System on their learning process. For example, one participant stated, "ITS helped me understand the topics I did not understand in the classes," while another mentioned, "It makes me calmer." These responses indicate that the ITS has aided comprehension and reduced anxiety during the learning process.
While some participants reported no significant challenges, others mentioned specific difficulties. For instance, one respondent stated, "Sometimes I cannot find a course on the topic I want." At the same time, another mentioned, "Having restrictions on programming languages, tools, or project types may limit the scope and depth of my learning." These responses suggest that there may be limitations or barriers that impact the effectiveness of the ITS in programming education.
Participants offered suggestions for enhancing the ITS to support their programming education needs better. For example, one respondent recommended, "There should be more cooperation with Intelligent Agent (IA)," while another suggested, "Provides diverse learning resources such as video tutorials, interactive experiments, and programming challenges." These suggestions highlight the importance of incorporating additional features and resources to optimize the ITS for varied learning preferences and requirements.
The thematic analysis of the responses indicates that while the ITS positively impacts the learning process for many students, there are also challenges and areas for improvement that should be addressed to enhance its effectiveness in supporting programming education.
5.3. Integration of the findings
Integrating the quantitative and qualitative findings provides a comprehensive understanding of the effectiveness of Intelligent Tutoring Systems in supporting students with varying levels of programming experience. The quantitative analysis revealed significant differences in the influence of ITS on improving programming skills, the usefulness of ITS feedback on programming tasks, and satisfaction levels with the interface and functionality of the ITS based on students' programming experience levels.
The qualitative analysis further enriched these findings by highlighting the positive impact of ITS on the learning process, challenges faced by students, and suggestions for improvement. The qualitative data shed light on the benefits of ITS in aiding comprehension and reducing anxiety, as well as the limitations and barriers students may encounter when utilizing ITS in programming education. Additionally, students' suggestions for enhancing the ITS to meet their learning needs better emphasized the importance of incorporating diverse learning resources and enhancing collaboration with Intelligent Agents.
Overall, integrating quantitative and qualitative findings underscores the importance of optimizing Intelligent Tutoring Systems to cater to students' diverse programming backgrounds and preferences.
5.4. Discussions
The findings of the quantitative analysis align with previous research indicating the effectiveness of Intelligent Tutoring Systems in programming education. Studies like those by Swanson (2019) and Fan et al. (2023) have highlighted the benefits of personalized e-learning and automated feedback in enhancing the learning experience. The current study builds on this foundation by examining the impact of ITS on students with varying levels of programming experience, shedding light on how proficiency levels can influence the effectiveness of ITS feedback and satisfaction.
In line with the studies by Gavrilovic and Ovanovićo (2015) and Lee and Baba (2005), our research deviates by uncovering the nuanced differences in the influence of ITS based on students' programming proficiency levels. Contrary to previous assumptions about ITS efficacy being consistent across all learners, our findings suggest that the effectiveness of ITS feedback and satisfaction levels varied significantly among beginner, intermediate, and advanced programming students. This departure from the general trend underscores the importance of considering students' proficiency levels in designing and implementing ITS in programming education.
The novelty of this study lies in its focus on the impact of ITS on students with diverse programming backgrounds. By conducting chi-square and ANOVA tests to examine the association between programming proficiency levels and the effectiveness of ITS, the study offers valuable insights into how individual student characteristics can influence the outcomes of ITS interventions. Furthermore, the qualitative thematic analysis highlights the specific challenges and suggestions for improvement provided by students, adding a qualitative dimension to the existing literature on ITS in programming education. Overall, the study contributes to the growing body of research on ITS by emphasizing the importance of tailoring ITS interventions to meet the unique needs of students with varying levels of programming experience.