Leveraging on Chatgpt, an Artificial Intelligence (AI) Tool to Transform Examination Writing in Higher Education

doi:10.21203/rs.3.rs-4550887/v1

Research Article

This work is licensed under a CC BY 4.0 License

You are reading this latest preprint version

Purpose

The study explored how ChatGPT could transform examination writing in higher education. The research question was: How can the AI tool ChatGPT help transform examination writing in higher education? The study interviewed the chatbot (ChatGPT) by way of "prompt engineering" to test whether it has the potential to: (1) generate varied questions on a subject matter at the same level of difficulty, (2) generate varied questions on a subject matter with case scenarios, (3) generate varied questions on a subject matter that require critical thinking, (4) generate varied questions on a subject matter that take into consideration the diverse needs and backgrounds of students, and (5) streamline the assessment process. These tests were designed to explore the diverse ways the chatbot could assist in the examination process. Despite the academic concerns surrounding students' use of ChatGPT, the study's findings seek to stimulate discussion on the potentially significant benefits ChatGPT offers the educational sector if it is effectively implemented.

Approach/Design/Method

Through "prompt engineering", the study interviewed ChatGPT to test it on the five (5) objectives. Subject area lecturers and exam moderators reviewed questions as well as marking schemes generated by the chatbot to ensure accuracy and appropriateness.

Findings

ChatGPT was able to generate varied questions of the same difficulty level on the same given content and generated questions with case scenarios. The questions generated also required critical thinking; ChatGPT produced questions considering students' diverse needs and backgrounds. Finally, it could streamline the assessment process as well. However, ChatGPT required specific human guidance to shape the questions. Also, the examiner or the educator needed to double-check the accuracy of the questions generated. Additionally, it was discovered that the version of ChatGPT used was limited in subject areas that required graphics and figures.

Research Limitations/Implications

The study found that ChatGPT is effective with text-based tasks but is limited in areas that require graphics and figures (for the version used), which may be needed in some fields of study. This also meant that responses had to be in text form for the chatbot to award scores based on the assessment criteria. Notwithstanding these limitations, ChatGPT can become an excellent tool for assisting educators in assessing students, thereby transforming examination writing in higher education.

The originality of the study

Research on ChatGPT is still emerging, and the study is novel in that this area is only beginning to be explored. This paper will serve as one of the early research works that provide educators and policymakers with knowledge of how to leverage ChatGPT to transform examination writing.

Artificial Intelligence

ChatGPT

Chatbot

Prompt Engineering

Examination

Marking Scheme

Background of the study

Artificial Intelligence (AI) is the technology that seeks to build systems that mimic human intelligence or perform tasks that require human intelligence (Russell, 2010). The development of AI is growing fast due to the availability of vast amounts of data, making it easy to train machines to learn and make decisions (Duan et al., 2019). The growth of AI is also driven by the exponential growth of computing power: the development of specialised hardware such as graphics processing units (GPUs) and tensor processing units (TPUs) has accelerated AI training and inference (Khan et al., 2021; Cowls et al., 2021; Pietrzak et al., 2023). Today, AI is emerging as a transformative force with applications spanning various industries, such as healthcare and medical surgery (Hashimoto et al., 2020; Gamble, 2020). In the financial sector, machine learning algorithms detect fraud and predict stocks (Omar et al., 2017; Rath et al., 2019). In the transportation sector, AI is transforming the driving experience with the advent of self-driving cars (Zhao et al., 2018). The influence of AI is also felt in construction, sports, entertainment, and education. For instance, AI implementation positively impacts construction safety (Dobrucali et al., 2022). In sports, augmented reality and data analytics have transformed the experience of viewing and reporting sporting activities. In entertainment, AI has been incorporated into media streaming platforms, giving users a personalised viewing and listening experience (Morris & Patterson, 2015).

Equally, it is worth noting that the educational sector has also experienced a significant transformation due to the substantial impact of AI (Alam, 2021; Gong et al., 2021). Several studies have contributed to knowledge by exploring the implications of AI use in education, in areas including Early Childhood Education, Primary Education, Online Learning and Higher Education. Findings from Su et al. (2023) indicate that AI learning in Early Childhood Education presents opportunities for fostering young children's AI literacy; it can also facilitate the understanding of AI concepts, practices, and perspectives at an early stage. Pardamean et al. (2022), in their study in Indonesia, found that using AI in online learning can satisfy students’ learning styles at the primary level of education. Studies by Dai et al. (2023) and Ng et al. (2022) have also sought to highlight the importance of AI in Early Childhood Education. At the Higher Education level, AI has proven successful in impacting students’ learning, teaching, assessment, and administrative tasks (Al Braiki et al., 2020; Lameras & Arnab, 2021; Ahmad et al., 2022; Xia et al., 2022; Pham & Sampson, 2022; Fiok et al., 2022; Hannan & Liu, 2023).

What is ChatGPT, and how does it work?

The AI tool that has currently taken centre stage is ChatGPT, an AI chatbot launched by OpenAI in November 2022. ChatGPT is built on the GPT (Generative Pre-trained Transformer) language model, the name given to a family of natural language models developed by OpenAI; it provides an interactive environment where users can converse with the chatbot in a natural way (OpenAI, 2022). The chatbot reached 100 million users two months after its launch (Milmo, 2023). Users interact with ChatGPT through ‘prompting’, the technical term for querying the chatbot to engage it. The language model on which ChatGPT is built utilises the technique of reinforcement learning from human feedback. Reinforcement learning is a subfield of machine learning in which an agent learns to take actions in an environment in a way that maximises rewards (Devi et al., 2023). The agent learns without direct supervision and must determine which actions yield the highest rewards. The rewards act as feedback that reinforces the agent's successful actions, thus contributing towards the agent’s goal (Gilbert et al., 2023). The GPT language model is a type of large language model (LLM) trained by leveraging neural networks and an enormous amount of data, mainly from the Internet, information licensed from third parties, and information provided as feedback by users and human trainers (Kublik & Saboo, 2022).
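The reward-driven learning described above can be illustrated with a toy sketch. This is not how ChatGPT is trained; it is a minimal epsilon-greedy agent on a three-action problem, assuming made-up reward values, and the names `run_bandit`, `true_rewards` and `epsilon` are hypothetical. It only shows an agent discovering, without being told, which action yields the highest reward.

```python
import random


def run_bandit(true_rewards, steps=2000, epsilon=0.1, seed=0):
    """Toy epsilon-greedy agent: learns which action pays best from rewards alone."""
    rng = random.Random(seed)
    n_actions = len(true_rewards)
    estimates = [0.0] * n_actions  # agent's running estimate of each action's reward
    counts = [0] * n_actions       # how often each action has been tried

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: occasionally try a random action.
            action = rng.randrange(n_actions)
        else:
            # Exploit: pick the action currently believed to be best.
            action = max(range(n_actions), key=lambda a: estimates[a])

        # The environment returns a noisy reward; the agent never sees
        # true_rewards directly (values here are illustrative assumptions).
        reward = true_rewards[action] + rng.gauss(0.0, 0.1)

        # Reinforce: update the running average for the chosen action.
        counts[action] += 1
        estimates[action] += (reward - estimates[action]) / counts[action]

    return estimates, counts


estimates, counts = run_bandit([0.2, 0.5, 0.9])
best_action = max(range(len(estimates)), key=lambda a: estimates[a])
print("estimated rewards:", [round(e, 2) for e in estimates])
print("preferred action:", best_action)
```

In reinforcement learning from human feedback, the reward signal comes from human judgements of the model's responses rather than from a fixed list of numbers as in this sketch.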

The defining feature of ChatGPT is its generative AI capability, that is, its ability to produce original output (Qiu, 2023). The chatbot’s ability to generate original content and provide information relevant to the user’s prompt makes it a good tool for teachers and learners (Kohnke et al., 2023). ChatGPT’s abilities span various functions, such as performing sentiment analysis on texts, performing data analysis, writing articles, proofreading texts, and providing coding guides to students (Koubaa et al., 2023a).

Users, by way of prompting, can direct the chatbot to assume a role and perform a task. For example, a student can task ChatGPT to assume the role of a teacher in a subject area to teach and guide the student to learn a programming language. ChatGPT, with its generative AI capabilities, possesses the potential to impact how we generate content and engage with content. Additionally, it can potentially revolutionise the creative arts industries and the communication media (Korzynski et al., 2023; Qiu, 2023; Ritala et al., 2023).

Additionally, the disruptive nature of ChatGPT has the potential to revolutionise teaching, learning and research, as argued by Ivanov and Soliman (2023a). In their concept paper, the researchers argue that, given ChatGPT's capability to generate original content, students can use it to write their assignments and research papers. This means that universities will be forced to re-evaluate their teaching and assessment strategies and incorporate generative language models in teaching. They further argue that publishers must be more receptive towards manuscripts partially generated by AI.

Capabilities of ChatGPT

Studies have begun to explore the applications and implications of ChatGPT in Higher Education. For example, a study by Talan and Kalinkara (2023) compared ChatGPT's performance with that of 37 undergraduate students on an anatomy exam. ChatGPT answered 27 out of 40 questions correctly (67.5%), outperforming the students, who averaged 21.1 correct answers (52.8%). The study's findings suggest ChatGPT has potential as an educational tool but caution against risks like cheating. The authors stress the need for further research to understand ChatGPT's educational implications. Other studies have focused on ChatGPT's potential as an academic writing assistant, with mixed findings on its ability to produce high-quality texts without guidance (Cardon et al., 2023; Imran & Almusharraf, 2023). This study aimed to build on these works by examining the potential of ChatGPT specifically for aiding examination writing in Higher Education contexts. A summary of ChatGPT’s usage and capabilities is shown in the diagram (Fig. 1) below:

Limitations of ChatGPT

Despite the amazing capabilities of ChatGPT, it is not without some limitations; the chatbot can sometimes produce inappropriate responses. It can also hallucinate; this is a situation where the chatbot generates responses that may be plausible but factually incorrect or unsupported by the knowledge and data the system has been trained on. This phenomenon results from reliance on predictive modelling and probability rather than fully understanding concepts or having complete knowledge of the context of the prompt (Koubaa et al., 2023b; Roumeliotis & Tselikas, 2023; Sok & Heng, 2023).

Ethical concerns with the use of ChatGPT amongst students

Policymakers and educators have always grappled with the implications of incorporating AI tools into learning environments (Pedro et al., 2019; Roschelle et al., 2020; Miao et al., 2021), and ChatGPT is no exception. While ChatGPT stands out as an excellent tool for teachers and students, its fast adoption and use in educational settings, primarily by students, has raised some ethical concerns. The use of chatbots has sparked debates on their consequences for academic integrity and, most significantly, for the development of critical thinking skills among students (Currie, 2023; Yeo, 2023). There is also concern that students may do little research when writing their academic papers, since with ChatGPT they can generate an entire article without writing a word of it themselves. Another serious concern for educators is that students can use ChatGPT to generate fictitious datasets for their research without going to the field to collect data. A further worry is that ChatGPT's output can bypass plagiarism detectors: because generative AI produces unique and original content, it will almost always pass similarity checks. A student can therefore obtain a zero similarity index without having contributed to the writing of the content. Studies have also shown that AI detection tools can sometimes produce false positives (Elkhatat et al., 2023). In addition, many methods of evading AI detection exist, and the constantly evolving sophistication of generative AI tools makes it increasingly challenging to distinguish AI-generated content from human-written content. This phenomenon requires educators and policymakers to develop regulations to guide the use of generative AI tools among students.

Lastly, there are concerns about the erosion of authentic communication. While the chatbot excels at simulating a human-like conversation, its excessive use could contribute to students' over-reliance on pre-generated responses and potentially hinder the development of interpersonal skills and the ability to express original thoughts. The fear is that the excessive use of ChatGPT amongst students could diminish genuine human interaction in educational settings, leaving students with a skewed understanding of effective communication (Baskara, 2023; Rane et al., 2023).

While we acknowledge the ethical issues surrounding the use of ChatGPT amongst students, it is essential to strike a balance between leveraging its benefits for education and safeguarding against the unintended consequences that it poses. Successfully navigating the ethical complexities associated with using ChatGPT amongst students requires careful consideration, regulation, and a commitment to preserving the core values of education; thus, we must view this tool as both a potential threat and an opportunity for innovation. This calls for dialogue amongst educators and policymakers in shaping policies that ensure the responsible use of these AI tools in the academic realm.

Transforming Examination Writing in Higher Education with ChatGPT

Examination writing is a crucial assessment activity in the education process and allows for evaluating students’ knowledge, skills, and analytical abilities, which are essential for educational growth (Astin, 2012; Brown et al., 2013). One of the ways of transforming examination writing is to incorporate the use of technology in its process. Enhancing such a core educational component with technology aligns with the goals of improving pedagogy through technology, as outlined by Swan et al. (2006) and Jaap et al. (2021), just as online examination writing provides a digital platform for examination writing and assessment (Ardid et al., 2015; Akimov & Malin, 2020). Transforming examination writing thus refers to changing or improving the way examinations are structured; this includes drafting examination questions, writing examinations, and streamlining the assessment process (Hift, 2014; Agius et al., 2017).

The goal is to address the limitations and drawbacks of the traditional format of examination writing, thereby improving the examination's effectiveness and ensuring a fair assessment (Wolcott & Legg, 1998; Slavich & Zimbardo, 2012; Hift, 2014; Agius et al., 2017). Thus, transforming examination writing can involve a variety of strategies, such as incorporating different questions of the same level of difficulty on the same content, generating questions that require critical thinking, generating questions with case scenarios, using formative assessment, and ensuring inclusiveness, that is, considering the diverse backgrounds and needs of students when generating examination questions (Lai, 2011; Herrington et al., 2014). This process can be tedious for a human examiner when multiple varied questions with different case scenarios must be set for large and varied groups of students. This situation calls for adopting technology to facilitate the process, and an AI chatbot like ChatGPT is a promising candidate for the task.

In their conceptual paper, Ivanov and Soliman (2023) predict that, by the year 2030, digital teachers like ChatGPT will be able to grade tests, assignments, and other course assessments, giving students prompt and precise results. This seems to be possible already; this paper attempts to demonstrate the usefulness of ChatGPT in examination writing. The researchers would like to emphasise that the study does not seek to promote ChatGPT as a replacement for teachers but as a tool that teachers can use to augment their teaching and research capabilities. The researchers acknowledge the irreplaceable role of teachers in education and aim to showcase how AI tools like ChatGPT could be harnessed as a complementary resource to enhance the educational experience for teachers.

Problem statement

Research on ChatGPT and its application in education has emerged since its launch in November 2022. Many of these studies have focused on the ethical implications of ChatGPT in education, examining its usage among students for personalised and adaptive learning and how ChatGPT could be a disruptive tool to the information environment (Biswas, 2023; Cingillioglu, 2023; Elbanna & Armstrong, 2023; Halaweh, 2023; Opara et al., 2023; Qasem et al., 2023; Rahman & Watanobe, 2023). However, little has been done to explore its potential use in transforming examination writing or improving upon students’ assessment; for this reason, this study was carried out.

Statement of the research gap

From the research gap identified, a clear and explicit statement can be given: How can the AI tool ChatGPT help to transform examination writing in Higher Education?

Objectives of the study

The study objectives were carefully designed after a thorough review of the literature on transforming examination writing and consultation with some educationists. Combining the views of practising professionals with published works was necessary to develop well-grounded objectives for answering the research question. Accordingly, the study pursued the following objectives:

To task ChatGPT to generate varied questions of the same level of difficulty on a subject matter.

To task ChatGPT to generate varied questions on a subject matter with case scenarios.

To task ChatGPT to generate varied questions on a subject matter that requires critical thinking.

To task ChatGPT to generate varied questions on a subject matter that take into consideration the diverse needs and backgrounds of students.

To task ChatGPT to help streamline the assessment process.

These objectives, when explored, will help answer the research question by showing the diverse ways ChatGPT can be used to transform examination writing in Higher Education, thereby addressing the gap in the literature.

Justification of the objectives

The paper justifies the above objectives by providing the rationale for their formulation, rooted in scholarly works. Objective one is supported by Brown and Race (2012), who show that varied assessment questions better evaluate student learning outcomes and reduce student anxiety. Pereira et al. (2023) add that varied questions allow for a more comprehensive evaluation of students’ knowledge. Objective two is justified because case scenarios in examination questions place the content in a real-world context, offering an effective way to assess students' higher-order thinking, as espoused by Casey et al. (2009) and Smith and Mucklow (2016). Objective three targets critical thinking skills, which are widely considered necessary in learning, especially in Higher Education (Wals & Jickling, 2002; Behar-Horenstein & Niu, 2011; Liu et al., 2014). Higher Education thus requires students to interpret, analyse, evaluate, explain, and infer (Facione, 1990), and a chatbot that can assist with setting such questions will offer significant relief to educators. Objective four addresses equity and fairness, pivotal aspects driving educational reforms: an assessment process that considers students' diverse backgrounds will ensure that some groups of students are not disadvantaged in the examination process (Benadusi, 2001; Gilbert et al., 2011; Ling & Nasri, 2019). Objective five is motivated by the fact that the assessment process can be tedious, especially when the number of students is large; a shared assessment practice is therefore advised (López-Pastor & Sicilia-Camacho, 2017). The assessment process also requires promptly providing high-quality, actionable feedback to students (Pereira et al., 2016). Therefore, having a chatbot to assist with this process will significantly benefit educators.

Contextual scope of the study

Within examination transformation, the study's focus is narrowed to setting examination questions and streamlining the assessment process. The study is chiefly concerned with the automated generation of examination questions and marking schemes for assessment using ChatGPT.

Significance of the study

The introduction of ChatGPT has opened the way for researchers to engage in academic discourse regarding the prospects of the tool. Rudolph et al. (2023) highlight the importance of ChatGPT as a tool to aid students in learning and to lessen teachers' workloads. In tourism education and research, ChatGPT has been found to have the potential to revolutionise the field, as argued by Ivanov and Soliman (2023b). Despite the ethical concerns about academic integrity, Adiguzel et al. (2023), in their study “Exploring the transformative potential of ChatGPT”, reiterate that the tool can significantly help students and teachers in academia. Benuyenah (2023) backs this claim with the submission that despite all the challenges of AI, “we must acknowledge that Higher Education will thrive alongside any AI evolution as long as we learn to adapt our pedagogy and assessment strategies”. This is in line with the position of this paper that, notwithstanding the challenges and concerns, AI, when used responsibly, can significantly benefit the educational sector. This study was therefore conducted to reiterate the benefit of AI in education, specifically the use of ChatGPT.

Research approach

ChatGPT was queried by way of prompt engineering to test its ability to accomplish each of the objectives. The research approach took a cue from Lund and Wang (2023), who interviewed ChatGPT on its potential impact on academia. The researchers' participant in the study was ChatGPT; the primary data gathered were the responses given by ChatGPT when prompted. The study approach was also inspired by Fernandez (2023), whose study, which sought to provide a foundational understanding of the technology behind ChatGPT, was written almost entirely by asking ChatGPT questions and verifying the answers, with some editing. These two approaches shed light on a new paradigm of research methodology. This methodology could offer valuable insights and perspectives that may not be readily attainable through traditional statistical methods, making it a suitable choice for exploring new and uncharted research areas. It also means that the analysis and discussion of findings may not strictly align with those of traditional statistical methods. Considering the pioneering nature of this study, it was appropriate for the researchers to follow the path of these earlier researchers and their methodology.

Adopting the approach of Lund and Wang (2023) and Fernandez (2023), the researchers began with a warm-up conversation with ChatGPT to familiarise the chatbot with the study being conducted. This allowed ChatGPT to understand the study and ask the questions necessary to prepare it for the task. The chatbot demonstrated an excellent understanding of the study and expanded on the objectives being sought. A prompt was created for each objective to direct the chatbot on what to do. The version of ChatGPT used was version 3.5, chosen because it was free, required no subscription, and is the most widely used version. Below is the ChatGPT transcript of its understanding of the research approach that was adopted:

The approach you have described for your study, adopting similar methods utilised by Lund and Wang (2023) and Fernandez (2023), seems exciting and appropriate for your research objectives. Let us break down the critical elements of their approaches:

Interviewing ChatGPT: Lund and Wang conducted interviews with ChatGPT to explore its potential impact on academia and libraries. This approach allowed them to directly engage with the AI language model and gain insights into its capabilities and potential applications in specific contexts.

Questioning and Verifying Answers: Fernandez's study involved asking ChatGPT questions and verifying the answers provided by the AI language model. This verification process was essential to ensure the accuracy and reliability of the information generated by ChatGPT.

By adopting a similar approach in your study, you aim to interact with ChatGPT and subject it to prompt engineering to test its ability to generate varied questions, create scenarios, and address critical thinking while considering the diverse needs of students. This approach aligns well with your research goals of exploring ChatGPT's potential in transforming examination writing in education.

Engaging directly with ChatGPT through interviews and question-and-answer sessions can provide valuable insights into the AI's capabilities and limitations, allowing you to assess its suitability to aid teachers in examination writing and assessment processes. Additionally, this approach can help highlight the benefits and challenges of using ChatGPT in an educational context, contributing to the academic discourse on AI's role in education.

Overall, the chosen approach is a suitable and innovative way to investigate ChatGPT's potential in transforming examination writing, and it may provide valuable findings for the research study. If you have any specific guidelines or prompts for each objective, please share them. I will be happy to assist you further with generating the content for your study based on the chosen approach.

Transcript 1. Source: ChatGPT conversation, 2023.

Sampling and Sampling Technique

The direct respondent for the study was the chatbot ChatGPT, as in Lund and Wang (2023) and Fernandez (2023). One of the researchers, who had extensive experience with prompt engineering and had used the chatbot over a long period, queried the chatbot; the chatbot was tested in a computer-related field. The research team included lecturers and an educationist.

Data Validation

To verify and validate the outputs, some lecturers and exam moderators assessed the questions generated by the chatbot alongside the other researchers. There was a consensus on the validity of the questions and the assessment guide generated by the chatbot.

Methodological Assumptions and Constraints

The key assumption relates to the chatbot’s capability to transform examination writing through prompt engineering. It is further assumed that the qualitative analysis of the chatbot’s responses provides valuable insights into its potential and limitations for enhancing exam writing, and that prompt engineering is the most effective way to evaluate the chatbot’s capabilities. The main constraint of the study is the researchers’ sole reliance on prompt engineering rather than actual student testing of chatbot-generated exam questions. Additionally, due to the interactive nature of the chatbot interviews, standard statistical approaches are not followed in the analysis and interpretation of the results. Finally, the testing was limited to version 3.5 of ChatGPT, which limits the generalisability of the findings to other versions.

Prompt for study objectives

The examination prompt is provided in this section for transparency and reproducibility. Transparency ensures that researchers and policymakers have a shared understanding of the study’s purpose and intended outcomes, bringing clarity and minimising ambiguity in the research process. The prompt clearly outlines the study’s objectives; for reproducibility, it will serve as a reference point for future iterations and evaluation of the study’s findings.

Examination prompt

The prompt below was given to the chatbot in line with the research objectives. The prompt instructs the chatbot to ask the user questions that enable it to meet the goal of the prompt. These questions were tailored to the objectives: the area of study in which the user wishes to set questions, the focus areas, the number of varied questions, the marks allocated to each question, and the difficulty level, that is, whether the questions target beginner, intermediate or advanced students. Based on the warm-up conversation, the chatbot understood the prompt's goal well and responded to it clearly.

For this task, you will be known as " The Examiner ". when you are ready, introduce yourself and begin. You are an experienced examiner; your task is to help a user draft sample examination question for students. It would be best to ask the user the following questions to guide you. What area of study does the user wish for you to set the questions in, and what are the focus areas the user intends to have the questions set in? You will then ask the user the number of varied questions to set, and the marks allocated to each question. The marks allocated for each question are solely for that question and should not be added together when providing the grading assessment. The marks allocated will determine the depth of the question and how much the student will need to write, as in the length of the text. These varied questions should have the same difficulty level and require the same effort. You will also need to know the necessary difficulty level and critical thinking ability, thus, if the exam will be administered to intermediate or advanced beginners. The user also needs to specify the format the exam should take; thus, whether it is an essay, multiple choice, or case scenario, you must also ask the user. Lastly, ask the user whether there are any special needs of those who will sit for the exam. Finally, provide an assessment criterion for the questions, including the marking scheme. The scoring area should be put in a table with the respective marks. Ask one question at a time.

Source: Researcher’s prompt, 2023.
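The sequence of questions that the prompt asks the chatbot to put to the user, one at a time, can be sketched as a small data structure. This is only an illustrative sketch, not part of the study's method: the names `EXAMINER_QUESTIONS`, `ExamSpec` and `collect_spec` are hypothetical, the question wording is paraphrased from the prompt, and the example answers marked as assumed are not taken from the study.

```python
from dataclasses import dataclass

# Questions the Examiner prompt asks the user, in order (paraphrased).
EXAMINER_QUESTIONS = [
    ("area_of_study", "What area of study should the questions be set in?"),
    ("focus_areas", "What focus areas should the questions cover?"),
    ("num_questions", "How many varied questions should be set?"),
    ("marks_per_question", "How many marks are allocated to each question?"),
    ("difficulty", "Is the exam for beginner, intermediate or advanced students?"),
    ("exam_format", "Should the exam be essay, multiple choice or case scenario?"),
    ("special_needs", "Do any candidates have special needs?"),
]


@dataclass
class ExamSpec:
    """Hypothetical record of the user's answers to the Examiner's questions."""
    area_of_study: str
    focus_areas: list
    num_questions: int
    marks_per_question: int
    difficulty: str
    exam_format: str
    special_needs: str = "none"

    def summary(self) -> str:
        # One-line description of the exam the user has requested.
        return (
            f"{self.num_questions} {self.exam_format} questions on "
            f"{self.area_of_study} ({', '.join(self.focus_areas)}), "
            f"{self.difficulty} level, {self.marks_per_question} marks each, "
            f"special needs: {self.special_needs}"
        )


def collect_spec(answers: dict) -> ExamSpec:
    """Walk through the questions in order and build an ExamSpec."""
    collected = {}
    for key, question in EXAMINER_QUESTIONS:
        if key not in answers:
            raise KeyError(f"No answer given for: {question}")
        collected[key] = answers[key]
    return ExamSpec(**collected)


spec = collect_spec({
    "area_of_study": "Artificial Intelligence",
    "focus_areas": ["history of AI"],
    "num_questions": 5,
    "marks_per_question": 10,
    "difficulty": "beginner",
    "exam_format": "essay",   # assumed for illustration
    "special_needs": "none",  # assumed for illustration
})
print(spec.summary())
```

Recording the answers in a structure like this is one possible way to keep a transparent log of what was specified for each set of generated questions.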

Generating varied questions of the same level of difficulty on a subject matter

The goal of this test

Figure 2 shows the results when the chatbot was tasked to generate varied questions of the same difficulty level on a subject matter. This was to test objective 1. This approach to examination writing has been shown to be effective (Brown & Race, 2012; Pereira et al., 2023). The result of this test provides evidence of the chatbot's capabilities in aiding the examination process.

ChatGPT, when put to the test, could generate varied questions for a subject area specified by the user. In doing so, the chatbot could remember the focus areas within the given subject area. When assessed, these diverse questions upheld the same difficulty level on the same content. The questions generated were original and accurate in line with the contents of the subject area. The varied questions took the form of multiple choice, essay and case scenarios, as directed by the prompt. Although ChatGPT performed this task well, there were a few instances where generated questions had to be slightly modified. The chatbot version used was also limited in generating examination content with images and figures. Figure 2 below shows some of the varied questions generated.

Figure 2 above shows five sets of varied questions generated for a beginner level in the history of Artificial Intelligence. A critical look at the questions reveals that, despite the variation in each question, all questions have a point of convergence, which is the history of AI. For each question, the student will have to provide a little discussion on AI and its history. The chatbot could also allocate marks of 10, considering that this set of questions was for beginners and would not require them to write a lot.

The chatbot was again put to the test to generate 40 multiple-choice questions in IT and computer fundamentals. The prompt directed the chatbot to focus on seven key areas: the basic components of the computer, the functional areas of the computer, the computer as an electronic device, the computer as a programmable device, types of software, the Internet, and lastly the concepts of information and data and the difference between them.

The chatbot could generate questions targeting these areas when put to the test. It could also section each set of questions per the areas given, as depicted in Fig. 2.
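To make the arithmetic of sectioning concrete, the sketch below splits 40 questions across the seven focus areas as evenly as possible. This is an illustrative assumption only: the study does not report how many questions fell in each section, and the function name `allocate_questions` and the shortened area labels are hypothetical.

```python
from typing import Dict, List

# Shortened labels for the seven focus areas named in the prompt.
FOCUS_AREAS = [
    "basic components of the computer",
    "functional areas of the computer",
    "the computer as an electronic device",
    "the computer as a programmable device",
    "types of software",
    "the Internet",
    "information and data",
]


def allocate_questions(total: int, areas: List[str]) -> Dict[str, int]:
    """Split `total` questions across `areas` so that counts differ by at most one."""
    if not areas:
        raise ValueError("at least one focus area is required")
    base, extra = divmod(total, len(areas))
    # The first `extra` areas each receive one additional question.
    return {area: base + (1 if i < extra else 0) for i, area in enumerate(areas)}


allocation = allocate_questions(40, FOCUS_AREAS)
```

Under this even split, five areas receive six questions and two receive five; an examiner could equally weight the areas by teaching time instead.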

Generating varied questions with case scenarios

The goal of this test

The figure below shows the results obtained when the chatbot was asked to generate varied questions with case scenarios based on a subject matter. This was the test for objective 2. This approach to examination writing is supported by study evidence (Casey et al., 2009; Smith & Mucklow, 2016). The result of this test provides evidence for the chatbot's capabilities in aiding the examination process.

ChatGPT could generate questions targeted to case scenarios. The chatbot handled this well by providing a case scenario and then generating a question to align with it. Figure 3 shows a case scenario for each set of questions first and then a connecting question for the student. Such output gives a better understanding of the question and guides the teacher or lecturer on what to expect from students' answers. Thus, each case scenario question has a reference point in which the question is rooted.

Generating varied questions on a subject matter that requires critical thinking

The goal of this test

The figure below shows the results obtained when the chatbot was asked to generate varied questions that require critical thinking on a subject matter. This was the test for objective 3. This approach to examination writing is supported by study evidence (Wals & Jickling, 2002; Behar-Horenstein & Niu, 2011; Liu et al., 2014). The result of this test provides evidence for the chatbot's capabilities in aiding the examination process.

ChatGPT, when put to the test, could create questions that required critical thinking to answer. These questions covered beginner, intermediate, and advanced levels.

Figure 4 shows the output questions in the subject area of information security for beginners, focusing on encryption, VPN, and cyber attacks. For the beginner questions, we can see that the wording used straightforward language and fewer technical words (Bates, 1995; Ranney, 2012). The questions relate very well to real-life issues, which makes them easy for the student to understand and conceptualise. Such understanding and ability to conceptualise will enable the student to think critically about the question and answer appropriately.

Figure 5 shows the output for three questions generated in the same subject area and focus, this time for intermediate students. The wording of these questions is pitched slightly higher and uses more technical words. These questions require that students have some prior knowledge and a good level of understanding of the subject areas. Words and phrases like "analyse", "describe", and "discuss organisational context" call for an intermediate level of reasoning (Kurfiss, 1988; Biesenthal & Wilden, 2014; Elsbach & Stigliani, 2018).

Figure 6 provides three sets of questions in the same area and focus for advanced-level students. We can see that these higher-level questions, with words like "investigate", "assess", and "explore", require advanced-level thinking (Black, 2012; Sobkowiak, 2016).
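The link between command words and level described above can be illustrated with a short sketch. It is a simplification under stated assumptions: the verb lists contain only the words quoted in this section, the function name `infer_level` is hypothetical, and treating any question without a listed verb as beginner level is an assumption rather than a finding of the study.

```python
import re

# Command verbs quoted in the text for each level. No beginner verbs are listed
# in the source, so "beginner" is used as the default (an assumption).
LEVEL_VERBS = {
    "advanced": {"investigate", "assess", "explore"},
    "intermediate": {"analyse", "describe", "discuss"},
}


def infer_level(question: str) -> str:
    """Guess the intended level of a question from its command verbs."""
    words = set(re.findall(r"[a-z]+", question.lower()))
    # Check the highest level first so a mixed question is rated by its hardest verb.
    for level in ("advanced", "intermediate"):
        if words & LEVEL_VERBS[level]:
            return level
    return "beginner"
```

A check like this could help an examiner spot generated questions whose wording does not match the requested level before a human review; it does not replace that review.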

Generating varied questions on a subject matter that takes into consideration the diverse needs and backgrounds of students

The goal of this test

The figure below shows the results obtained when the chatbot was asked to generate varied questions on a subject matter that take into consideration the diverse needs and backgrounds of students. This was the test for objective 4. This approach to examination writing is supported by study evidence (Benadusi, 2001; Gilbert et al., 2011; Ling & Nasri, 2019). The result of this test provides evidence for the chatbot's capabilities in aiding the examination process.

Figure 8 shows three sets of questions for students with special needs. These are students who have English as a second language and are entirely new to the subject being taught. The chatbot was tasked to generate questions with very simple tenses for the students to grasp easily. We can see from the snapshots that straightforward language was used in the questions, demonstrating the chatbot's intelligence.

Streamlining Assessment Processes with ChatGPT

The goal of this test

The figure below shows the results obtained when the chatbot was asked to provide an assessment guide; this was the test for objective 5. Considering the tedious nature of providing high-quality and actionable feedback to students, the ability of the chatbot to perform this task will be extremely helpful. The result, as shown in Fig. 9, is evidence of the chatbot’s capabilities in aiding the examination process.

ChatGPT passed the test of producing assessment criteria and a marking scheme. It also provided advice on how lecturers should approach marking for students who may have some peculiar issues. This mirrors the approach a professional tutor or educationist would adopt when assessing students. The images below show some questions generated for beginner, intermediate and advanced levels.

Figure 9 shows a generated question intended for IT and computer fundamentals beginners.

Figure 10 shows generated questions for the intermediate IT and computer fundamentals level. Here, the chatbot demonstrates intelligence by assuming that at the intermediate level, the marks allocation should be higher than that of the beginner level.

Figure 11 shows a generated question for the advanced IT and computer fundamentals level. Here, the chatbot demonstrates intelligence by assuming that the marks allocation should be higher than that of the beginner and intermediate levels. Below are the images of the marking schemes for each of the levels. In the marking scheme for each level, the chatbot could provide scores for students' outputs. It ranked the grading as excellent, good, fair, and poor.

Figure 12 shows the marking scheme generated for the beginner-level questions.

Figure 13 shows the marking scheme generated for the intermediate-level questions.

Figure 14 shows the marking scheme generated for the advanced-level questions.
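The four grading bands can be expressed as a simple lookup. The sketch below is illustrative only: the band names come from the marking schemes, but the cut-off fractions and the function name `grade_band` are hypothetical, since the study does not state the thresholds ChatGPT used. In line with the prompt, each question is graded on its own marks and scores are not summed across questions.

```python
# Hypothetical cut-offs, expressed as fractions of the marks for one question.
# The source only names the four bands; these thresholds are assumptions.
BANDS = [(0.75, "excellent"), (0.50, "good"), (0.25, "fair"), (0.0, "poor")]


def grade_band(score: float, marks_available: float) -> str:
    """Map a score on a single question to one of the four named bands."""
    if marks_available <= 0 or not 0 <= score <= marks_available:
        raise ValueError("score must lie between 0 and marks_available")
    ratio = score / marks_available
    for cutoff, band in BANDS:
        if ratio >= cutoff:
            return band
    raise AssertionError("unreachable: the last cut-off is 0.0")
```

Keeping the thresholds in one table makes it easy for an examiner to review and adjust them before any generated marking scheme is used.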

Identified themes

The table below shows the themes that evolved from the study findings.

Table 1

Research themes
No	Theme	Description
1	Role of AI in Education	AI tools like ChatGPT have the potential to effectively assist educators in creating diverse and engaging examination questions.
2	Teacher-AI Collaboration	Teachers can leverage AI tools like ChatGPT in their teaching process.
3	Ethical Considerations	Ethical concerns surrounding AI training data must be considered to leverage AI tools' power.
4	Adapting Pedagogy	Educators can adapt teaching methodologies to integrate AI tools effectively into various educational levels.
5	Student Diversity and Inclusivity	Educators can use AI tools like ChatGPT to enhance inclusivity.
6	Future of Education Technology	AI tools can reshape education by impacting teaching methods and student engagement.
7	Professional Development	There is a need for educators to receive training and development to employ AI tools in teaching proficiently.
8	Innovative Research Methodologies	The study sheds light on the pioneering research method it employed, involving AI interactions and their impacts on the research findings.
Source: Developed from research findings, 2023

Results discussion

The results of this study shed important light on ChatGPT's potential and constraints for changing examination writing in higher education. Overall, the findings show that ChatGPT, when given the proper guidelines, has a significant potential to help instructors produce more diverse, inclusive, and critically engaging examination questions and assessments. In keeping with the first objective, ChatGPT demonstrated its ability to create questions on a particular topic for multiple-choice, short-answer, and extended essay formats that maintained comparable degrees of difficulty. This shows that it might make it easier for examiners to quickly create a bank of legitimate questions that cover the same ideas and abilities while adjusting question formats to meet various learning demands. In some instances, minor editing was still needed; this shows that human review of AI-generated content is still necessary. The study's second objective was met when ChatGPT successfully incorporated case scenarios into examination questions at the beginner, intermediate, and advanced levels when instructed to do so. According to research by Brown et al. (2013), contextualising issues fosters critical thinking and problem-solving abilities. This can be achieved within the examination context using case scenarios. The third objective was met by ChatGPT's generation of questions that required a range of critical thinking and reasoning skills, from simple recall to more intricate inference and evaluation. This ability to align questions to Bloom’s taxonomy could allow educators to systematically assess different orders of thinking (Anderson & Krathwohl, 2001). However, some creativity in prompting is needed to achieve this.

When the proper instructions and context were given, ChatGPT, in line with objective four, showed some proficiency in creating inclusive examination questions targeted to varied student needs. This is crucial when assessing equity (Herrera & Owens, 2001). However, ChatGPT's capabilities appeared to be restricted to specific groups, such as students with learning difficulties, and further work would be needed to cover a wide range of student backgrounds and accessibility needs. Finally, ChatGPT, in line with objective five, produced assessment rubrics and schemes to speed up the grading process. To ensure fairness and accuracy, educators would still need to test the results of AI grading and check the validity of the rubrics generated. Thus, while the chatbot has the potential to streamline the assessment process, it is not yet ready to automate this process entirely. It can be concluded that ChatGPT can assist educators in enhancing inclusiveness, critical thinking, and engagement in examination writing. However, there are still clear limitations, such as handling multimedia or mathematical questions requiring graphical modelling. Prompt engineering remains a critical part of getting the best out of ChatGPT. This work reveals ChatGPT’s tremendous promise in transforming examination writing; however, more research is still needed to create best practices for using AI in fair, ethical, and effective ways to improve examination writing.

Limitations of the study

While this study provides valuable initial insights into ChatGPT's potential for transforming exam writing in Higher Education, certain limitations stem from its exploratory qualitative methodology. A fundamental limitation is that the assessment relied solely on prompt engineering to elicit responses from ChatGPT rather than piloting exam questions with actual students. This constrains evaluation of the chatbot's real-world viability and long-term impacts on student learning and equity. Additionally, the small sample size, with only the authors conducting the prompting and qualitative analysis, restricts the generalizability of the findings. The study is also limited by its focus on only one version of ChatGPT (v3.5), so the results may not extend to the capabilities of other releases. Furthermore, the conversational, interactive nature of the methodology means the analysis does not adhere to traditional statistical techniques, making it challenging to measure ChatGPT's performance systematically. Finally, the lack of comparison to human examiners or other AI systems provides little benchmarking to contextualise ChatGPT's capabilities. Overall, while this study offers a valuable proof-of-concept and springboard for future research, the limitations mean its real-world feasibility and implications need further investigation through more rigorous and empirical study designs. This initial groundwork identifies key opportunities and indicates significant scope for expanding on this exploratory study to understand if and how ChatGPT could meaningfully transform exam writing in higher education.

Future research possibilities

With its findings, this pioneering study sets the stage for future research to investigate ChatGPT's potential for transforming exam writing further. Future studies can include, but are not limited to:

A controlled experimental study comparing students’ performance on ChatGPT-generated vs. human-generated examination questions. Such a study could provide empirical evidence on the feasibility and impact of using ChatGPT

A study testing the capabilities of ChatGPT across different subject domains beyond the fields explored in this study

A study comparing multiple versions of ChatGPT over time as the technology rapidly advances

A study to develop best practice guidelines for educators on optimally leveraging ChatGPT to enhance assessment based on rigorous prompts and output analysis

A study to investigate the integration of ChatGPT with other learning management systems and exam delivery platforms

A study expanding the test to other generative AI models, such as Google Bard, Claude, etc., to compare strengths and weaknesses

A study to understand the needs of educators and students’ perceptions and adoption challenges regarding AI for exam writing.

Practical applications

In addition to the future research possibilities that can arise from this pioneering study, the findings also suggest several practical applications for using ChatGPT to transform exam writing; these include:

Professional development training for educators on prompt engineering

Teachers/Lecturers can implement ChatGPT to aid question bank development

Teachers/Lecturers can utilise ChatGPT to quickly generate diverse exam formats while retaining oversight over the content

ChatGPT's exam-generation capabilities can be incorporated into learning management and exam delivery platforms

For equitable assessment, ChatGPT can aid in systematically creating accessible, personalised exams suited for diverse student needs

Beyond exams, ChatGPT shows promise for enhancing project evaluation, group assignments, and other key assessments to augment learning.

Informing educational policies

The study is timely and pertinent to the current discourse on using generative AI tools in educational settings. Drawing from future research possibilities and practical applications, the study offers valuable insights that can contribute to the formulation of informed educational policies, particularly in Higher Education. The findings also emphasise the need for educators to be encouraged and empowered to leverage AI tools to support their teaching.

Reinforcing educational technology research

The study's findings offer novel insights that contribute to the discussion of the use of AI in higher education and to the broader scope of educational technology, thereby reinforcing research in this area. The study reinforces educational technology research in the following ways:

1. The study's findings reveal the capabilities of AI tools like ChatGPT in assisting core educational processes such as assessment through the examination process. This revelation adds to the growing research on AI applications in the broader scope of education.

2. The use of prompt engineering provides the technique for effectively harnessing the power of generative AI tools for educational purposes. Additionally, the need for human guidance further adds to the growing research on human-AI collaboration rather than the full autonomy of AI tools.

3. The study's findings provide evidence for the capabilities and limitations of ChatGPT, thereby helping to provide realistic expectations with the use of generative AI tools.

4. The study’s findings revealed ChatGPT’s ability to produce questions tailored to students' diverse needs, which shows promise in enhancing inclusivity and personalisation. This ties into the goals of equity and accessibility in the broader scope of educational technology.

5. The study’s findings revealed ChatGPT’s ability to streamline the assessment process, which reinforces research on leveraging AI tools to optimise the educational process.

Funding

The research did not receive any financial support.

Availability of data and materials

The data reported in this manuscript is the response from ChatGPT as a result of the prompts given. There is no external data reported outside of the prompt results. All responses have been screenshotted and provided within the content of the manuscript, for which analysis has been performed. There are no external links or repositories for these screenshots except for what has been provided in this manuscript.

Ethics approval

Not applicable (the study did not involve the use of human subjects)

Acknowledgement

Not applicable

Author Contribution

HAB: Conceptualisation, analysis; RDO: Proofreading and editing; VA: Analysis and validation; LE: Analysis and validation; BAB: Proofreading and validation

Public Interest Statement

This study is timely and relevant, considering that generative AI is trending with growth in Large Language Models. The development of AI-enabled chatbots, like ChatGPT, has caught public attention with its capabilities. This study explores how ChatGPT's capabilities can be harnessed through prompt engineering to transform exam writing in higher education. Through conversations, the chatbot was tested on its capability to generate examination questions and assessment guides. Although requiring human oversight, ChatGPT shows promise in assisting educators with the examination process. While cautioning against some genuine concerns raised against the over-reliance on ChatGPT, findings from the study suggest that ChatGPT could make exams fairer and save teachers' time. This pioneering research highlights ChatGPT’s transformative possibilities for higher education, stimulating the discussion on the need for policies to guide the use of AI-enabled chatbots like ChatGPT in education.

References

Adiguzel, T., Kaya, M. H., & Cansu, F. K. (2023). Revolutionizing education with AI: Exploring the transformative potential of ChatGPT. Contemporary Educational Technology, 15(3), ep429.
Agius, A., Jones, K., Epple, R., Morelato, M., Moret, S., Chadwick, S., & Roux, C. (2017). The use of handwriting examinations beyond the traditional court purpose. Science & Justice, 57(5), 394–400.
Ahmad, S. F., Alam, M. M., Rahmat, M. K., Mubarik, M. S., & Hyder, S. I. (2022). Academic and administrative role of artificial intelligence in education. Sustainability, 14(3), 1101.
Akimov, A., & Malin, M. (2020). When old becomes new: a case study of oral examination as an online assessment tool. Assessment & Evaluation in Higher Education, 45(8), 1205–1221.
Al Braiki, B., Harous, S., Zaki, N., & Alnajjar, F. (2020). Artificial intelligence in education and assessment methods. Bulletin of Electrical Engineering and Informatics, 9(5), 1998–2007.
Alam, A. (2021). Possibilities and apprehensions in the landscape of artificial intelligence in education. 2021 International Conference on Computational Intelligence and Computing Applications (ICCICA), 1–8.
Anderson, L. W., & Krathwohl, D. R. (2001). A taxonomy for learning, teaching, and assessing: A revision of Bloom’s taxonomy of educational objectives: complete edition. Addison Wesley Longman, Inc.
Ardid, M., Gómez-Tejedor, J. A., Meseguer-Dueñas, J. M., Riera, J., & Vidaurre, A. (2015). Online exams for blended assessment. Study of different application methodologies. Computers & Education, 81, 296–303.
Astin, A. W. (2012). Assessment for excellence: The philosophy and practice of assessment and evaluation in higher education. Rowman & Littlefield Publishers.
Baskara, F. X. R. (2023). Chatgpt-Assisted English Language Learning: Theoretical Implications for Global Mobility and Cross-Cultural Communication. International Conference on Language and Language Teaching, 105–120.
Bates, M. (1995). Models of natural language understanding. Proceedings of the National Academy of Sciences, 92(22), 9977–9982.
Behar-Horenstein, L. S., & Niu, L. (2011). Teaching critical thinking skills in higher education: A review of the literature. Journal of College Teaching & Learning (TLC), 8(2).
Benadusi, L. (2001). Equity and Education: a critical review of sociological research and thought. In Pursuit of Equity in Education: Using International Indicators to Compare Equity Policies, 25–64.
Benuyenah, V. (2023). Commentary: ChatGPT use in higher education assessment: Prospects and epistemic threats. Journal of Research in Innovative Teaching and Learning, 16(1), 134–135. https://doi.org/10.1108/JRIT-03-2023-097
Biesenthal, C., & Wilden, R. (2014). Multi-level project governance: Trends and opportunities. International Journal of Project Management, 32(8), 1291–1308.
Biswas, S. (2023). Role of Chat GPT in Education. Available at SSRN 4369981.
Black, B. (2012). An overview of a programme of research to support the assessment of critical thinking. Thinking Skills and Creativity, 7(2), 122–133.
Brown, G. A., Bull, J., & Pendlebury, M. (2013). Assessing student learning in higher education. Routledge.
Brown, S., & Race, P. (2012). Using effective assessment to promote learning. In University teaching in focus (pp. 74–91). Routledge.
Cardon, P., Fleischmann, C., Aritz, J., Logemann, M., & Heidewald, J. (2023). The Challenges and Opportunities of AI-Assisted Writing: Developing AI Literacy for the AI Age. Business and Professional Communication Quarterly, 23294906231176516.
Casey, P. M., Goepfert, A. R., Espey, E. L., Hammoud, M. M., Kaczmarczyk, J. M., Katz, N. T., Neutens, J. J., Nuthalapaty, F. S., & Peskin, E. (2009). To the point: reviews in medical education—the Objective Structured Clinical Examination. American Journal of Obstetrics and Gynecology, 200(1), 25–34.
Cingillioglu, I. (2023). Detecting AI-generated essays: the ChatGPT challenge. The International Journal of Information and Learning Technology, 40(3), 259–268. https://doi.org/10.1108/IJILT-03-2023-0043
Cowls, J., Tsamados, A., Taddeo, M., & Floridi, L. (2021). The AI gambit: leveraging artificial intelligence to combat climate change—opportunities, challenges, and recommendations. Ai & Society, 1–25.
Currie, G. M. (2023). Academic integrity and artificial intelligence: is ChatGPT hype, hero or heresy? Seminars in Nuclear Medicine.
Dai, Y., Liu, A., Qin, J., Guo, Y., Jong, M. S., Chai, C., & Lin, Z. (2023). Collaborative construction of artificial intelligence curriculum in primary schools. Journal of Engineering Education, 112(1), 23–42.
Milmo, D. (2023). ChatGPT reaches 100 million users two months after launch. The Guardian. https://www.theguardian.com/technology/2023/feb/02/chatgpt-100-million-users-open-ai-fastest-growing-app
Devi, K. V., Manjula, V., & Pattewar, T. (2023). ChatGPT: Comprehensive Study on Generative AI Tool. Academic Guru Publishing House.
Diwaker, C., Sharma, A., & Tomar, P. (2020). Artificial Intelligence in Higher Education and Learning. https://doi.org/10.4018/978-1-7998-4763-2.ch004
Dobrucali, E., Demirkesen, S., Sadikoglu, E., Zhang, C., & Damci, A. (2022). Investigating the impact of emerging technologies on construction safety performance. Engineering, Construction and Architectural Management, ahead-of-print. https://doi.org/10.1108/ECAM-07-2022-0668
Duan, Y., Edwards, J. S., & Dwivedi, Y. K. (2019). Artificial intelligence for decision making in the era of Big Data–evolution, challenges and research agenda. International Journal of Information Management, 48, 63–71.
Elbanna, S., & Armstrong, L. (2023). Exploring the integration of ChatGPT in education: adapting for the future. Management & Sustainability: An Arab Review, ahead-of-print. https://doi.org/10.1108/MSAR-03-2023-0016
Elkhatat, A. M., Elsaid, K., & Almeer, S. (2023). Evaluating the efficacy of AI content detection tools in differentiating between human and AI-generated text. International Journal for Educational Integrity, 19(1), 17.
Elsbach, K. D., & Stigliani, I. (2018). Design thinking and organizational culture: A review and framework for future research. Journal of Management, 44(6), 2274–2306.
Facione, P. (1990). Critical thinking: A statement of expert consensus for purposes of educational assessment and instruction (The Delphi Report).
Fernandez, P. (2023). “Through the looking glass: envisioning new library technologies” AI-text generators as explained by ChatGPT. Library Hi Tech News, 40(3), 11–14. https://doi.org/10.1108/LHTN-02-2023-0017
Fiok, K., Farahani, F. V, Karwowski, W., & Ahram, T. (2022). Explainable artificial intelligence for education and training. The Journal of Defense Modelling and Simulation, 19(2), 133–144.
Gamble, A. (2020). Artificial intelligence and mobile apps for mental healthcare: a social informatics perspective. Aslib Journal of Information Management, 72(4), 509–523. https://doi.org/10.1108/AJIM-11-2019-0316
Gilbert, R., Keddie, A., Lingard, B., Mills, M., & Renshaw, P. (2011). Equity and education research, policy and practice: A review. Australian College of Educators National Conference, 201(1).
Gilbert, T. K., Lambert, N., Dean, S., Zick, T., Snoswell, A., & Mehta, S. (2023). Reward reports for reinforcement learning. Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 84–130.
Gong, X., Wen, Y., Liu, X., Zhao, H., Jing, S., Li, Q., Gao, K., & Luo, J. (2021). Development of GPU Enhanced AI Education Platforms for K-12 Schools. 2021 China Automation Congress (CAC), 5662–5666. https://doi.org/10.1109/CAC53003.2021.9727283
Halaweh, M. (2023a). ChatGPT in education: Strategies for responsible implementation.
Halaweh, M. (2023b). ChatGPT in education: Strategies for responsible implementation.
Hannan, E., & Liu, S. (2023). AI: new source of competitiveness in higher education. Competitiveness Review: An International Business Journal, 33(2), 265–279. https://doi.org/10.1108/CR-03-2021-0045
Hashimoto, D. A., Ward, T. M., & Meireles, O. R. (2020). The role of artificial intelligence in surgery. Advances in Surgery, 54, 89–101.
Herrera, T. A., & Owens, D. T. (2001). The "new math"?: Two reform movements in mathematics education. Theory into Practice, 40(2), 84–92.
Herrington, J., Reeves, T. C., & Oliver, R. (2014). Authentic learning environments. Springer.
Hift, R. J. (2014). Should essays and other “open-ended”-type questions retain a place in written summative assessment in clinical medicine? BMC Medical Education, 14(1), 1–18.
Imran, M., & Almusharraf, N. (2023). Analysing the role of ChatGPT as a writing assistant at higher education level: A systematic review of the literature. Contemporary Educational Technology, 15(4), ep464.
Ivanov, S., & Soliman, M. (2023a). Game of algorithms: ChatGPT implications for the future of tourism education and research. Journal of Tourism Futures, 9(2), 214–221. https://doi.org/10.1108/JTF-02-2023-0038
Ivanov, S., & Soliman, M. (2023b). Game of algorithms: ChatGPT implications for the future of tourism education and research. Journal of Tourism Futures, 1–8. https://doi.org/10.1108/JTF-02-2023-0038
Jaap, A., Dewar, A., Duncan, C., Fairhurst, K., Hope, D., & Kluth, D. (2021). Effect of remote online exam delivery on student experience and performance in applied knowledge tests. BMC Medical Education, 21(1), 1–7.
Khan, F. H., Pasha, M. A., & Masud, S. (2021). Advancements in microprocessor architecture for ubiquitous AI—An overview on history, evolution, and upcoming challenges in AI implementation. Micromachines, 12(6), 665.
Kohnke, L., Moorhouse, B. L., & Zou, D. (2023). ChatGPT for language teaching and learning. RELC Journal, 00336882231162868.
Korzynski, P., Altmann, A., & Paliszkiewicz, J. (2023). Generative artificial intelligence as a new context for management theories: analysis of ChatGPT. https://doi.org/10.1108/CEMJ-02-2023-0091
Koubaa, A., Boulila, W., Ghouti, L., Alzahem, A., & Latif, S. (2023a). Exploring ChatGPT Capabilities and Limitations: A Survey. IEEE Access.
Koubaa, A., Boulila, W., Ghouti, L., Alzahem, A., & Latif, S. (2023b). Exploring ChatGPT Capabilities and Limitations: A Survey. IEEE Access, 11, 118698–118721. https://doi.org/10.1109/ACCESS.2023.3326474
Kublik, S., & Saboo, S. (2022). GPT-3. O’Reilly Media, Incorporated.
Kurfiss, J. G. (1988). Critical Thinking: Theory, Research, Practice, and Possibilities. ASHE-ERIC Higher Education Report No. 2, 1988. ERIC.
Lai, E. R. (2011). Critical thinking: A literature review. Pearson’s Research Reports, 6(1), 40–41.
Lameras, P., & Arnab, S. (2021). Power to the teachers: an exploratory review on artificial intelligence in education. Information, 13(1), 14.
Ling, T., & Nasri, N. M. (2019). A systematic review: Issues on equity in education. Creative Education, 10(12), 3163.
Liu, O. L., Frankel, L., & Roohr, K. C. (2014). Assessing critical thinking in higher education: Current state and directions for next‐generation assessment. ETS Research Report Series, 2014(1), 1–23.
López-Pastor, V., & Sicilia-Camacho, A. (2017). Formative and shared assessment in higher education. Lessons learned and challenges for the future. Assessment and Evaluation in Higher Education, 42(1), 77–97. https://doi.org/10.1080/02602938.2015.1083535
Miao, F., Holmes, W., Huang, R., & Zhang, H. (2021). AI and education: A guidance for policymakers. UNESCO Publishing.
Morris, J. W., & Patterson, E. (2015). Podcasting and its apps: Software, sound, and the interfaces of digital audio. Journal of Radio & Audio Media, 22(2), 220–230.
Ng, D. T. K., Lee, M., Tan, R. J. Y., Hu, X., Downie, J. S., & Chu, S. K. W. (2022). A review of AI teaching and learning from 2000 to 2020. Education and Information Technologies, 1–57.
Ng, D. T. K., Leung, J. K. L., Su, M. J., Yim, I. H. Y., Qiao, M. S., & Chu, S. K. W. (2022). AI Literacy Education in Primary Schools. In D. T. K. Ng, J. K. L. Leung, M. J. Su, I. H. Y. Yim, M. S. Qiao, & S. K. W. Chu (Eds.), AI Literacy in K-16 Classrooms (pp. 75–86). Springer International Publishing. https://doi.org/10.1007/978-3-031-18880-0_6
Omar, N., Johari, Z. ‘Amirah, & Smith, M. (2017). Predicting fraudulent financial reporting using artificial neural network. Journal of Financial Crime, 24(2), 362–387. https://doi.org/10.1108/JFC-11-2015-0061
Opara, E. C., Adalikwu, M.-E. T., & Tolorunleke, C. A. (2023). ChatGPT for teaching, learning and research: Prospects and challenges. Global Academic Journal of Humanities and Social Sciences, 5.
OpenAI. (2022). Introducing ChatGPT. OpenAI.Com. https://openai.com/blog/chatgpt
Pardamean, B., Suparyanto, T., Cenggoro, T. W., Sudigyo, D., & Anugrahana, A. (2022). AI-Based Learning Style Prediction in Online Learning for Primary Education. IEEE Access, 10, 35725–35735. https://doi.org/10.1109/ACCESS.2022.3160177
Pedro, F., Subosa, M., Rivas, A., & Valverde, P. (2019). Artificial intelligence in education: Challenges and opportunities for sustainable development.
Pereira, D., Flores, M. A., & Niklasson, L. (2016). Assessment revisited: a review of research in Assessment and Evaluation in Higher Education. Assessment and Evaluation in Higher Education, 41(7), 1008–1032. https://doi.org/10.1080/02602938.2015.1055233
Pereira, D. S. M., Falcão, F., Costa, L., Lunn, B. S., Pêgo, J. M., & Costa, P. (2023). Here’s to the future: Conversational agents in higher education-a scoping review. International Journal of Educational Research, 122, 102233.
Pham, S. T. H., & Sampson, P. M. (2022). The development of artificial intelligence in education: A review in context. Journal of Computer Assisted Learning, 38(5), 1408–1421.
Pietrzak, P., Szczęsny, S., Huderek, D., & Przyborowski, Ł. (2023). Overview of Spiking Neural Network Learning Approaches and Their Computational Complexities. Sensors, 23(6), 3037.
Qasem, F., Ghaleb, M., Mahdi, H. S., Al Khateeb, A., & Al Fadda, H. (2023). Dialog chatbot as an interactive online tool in enhancing ESP vocabulary learning. Saudi Journal of Language Studies, 3(2), 76–86. https://doi.org/10.1108/SJLS-10-2022-0072
Qiu, R. (2023). Editorial: GPT revolutionizing AI applications: empowering future digital transformation. Digital Transformation and Society, 2(2), 101–103. https://doi.org/10.1108/DTS-05-2023-066
Rahman, M. M., & Watanobe, Y. (2023). ChatGPT for education and research: Opportunities, threats, and strategies. Applied Sciences, 13(9), 5783.
Rane, N. L., Choudhary, S. P., Tawde, A., & Rane, J. (2023). ChatGPT is not capable of serving as an author: ethical concerns and challenges of large language models in education. International Research Journal of Modernization in Engineering Technology and Science, 5(10), 851–874.
Ranney, S. (2012). Defining and teaching academic language: Developments in K‐12 ESL. Language and Linguistics Compass, 6(9), 560–574.
Rath, S., Sahu, B. K., & Nayak, M. R. (2019). Application of quasi-oppositional symbiotic organisms’ search based extreme learning machine for stock market prediction. International Journal of Intelligent Computing and Cybernetics, 12(2), 175–193. https://doi.org/10.1108/IJICC-10-2018-0145
Ritala, P., Ruokonen, M., & Ramaul, L. (2023). Transforming boundaries: how does ChatGPT change knowledge work? Journal of Business Strategy, ahead-of-print. https://doi.org/10.1108/JBS-05-2023-0094
Simpson, E., & Courtney, M. (2002). Critical thinking in nursing education: Literature review. International Journal of Nursing Practice, 8(2), 89–98.
Roschelle, J., Lester, J., & Fusco, J. (2020). AI and the Future of Learning: Expert Panel Report. Digital Promise.
Roumeliotis, K. I., & Tselikas, N. D. (2023). ChatGPT and Open-AI Models: A Preliminary Review. Future Internet, 15(6), 192.
Rudolph, J., Tan, S., & Tan, S. (2023). ChatGPT: Bullshit spewer or the end of traditional assessments in higher education? Journal of Applied Learning & Teaching, 6(1). https://doi.org/10.37074/jalt.2023.6.1.9
Russell, S. J. (2010). Artificial intelligence: A modern approach. Pearson Education, Inc.
Slavich, G. M., & Zimbardo, P. G. (2012). Transformational teaching: Theoretical underpinnings, basic principles, and core methods. Educational Psychology Review, 24, 569–608.
Smith, P. E. M., & Mucklow, J. C. (2016). Writing clinical scenarios for clinical science questions. Clinical Medicine, 16(2), 142.
Sobkowiak, P. (2016). Critical thinking in the intercultural context: Investigating EFL textbooks. Studies in Second Language Learning and Teaching, 6(4), 697–716.
Sok, S., & Heng, K. (2023). ChatGPT for education and research: A review of benefits and risks. Available at SSRN 4378735.
Su, J., Ng, D. T. K., & Chu, S. K. W. (2023). Artificial Intelligence (AI) Literacy in Early Childhood Education: The Challenges and Opportunities. Computers and Education: Artificial Intelligence, 4, 100124. https://doi.org/10.1016/j.caeai.2023.100124
Swan, K., Shen, J., & Hiltz, S. R. (2006). Assessment and collaboration in online learning. Journal of Asynchronous Learning Networks, 10(1), 45–62.
Talan, T., & Kalinkara, Y. (2023). The role of artificial intelligence in higher education: ChatGPT assessment for anatomy course. Uluslararası Yönetim Bilişim Sistemleri ve Bilgisayar Bilimleri Dergisi, 7(1), 33–40.
Wals, A. E. J., & Jickling, B. (2002). “Sustainability” in higher education: From doublethink and newspeak to critical thinking and meaningful learning. International Journal of Sustainability in Higher Education, 3(3), 221–232.
Wolcott, W., & Legg, S. M. (1998). An Overview of Writing Assessment: Theory, Research, and Practice. ERIC.
Xia, Q., Chiu, T. K. F., Zhou, X., Chai, C. S., & Cheng, M. (2022). Systematic literature review on opportunities, challenges, and future research recommendations of artificial intelligence in education. Computers and Education: Artificial Intelligence, 100118.
Yeo, M. A. (2023). Academic integrity in the age of Artificial Intelligence (AI) authoring apps. TESOL Journal, e716.
Zhao, J., Liang, B., & Chen, Q. (2018). The key technology toward the self-driving car. International Journal of Intelligent Unmanned Systems, 6(1), 2–20. https://doi.org/10.1108/IJIUS-08-2017-0008

No competing interests reported.


Editorial decision: Revision requested (23 Aug, 2024)
Reviews received at journal (17 Aug, 2024)
Reviewers agreed at journal (17 Aug, 2024)
Reviews received at journal (24 Jul, 2024)
Reviewers agreed at journal (24 Jul, 2024)
Reviewers invited by journal (02 Jul, 2024)
Editor assigned by journal (19 Jun, 2024)
Submission checks completed at journal (18 Jun, 2024)
First submitted to journal (08 Jun, 2024)
