It is well known that China has a large population, and the country faces considerable medical pressure due to a shortage of anesthesiologists. The number of day surgery procedures is increasing, which makes preoperative evaluation a particularly important part of anesthesia care (Ojo et al. 2010). At present, ChatGPT's exploration in the medical field remains focused mainly on medical education and scientific writing, with relatively little use in clinical and research scenarios (Shay et al. 2023; Kung et al. 2023). One of the key benefits of ChatGPT is its ability to provide instant, accurate, and personalized responses to a wide range of healthcare-related questions (Liu et al. 2023; Cascella et al. 2023; Odom-Forren et al. 2023). A study by Gupta (Gupta et al. 2024) searched the literature to determine how ChatGPT could be helpful to anesthesia providers, including in preoperative management, ICU management, pain management, and palliative care, and how it could provide additional assistance to anesthesiologists (e.g., in education, quality assurance, and research).
The ASA classification is an important index for the preoperative evaluation of both anesthetic and surgical risk, and it is widely used and recognized worldwide (Riley et al. 2014; Mayhew et al. 2019). Lim (Lim et al. 2023) reported that GPT was able to classify ASA-PS consistently and correctly, with appropriate justification, across multiple simulated patient scripts, performing similarly to human anesthesiologists in the majority of cases. In our study, ChatGPT showed some reference value for the ASA classification of patients' physical status, with its classifications being broadly similar to those of the expert group.
In this study, ChatGPT was used to extract information on each patient's medical history, examination results, type of surgery, and method of anesthesia. From this information, appropriate risk assessment indicators can be obtained, saving surgeons and anesthesiologists time and effort and making this a highly efficient method of preoperative evaluation. To our knowledge, this is the first study to assess the ability of ChatGPT to perform ASA grading and preoperative evaluation of patients, which may have clinical value.
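The workflow described above can be illustrated in code. The following is a minimal sketch only, not the study's actual protocol: the function name, field names, and prompt wording are all hypothetical, and in practice the assembled prompt would be sent to ChatGPT for assessment.

```python
# Illustrative sketch (hypothetical, not the study's actual prompt):
# assembling a structured preoperative summary into a single prompt
# asking an LLM for an ASA-PS class and a day-surgery suitability judgment.

def build_asa_prompt(patient: dict) -> str:
    """Format history, examinations, surgery type, and anesthesia method
    into one prompt requesting an ASA-PS class with justification."""
    return (
        "You are assisting with preoperative evaluation for day surgery.\n"
        f"Medical history: {patient['history']}\n"
        f"Examination results: {patient['exams']}\n"
        f"Planned surgery: {patient['surgery']}\n"
        f"Anesthesia method: {patient['anesthesia']}\n"
        "Assign an ASA Physical Status class (I-VI), state whether the "
        "patient is suitable for day surgery, and justify briefly."
    )

# Example patient record (fabricated for illustration only)
prompt = build_asa_prompt({
    "history": "Hypertension, well controlled on medication",
    "exams": "ECG normal; Hb 13.8 g/dL",
    "surgery": "Laparoscopic cholecystectomy",
    "anesthesia": "General anesthesia",
})
print(prompt)
```

Structuring the input in this way keeps the information ChatGPT receives consistent across patients, which matters when comparing its ASA classifications with those of an expert panel.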
ChatGPT and the experts differed on whether patients were suitable for day surgery. ChatGPT mostly recommended patients for day surgery after assessing their physical condition, surgical method, and anesthesia risk. For patients with an ASA ≥ 2, the panel preferred to recommend further examination and treatment before considering suitability for day surgery, and for more seriously ill patients the panel recommended canceling day surgery at a higher rate.
Do these results mean that ChatGPT is more aggressive in weighing the risks of anesthesia and surgery, while the panel is more conservative? Our team's analysis suggests several possible reasons for this difference: (1) it may be related to the working habits of each expert group, with some groups applying stricter indications for day surgery; (2) ChatGPT analyzes the patient's objective indicators and reaches a conclusion only after synthesizing all of them, which defines its reference value; (3) it is difficult to say which side is right or wrong, as the actual decision rests on the surgeon's or anesthesiologist's understanding of the guidelines and of the patient's condition.
Although our study showed the benefits of ChatGPT as a tool, several problems and limitations still need to be considered. Firstly, the correctness and validity of its content must be scrutinized, as incorrect content may mislead patients. While ChatGPT can provide a great deal of information and assistance, it cannot at present replace human healthcare workers in all situations. Unlike a search engine, it does not allow the sources of its information to be traced. Moreover, ChatGPT sometimes answers questions incorrectly, its training data have not been updated since 2021, and it cannot access the Internet in real time. To mitigate these limitations, manual auditing can be used to screen the generated content and judge its accuracy (Lee and Choi et al. 2023).
Secondly, attention should be paid to ethical and privacy issues. When communicating with ChatGPT, patients provide basic personal information and details of their medical conditions, sometimes including images of private body areas. Although ChatGPT claims not to save conversations with users, it must be understood that sensitive health information may be leaked or misused during transmission and browsing. It is therefore necessary to implement sound data protection measures, including encryption of sensitive information and secure data transmission. In addition, because ChatGPT can be highly emotionally engaging, care must be taken that anxious patients do not become psychologically dependent on this "friend". Strict ethical and privacy regulations therefore need to be established to limit the scale of ChatGPT's information input and emotional output.
Above all, ChatGPT has the potential to revolutionize the preoperative evaluation of patients by providing accurate and effective clinical assistance. As with any new technology, there are shortcomings that need to be addressed, but the potential benefits of ChatGPT in the field of ambulatory surgical evaluation are substantial. Development is the order of the day, and healthcare workers need to keep up with this trend and explore this promising area of technology. In this regard, it has been proposed that large language models (LLMs) such as ChatGPT have the potential to add a new dimension to solving clinical problems.
It is also important to recognize that ChatGPT is just a machine and cannot replace the humanity and compassion that are so essential to our profession (Odom-Forren et al. 2023). As we continue to explore the possibilities of AI in healthcare, we should embrace these new technologies and use them to augment, rather than replace, important clinical work (Odom-Forren et al. 2023).