In our study, we used a comprehensive inpatient business view constructed on a big data platform, encompassing the major aspects of a patient's hospitalization, including diagnoses, surgeries, medications, and examinations across different business domains. Its flexibility enables multidimensional data analysis and presentation, allowing analyses tailored to specific needs; this not only helps healthcare professionals better understand patients' conditions during their hospital stay, but also meets management requirements for hospital operations and healthcare service quality. By applying the QLoRA algorithm to the ChatGLM2-6b and Llama2-6b models and fine-tuning them on a local SQL dataset, we improved the models' performance on simple and moderate-difficulty SQL queries. Notably, ChatGLM2-6b outperformed Llama2-6b, possibly because of its stronger performance on Chinese-language data. Our results also confirmed the effectiveness of QLoRA on simple queries under limited computational resources, suggesting that fine-tuning LLMs on local datasets can yield further gains.
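As a concrete illustration, a QLoRA setup of the kind described above can be configured with the `peft` and `bitsandbytes` libraries. This is a configuration sketch only; the hyperparameter values and target module names below are illustrative assumptions, not the exact settings used in our experiments.

```python
import torch
from transformers import BitsAndBytesConfig
from peft import LoraConfig

# 4-bit NF4 quantization of the frozen base model (the "Q" in QLoRA);
# gradients flow only through the small LoRA adapter weights.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",          # NormalFloat4 quantization
    bnb_4bit_use_double_quant=True,     # also quantize the quantization constants
    bnb_4bit_compute_dtype=torch.bfloat16,
)

# Low-rank adapters injected into the attention projections.
# target_modules depends on the architecture, e.g. "query_key_value"
# for ChatGLM2-6b versus "q_proj"/"v_proj" for Llama-family models.
lora_config = LoraConfig(
    r=8,                                # adapter rank (illustrative)
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["query_key_value"],
    task_type="CAUSAL_LM",
)
```

The quantized base model would then be wrapped with `peft.get_peft_model(model, lora_config)` and trained on the local question–SQL pairs.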
Additionally, leveraging ChatGPT-3.5 with zero-shot and few-shot learning, we adapted the model to scenarios with little or no labeled data, without any parameter updates. In the zero-shot setting, the model showed remarkable performance with no additional input samples, relying only on its prior knowledge and well-designed prompts, and in particular outperformed the fine-tuned 6b models on simple queries. Few-shot learning further demonstrated the model's adaptability to a small number of input samples, with results comparable to those of professional database engineers, especially on medium and hard queries. Compared to traditional supervised learning, these methods offer clear advantages in resource-constrained environments by reducing dependence on extensive labeled data, providing an effective solution for practical applications facing data scarcity and limited computational resources.
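A minimal sketch of the few-shot prompting scheme: the prompt concatenates the table schema, a handful of question–SQL demonstrations, and the new question. The schema and example pairs below are hypothetical placeholders, not records from our hospital dataset.

```python
def build_nl2sql_prompt(schema, examples, question):
    """Assemble a few-shot NL2SQL prompt for a chat-style LLM.

    schema    -- textual description of the relevant tables
    examples  -- list of (natural-language question, SQL) demonstration pairs
    question  -- the new question to translate into SQL
    """
    parts = ["Given the database schema:", schema, ""]
    for nl, sql in examples:
        parts.append(f"Question: {nl}")
        parts.append(f"SQL: {sql}")
        parts.append("")
    parts.append(f"Question: {question}")
    parts.append("SQL:")
    return "\n".join(parts)


# Hypothetical schema and demonstration pair, for illustration only.
schema = "inpatient_view(patient_id, diagnosis, surgery, medication, admit_date)"
examples = [
    ("How many inpatients were admitted in 2023?",
     "SELECT COUNT(*) FROM inpatient_view WHERE admit_date LIKE '2023%';"),
]
prompt = build_nl2sql_prompt(schema, examples,
                             "List the medications given to patient 42.")
```

With an empty `examples` list, the same function degenerates to a zero-shot prompt containing only the schema and the question.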
Overall, by integrating a big data platform's inpatient business view with the QLoRA algorithm and the ChatGLM2-6b and Llama2-6b models, and fine-tuning on a local SQL dataset, we obtained robust empirical support for NL2SQL generation in the medical domain. Compared with manually crafting SQL statements, LLM-based NL2SQL generation translates natural language into SQL rapidly, saving significant time and human resources. With a sufficiently capable LLM, performance approaching that of professional database engineers can be achieved with appropriate prompts and a small number of samples, enabling non-professionals to conduct data analysis with ease.
By integrating this approach with the hospital's big data platform, we can establish diverse database views covering outpatient, inpatient, medical insurance, financial settlement, and other domains. Careful selection of foundational LLMs and the use of NL2SQL generation enable a wide range of medical data analysis tasks. Furthermore, integrating the approach into daily workflows facilitates the development of a comprehensive data-query knowledge base, enhancing the precision of the LLMs through the incorporation of extensive external knowledge. Using Python toolkits such as LangChain [30] and configuring distinct prompts provides tailored interfaces for various hospital management applications, including medical retrieval for patients [31], medical quality management [32], dialogue response generation [33], and BI systems in medicine [34]. This approach lays the groundwork for future applications that follow the outlined patterns, offering a promising avenue for enhanced efficiency and functionality.
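To illustrate how distinct prompts could serve different applications, the following pure-Python sketch maps each scenario to its own prompt template. In practice this role would be filled by LangChain's prompt-template utilities; the application keys and template wording here are hypothetical examples, not our production configuration.

```python
# One prompt template per hospital-management application; the keys and
# wording are illustrative assumptions only.
PROMPT_TEMPLATES = {
    "medical_retrieval": (
        "You answer patients' questions about their own records.\n"
        "Schema: {schema}\nQuestion: {question}\nSQL:"
    ),
    "quality_management": (
        "You generate SQL for hospital quality-management indicators.\n"
        "Schema: {schema}\nIndicator request: {question}\nSQL:"
    ),
    "bi_reporting": (
        "You generate SQL for management BI reports.\n"
        "Schema: {schema}\nReport request: {question}\nSQL:"
    ),
}

def render_prompt(application, schema, question):
    """Select and fill the template for the requested application."""
    template = PROMPT_TEMPLATES[application]  # KeyError for unknown apps
    return template.format(schema=schema, question=question)

prompt = render_prompt(
    "bi_reporting",
    schema="inpatient_view(patient_id, dept, cost, discharge_date)",
    question="Total inpatient cost per department in 2023",
)
```

Routing on an application key keeps each interface's instructions isolated, so a prompt tuned for patient-facing retrieval cannot leak into management reporting.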