Generative AI and digital twin integrated intelligent process planning：A conceptual framework

doi:10.21203/rs.3.rs-3652246/v1

Download PDF

Research Article

Generative AI and digital twin integrated intelligent process planning：A conceptual framework

https://doi.org/10.21203/rs.3.rs-3652246/v1

This work is licensed under a CC BY 4.0 License

You are reading this latest preprint version

Process planning serves as a critical link between design and manufacturing, exerting a pivotal influence on the quality and efficiency of production. However, current intelligent process planning systems, like computer-aided process planning (CAPP), still contend with the challenge of realizing comprehensive automation in process decision-making. These obstacles chiefly involve, though are not confined to, issues like limited intelligence, poor flexibility, low reliability, and high usage thresholds. Generative artificial intelligence (AI) has attained noteworthy accomplishments in natural language processing (NLP), offering new perspectives to address these challenges. This paper summarizes the limitations of current intelligent process planning methods and explores the potential of integrating generative AI into process planning. With synergistically incorporating digital twins, this paper introduces a conceptual framework termed generative AI and digital twin-enabling intelligent process planning (GIPP). The paper elaborates on two supporting methodologies: process generative pre-trained transformer (ProcessGPT) modelling and digital twin-based process verification method. Moreover, a prototype system is established to introduce the implementation and machining execution mechanism of GIPP for milling a specific thin-walled component. Three potential application scenarios and a comparative analysis are employed to elucidate the practicality of GIPP, providing new insights for intelligent process planning.

Intelligent process planning

CAPP

Generative AI

Transformer

Digital twin

As a crucial link between product design and manufacturing, process planning can bring improvements in cost, quality, and time-to-market and affect all manufacturing activities [1]. Disappointingly, process planning is more of an art than a science [2], heavily relying on planners' experience, skills, and intuition, resulting in limited process decision-making efficiency and suboptimal process plans [2, 3]. Developing an intelligent process planning approach to address the aforementioned issues is essential for manufacturers [4, 5].

Benefiting from its domain independence [6] and the capabilities of enabling process planning systems to possess self-adaptivity and self-learning, AI has become an essential tool to support decision-making in process planning, scheduling, machining, inspection, and more [7]. As a part of AI, the emergence of generative AI, like ChatGPT, has garnered widespread attention from researchers across various domains. According to research, ChatGPT is one of the generative AI models (GAIM) capable of generating human-like text responses for user queries based on knowledge acquired from massive datasets [8]. From the perspective of intelligent process planning, GAIM shares similar functionalities with knowledge-based systems (KBS) including querying, interpreting, reasoning, and generating solutions [9]. In research [10], GPT-4 is utilized in the design and manufacturing process, revealing its abilities in generating, applying, and iteratively supporting process knowledge. However, limitations were noted in quantitative reasoning, accuracy, and verification. Furthermore, as shown in Fig. 1, [11] uncovered the performance of ChatGPT in manufacturing domain with two requirements of evaluations. The results indicate that ChatGPT is impressive in providing information, generating coherent and structured content, and proposing initial solutions. However, when answering questions of critical analysis and intricate details within the manufacturing domain, ChatGPT's answers tend to lack reliability, traceability, and verifiability, posing a severe and unacceptable concern for manufacturers.

Thus, it is imperative to construct a GAIM trained from scratch specifically for process planning and to rigorously validate the process knowledge or plans it generates. Digital twin is characterized by seamless integration between physical and virtual spaces [12], which could prove to be a key enabler for efficient verification and validation processes [13]. It can be used to reduce the time and risk of reconfiguration by early detection of design or process sequence flaws of the system in virtual commissioning and simulation [14]. As an illustration, to reduce the total number of experiments and decrease costs, [15] and [16] employed a digital twin of a 3D printer to predict the quality of printed parts. Digital twin can serve as a crucial technology for validating the process knowledge or plans generated by GAIM, enhancing their reliability.

This paper aims to bridge the gap between conventional AI and generative AI within the context of intelligent process planning. By constructing GAIM trained from scratch specifically for process planning and introducing a validation method based on digital twins, a novel framework called generative AI and digital twin-enabling intelligent process planning (GIPP) is proposed.

The remainder of the paper is organised as follows. In Section 2, we introduce the research background behind this paper. Section 3 explores the characteristics, definition and framework of GIPP. Section 4 presents the two key methodologies of GIPP. A test bed of GIPP is constructed and then three application examples are analysed in Section 5. The conclusion and future work are found in Section 6.

This section introduces the research background of the paper, encompassing generative AI and intelligent process planning. It analyses and summarizes the current limitations of CAPP while presenting the research motivation.

2.1 Generative AI

This section provides a brief introduction to generative AI and summarizes the state-of-the-art research related to current applications of generative AI in three specific domains, including healthcare, education, and art.

2.1.1 Brief Introduction

Generative AI refers to AI that can generate novel content, rather than simply analysing or acting on existing data like expert systems [17]. Moreover, the emerging generative AI is trained on large-scale corpora or datasets using a discriminator or transformer model. By mapping input information into a latent high-dimensional space and employing a generator model, it can generate novel content based on each new input [18]. The ultimate goal is to leverage generative AI to assist or replace humans in creating diverse personalized and high-quality content more rapidly and at a lower cost [19–21].

Generative AI, also known as artificial intelligence generated content (AIGC), technically involves two stages [22]: (i) extracting and understanding user intent information, and (ii) generating the desired content based on the extracted intent. While prior research has explored this field [23, 24], the core advancements in generative AI lie in using a larger foundation model and training sophisticated generative models on significantly larger datasets. For example, GPT-3 [25], a successor to GPT-2 [26], maintains a similar foundational framework but undergoes training on a curated 570GB pre-training dataset instead of 38GB, and its foundation model size increases from 1.5B to 175B. Consequently, GPT-3 demonstrates superior generalization compared to GPT-2.

GAIM can be categorized into unimodal and multimodal models depending on input and output content. The models in unimodal receive instructions in the same form as the desired output content, while multimodal models accept cross-modal instructions and generate results in different forms, as shown in Fig. 2. Consequently, the diversity of AIGC enables its application in various domains, as elaborated in the subsequent section. The following section will introduce the application of generative AI in other fields.

2.1.2 Domain Application

The remarkable achievements of generative AI in the broader field of NLP have spurred extensive research endeavours across diverse domains. As shown in Table 1, this section presents a concise overview of generative AI's application landscape in three distinct domains: healthcare, education, and art.

Table 1

Statistics of the domain applications. Syn. means whether the model is pre-trained from scratch
Domain	References	Name	Base Model	Model Size	Syn.	Code
Healthcare	[27]	BioGPT	GPT-2	380M	✓	https://github.com/microsoft/BioGPT
	[28]	Huatuo	LLaMA	7B	✗	https://github.com/SCIR-HI/Huatuo-Llama-Med-Chinese
	[29]	Chatdoctor	LLaMA	7B	✗	https://github.com/Kent0n-Li/ChatDoctor
	[30]	Doctorglm	ChatGLM	6.2B	✗	https://github.com/xionghonglin/DoctorGLM
Education	[31]	Minerva	PaLM	8B/62B	✓	\
	[32]	MathGPT	GPT-2	380M	✓	https://github.com/umass-ml4ed/mathGPT
	[33]	GPTeach	GPT-3	1750B	✗	https://github.com/juliamarkel/GPTeach
Art	[34]	Jukebox	VQ-VAE	2M	✓	https://github.com/openai/jukebox
	[35]	CLAP	CNN14, BERT	80.8M 110M	✓	https://github.com/YuanGongND/vocalsound
	[36]	Stable Diffusion	Diffusion Decoders	1.45B	✓	https://github.com/compvis/stable-diffusion

In the field of healthcare, [27] propose BioGPT, a domain-specific generative transformer language model pre-trained on large-scale biomedical literature. It is a model based on the GPT-2 medium as the foundational model, and it is pre-trained from scratch on medical domain data. [28] propose HuaTuo, a Large Language Model Meta AI (LLaMA)-based [37] model that has been supervised-fine-tuned with generated QA (Question-Answer) instances. Chatdoctor [29] represents the first attempt to adapt LLM to the biomedical field by fine-tuning LLaMa using conversation demonstrations synthesized via ChatGPT. DoctorGLM [30] leverages ChatGLM-6B 38 as the base model and finetunes it with the Chinese translation of the ChatDoctor dataset, obtained through ChatGPT.

In education, Google Research introduce Minerva [31], a model based on the Poly-encoders for Language Modeling (PaLM) [38] and an additional dataset focusing on science and math. Minerva aims to tackle multi-step quantitative tasks at the university level, covering over 200 subjects. Recognizing the importance of mathematical language within scientific communication and educational scenarios, [32] introduce the MathGPT model. By adopting the foundational GPT-2 architecture, they showcase its enhanced performance over the base model in generating mathematical expressions. [33] capitalized on fine-tuning the GPT-3 model, resulting in the introduction of an interactive chat-based tool named GPTeach for teacher training purposes. This tool empowers novice educators to participate in practice sessions with simulated students.

In the domain of art, [34] developed a model known as Jukebox, designed to generate music by directly singing in the raw audio domain. They utilize Vector Quantized Variational Autoencoder (VQ-VAE) [39] to compress the raw audio into discrete codes, effectively handling long contexts. [40] addresses audio generation by leveraging natural language supervision and presents a cross-modal audio generation model. This is achieved through the incorporation of two encoders, specifically the 14-layer Convolutional Neural Networks (CNN14) [41] and Bidirectional Encoder Representations from Transformers (BERT) [42], in conjunction with contrastive learning. [43] introduce the Latent Diffusion Model (LDM), which is built upon the diffusion model [36], enabling high-resolution unconditional image generation and the synthesis of text-to-image.

2.2 CAPP

CAPP refers to the utilization of computer software and hardware technology along with its supporting environment to formulate the machining process for mechanical components through numerical calculations, logical assessments, and reasoning [44]. It primarily encompasses various process activities, including design data interpretation, machining operations, machine tool and cutting tool selection, referencing, fixture decisions, as well as cost and production time calculations. Over the past 50 years, CAPP has evolved significantly since Niebel's pioneering application of computer technology in process design in 1965 [45]. Various techniques have emerged, such as KBS, genetic algorithms, and internet-based approaches [46]. However, CAPP has been lagging in providing practical, mature, professional, and commercial solutions for the manufacturing industry [7], and achieving the desired CAPP remains a challenge.

Additionally, [1, 47] indicate the implementation of CAPP systems in enterprises, especially for small and medium-sized ones, is hindered by various limitations. Figure 3(a) illustrates the factors that restrict the adoption of CAPP systems in enterprise settings, encompassing the following key aspects.

Enterprises perceive that the current CAPP systems do not achieve the expected time savings and enhanced production efficiency.
Domain barriers exist, necessitating specialized training for employees to effectively utilize CAPP systems with extra costs, incurring additional costs.
The systems are domain-specific and not easily transferable, limiting their adaptability to environmental changes.
Automatic knowledge acquisition is lacking, and the system's limited fault tolerance poses challenges for application in small and medium-sized enterprises.
The generated process plans lack evaluation, validation, and optimization processes, leading to suboptimal process options.

With the above commercial feedback, [1] analyse and summarize the gaps between the existing CAPP systems and the desired CAPP systems, visually represented in Fig. 3(a). Therefore, future intelligent process planning approaches can be researched in the five directions mentioned in Fig. 3(b).

2.3 Motivation

The emergence of generative AI presents a novel perspective to address the previously mentioned limitations and to achieve an optimal intelligent process planning system. The subsequent discussion will illuminate the potential of generative AI within process planning, delineating its significance in three key dimensions and underscoring the impetus behind this paper.

Regarding human-computer interaction, generative AI, with its remarkable comprehension of natural language context, holds the potential to significantly streamline users' acclimatization to less user-friendly interactive interfaces. When deployed within manufacturing systems, it facilitates user interaction through natural language, seamlessly translating it into desired textual commands, voice instructions, or even programmable directives.
In terms of data and knowledge management, generative AI, grounded in large-scale datasets, can function as a general approach for identifying and extracting a diverse array of multimodal data pertinent to process planning within manufacturing enterprises. This encompasses textual content, images, videos, audio, and geometric models. Moreover, its extraordinary reasoning and generative capabilities serve to revolutionize the approach to seeking and retrieving process-related knowledge.
In the context of industrial training, the most compelling facet of generative AI currently resides in its rapid and efficient text generation capabilities. As such, when new employees within an enterprise strive to grasp the intricacies of manufacturing a specific standard component, GAIM can promptly provide pertinent insights. These encompass specialized foundational knowledge, instances of process planning, and comprehensive elucidations of pertinent concepts and terminology, all tailored to their input.

Drawing upon the aforementioned introduction of generative AI and CAPP, the integration of generative AI into the manufacturing domain to tackle prevailing process planning challenges emerges as a stimulating undertaking.

This section introduces the characteristics and definition of GIPP. On that basis, a generative AI and digital twins-based framework of intelligent process planning for creating an ideal process planning system is proposed.

3.1 Characteristics of GIPP

In a manufacturing cycle, the traditional process for product manufacturing by the CAPP is illustrated in Fig. 4(a). Initially, product designers create designs using CAD software based on requirements. Subsequently, the CAPP system, which can integrate various extended techniques including Feature Based (FB), Knowledge-Based (KB), Neural Network (NN), Internet Based (IB), Functional Blocks (FBs), Fuzzy Set Theory (FST), Agent-Based (AB), Step-Compliant (STEP), Petri Nets (PN), and Genetic Algorithm (GA) (Yusof and Latif 2013), is employed to generate process plans for the products. These plans are then put into the CAM system to obtain toolpath trajectories and numerical control programs for tool machining. Finally, the products are manufactured using Computer Numerical Control (CNC) machine tool. By drawing an analogy, this paper introduces the process and characteristics of GIPP. Specifically, product designers input their design requirements into GAIM, which can be in the form of text descriptions, 2D CAD drawings, or 3D CAD models. GAIM then directly generates the process plan, presented in structured data, such as text or tables, containing detailed information about the product's machining, including motion paths and numerical control programs. Finally, the process plan is put into the digital twin, enabling real-time interaction between the virtual and physical realms for product manufacturing, as shown in Fig. 4(b).

With the strengths of generative AI and digital twin, GIPP exhibits the ensuing characteristics:

Multimodality. The inputs can be multimodal, including text descriptions, 2D drawings, or 3D models, while the outputs can take various forms, such as text or tables, according to on-site needs.
Efficient management capability. Providing manufacturing enterprises with a generative method to recognize and extract multimodal data, thus establishing an effective mechanism for obtaining and manipulating high-quality process data or knowledge.
Flexibility. Scalability, adaptability, and customizability to suit individual manufacturing companies and novel processes.
User-Friendliness. Powerful human-computer interaction functionality, offering a user-friendly interface that enables non-experts to use it effortlessly. Additionally, it should provide users with instant and inspirational feedback.
Reliability. Utilizing digital twins, it possesses the capacity to evaluate and validate process knowledge and plans.
Explainability. Additionally, it can trace and offer logical inferences for the generated process knowledge and plans.

3.2 Definition and framework of GIPP

Based on the analyses provided above, we propose the following definition for GIPP:

Definition 1

GIPP is an intelligent process planning system that integrates generative AI and digital twin technologies. It leverages the capabilities of generative AI, such as easy access and efficient high-quality content generation, along with the benefits of digital twin technology, including real-time simulation and verification. This integration empowers the process planning system with robust abilities of data and knowledge management, human-computer interaction, process knowledge or plans generation, verification, and feedback optimization. Its primary objective is to optimize enterprise productivity and production quality while ensuring high adaptability to accommodate personalized manufacturing needs and reduce production costs.

With this definition, we introduce the architecture of GIPP, as depicted in Fig. 5, which comprises three primary functional layers, namely the Data Layer, Model Layer, and Application Layer.

Specifically, in the Data Layer, the massive and diverse data from various sources within manufacturing enterprises [48] are effectively managed and utilized by constructing a universal data management method or mechanism. This approach can be knowledge acquisition templates [49], machine learning algorithms [50], or their combination, which enables the extraction and classification of process data to form the pre-training and fine-tuning datasets required for training the GAIM. These datasets are stored and dynamically updated using large-scale databases, such as MySQL, to function as a dynamic knowledge base.

The Model Layer comprises the technologies of generative AI and digital twins. Specifically, the emergence of the Transformer [51] has profound implications for GAIM, serving as the intersection across various domains within AI. Because we used the GPT-2 architecture in the case study, Fig. 5 utilizes the Transformer's decoder component to symbolize the Transformer. Notably, GAIM based on the Transformer architecture can be categorized into three main types: Encoder-only, Encoder-Decoder, and Decoder-only. Regardless of the selected architecture, these GAIMs are inherently complex and resource-intensive. For instance, GPT-2 medium boasts 1.5 billion parameters, while GPT-3 scales up significantly to a staggering 175 billion parameters. Such models demand substantial computational power, thereby leading to elevated research costs. Consequently, it is crucial to integrate pertinent key technologies for lightweighting GAIM, including reinforcement learning from human feedback (RLHF) [52], prompt learning [53], knowledge graph [54], and related lightweight techniques like pruning [55], distilling [56], and data augmentation [57]. The integration aims to reduce the training parameters while preserving the model's functionality, ultimately lowering the training cost. Based on this, the model undergoes pre-training and fine-tuning using the input dataset in the data layer and is then applied to specific process planning scenarios. The digital twin model consists primarily of the physical and virtual spaces of CNC machine tools. The physical space represents the actual carrier of the entire machining process, encompassing CNC, end mill, blank workpiece, force sensor, current sensor, and other components. The virtual space encompasses a geometric model, mechanism model, and behaviour model. These models interact with the physical space by utilizing twin data from a real-time database for perception and control. Subsequently, they integrate and interact with the GAIM through an application programming interface (API). By inputting the decision-making knowledge generated by the GAIM, the virtual space conducts process simulation, machining monitoring, quality prediction, and process evaluation. These functions serve to validate the generated content, thereby improving its reliability. Ultimately, the verification results will be fed back to GAIM as novel data, culminating in the iterative enhancement of the training dataset and conferring upon GAIM a certain degree of explainability.

In the Application Layer, the intelligent process planning system can be accomplished by training it with different fine-tuned datasets, allowing for implementation in various scenarios. Similar to the general model ChatGPT, the system functions as an intelligent question-answering system, providing expert information in the field of manufacturing processes when presented with process planning queries. It can address specific requirements of product designers, act as an experienced process planner, generate comprehensive process plans, and provide iterative feedback and improvements for design proposals. Additionally, the system assists novices in quickly familiarizing themselves with the manufacturing domain and aids employees in acquiring new knowledge and skills. The system serves as an AI coach, guiding novices to become skilled professionals in the enterprise and provides personalized and targeted training programs for employees seeking to learn and improve.

This section introduces the key methodologies of GIPP for the construction of generative AI and digital twins-based intelligent process planning from the perspectives of process knowledge and process plan generation, simulation, and evaluation.

4.1 Process Generative Pre-trained Transformer (ProcessGPT) modelling

Incorporating generative AI into the domain of process planning, we introduce the ProcessGPT, which stands for Process Generative Pre-trained Transformer. Our exposition commences with a clarification of the underlying principles and distinctive traits of the Transformer architecture. Subsequently, we provide an overview of the framework employed in ProcessGPT modelling.

Table 2

Statistics of the domain applications. Syn. means whether the model is pre-trained from scratch
Refenrence	GAIM	Architecture	Code
[42]	BERT	Encoder	https://github.com/google-research/bert
[58]	RoBERTa	Encoder	https://github.com/pytorch/fairseq
[59]	XLNet	Encoder	https://github.com/zihangdai/xlnet
[60]	DistilBERT	Encoder	https://github.com/huggingface/transformers
[61]	GPT	Decoder	https://github.com/huggingface/transformers
[26]	GPT-2	Decoder	https://github.com/openai/gpt-2
[25]	GPT-3	Decoder	https://github.com/openai/gpt-3
[38]	PaLM	Decoder	https://github.com/lucidrains/PaLM-pytorch
[62]	LaMDA	Decoder	https://github.com/conceptofmind/LaMDA-rlhf-pytorch
[37]	LLaMA	Decoder	https://github.com/facebookresearch/llama
[63]	T5	Encoder-Decoder	https://github.com/google-research/text-to-text-transfer-transformer
[64]	BART	Encoder-Decoder	https://github.com/huggingface/transformers
[65]	DQ-BART	Encoder-Decoder	https://github.com/amazon-research/dq-bart
[66]	ExT5	Encoder-Decoder	https://github.com/google-research/text-to-text-transfer-transformer
[67]	Switch	Encoder-Decoder	https://github.com/tensorflow/mesh

Transformer was proposed in the field of NLP to overcome the limitations of traditional models like recurrent neural network-based language models (RNNs) [68] in handling variable-length sequences and context awareness. Figure 6(a) illustrates that the Transformer architecture comprises an encoder and a decoder, utilizing residual connections and normalization. The core components of Transformer include multi-head attention and feed-forward neural networks, which learn to assign varying weights to tokens based on their relevance. Compared to RNNs, the Transformer structure not only improves the handling of long-term dependencies but also offers a high degree of parallelism. These enhancements not only boost the model's performance in large-scale NLP tasks but also enable Transformer-based models to adapt effectively to various downstream tasks through large-scale pre-training [22]. Consequently, it has evolved into the fundamental architecture for numerous mainstream models, including BERT and the GPTs (GPT, GPT-2, and GPT-3). As mentioned in Section 3.2, models based on the Transformer architecture can be categorized into three main types: encoder-only, encoder-decoder, and decoder-only. Table 2 outlines the corresponding mainstream models for each type, as presented. GPTs outperform in text generation due to the impressive performance of the Decoder architecture in generating tasks. Figure 6(b) demonstrates that the GPTs architecture discussed in this paper consists of a 24-layer Transformer decoder.

According to [69], training domain-specific data from scratch is essential, thus requiring a well-designed, comprehensive training process tailored to the process domain. Consequently, we propose ProcessGPT, a domain-specific GAIM for process knowledge and plan generation in manufacturing. Figure 7 depicts the construction and training process of ProcessGPT, utilizing the domain-specific dataset and Transformer architecture.

Pre-training

Firstly, the pre-training dataset is obtained from the mentioned dynamic knowledge base in the previous section. The difficulty of obtaining a large amount of high-quality training data within the domain necessitates effective data augmentation methods to enhance data size and quality. For sequence data, various data augmentation methods can be employed [70]. The third step involves acquiring the domain vocabulary since the model is trained from scratch, and the vocabulary of the transformer-based model cannot be used. Byte-pair encoding can be utilized to segment words in the corpus into word pieces for vocabulary learning [71]. Lastly, the above-mentioned corpus data will be put into the Transformer based model for training.

Fine-Tuning

After pre-training, we will apply ProcessGPT to various downstream tasks using fine-tuned datasets specific to each task, namely process knowledge question-answering and process document classification. As depicted in Fig. 7, when input text is in the form of a question, the model, after fine-tuning training, acquires the capability of general knowledge question-answering in process planning, generating text-based responses to user queries. Moreover, when the input text is in the form of a document, the model, following training, gains the ability for document classification and generation. Consequently, it can generate structured process plans like process cards in accordance with user input requirements.

4.2 Digital twin-based verification method

Like other deep learning models, the training process of GAIM lacks transparency and has poor explainability, often regarded as a black box [72]. Consequently, these concerns about the reliability and security of the generated content, particularly in the manufacturing domain, emerged. Unreliable generated content not only results in significant losses or safety incidents but also undermines manufacturer's confidence in this technology.

Digital twin, is defined as an integrated multi-physics, multi-scale, probabilistic simulation of an as-built product, system, or process which can mirror the life of its corresponding twin using available physical models, history knowledge, and real-time data [73], is nowadays regarded as the key for the convergence of physical systems and cyber systems [74]. The vigorous development of digital twin technology has provided a novel solution for both intelligent machining and Zero Defect Manufacturing (ZDM), demonstrating its potential in product quality management [75, 76]. Building upon this foundation and drawing from the accomplishments of our research group in digital twin [74, 77, 78], this paper proposes the integration of GIAM with digital twin technology to evaluate and verify the process knowledge and plans generated by ProcessGPT, thereby enhancing the reliability and explainability of GIPP.

As shown in Fig. 8, based on the digital twin model, real-time perception and data acquisition of both static data, such as manufacturing resources and blank workpieces essentials, and dynamic data, like operational parameters of machine tools, geometric dimensions and shapes of workpieces, and process execution details, is conducted using a sensor network to form twin data. Concurrently, the process plans generated by ProcessGPT are considered as inputs constituting static data. During the machining process, the virtual space conducts geometric and physical simulations based on twin data and the constructed digital twin model, all of which are continuously updated in real-time through online monitoring to adapt to changes in the physical workspace. Subsequent to this, the existing process plans are subjected to evaluation based on simulation data and monitoring outcomes. In the event of deviations between the actual machining state and the required parameters, discrepancies are identified using twin data. Lastly, the verification of manufacturing resource availability within the theoretical process plans occurs through real-time monitoring and analysis of data regarding workshop machine tools, cutting instruments, raw materials, fixtures, and other essential resources. Additionally, the effectiveness of process parameters is confirmed via a process parameter analysis system, with supplementary simulation validation conducted for machining irregularities such as excessive cutting, tool collisions, and processing delays. Furthermore, the introduction of Siemens Plant Simulation production software enables a global simulation analysis of part machining process plans, thereby validating the optimized machining process plans.

Drawing on the aforementioned key methodologies, this section explores the way to establish the prototype system for GIPP and delineates the execution mechanism from raw materials to the final product. Subsequently, it outlines three potential application scenarios of GIPP, providing guidance for future research directions. Moreover, through comparative analysis, the paper elucidates the advantages of GIPP compared to traditional CAPP, and discusses its limitations.

5.1 Prototype implementation

5.1.1 Construction of GIPP

Figure 9 presents an intelligent experimental platform for process planning, leveraging GPT-2 architecture. The platform comprises five key parts: process dataset, training, model, process knowledge and plans, and a digital twin model for a three-axis CNC milling machine.

Process dataset. As pointed out by [69], it is crucial to train on comprehensive domain-specific data. Therefore, this paper focuses only on textual data within the manufacturing process domain to construct the dataset and trains the model from scratch using the collected data.

Training. For pre-training, the model is trained using the same standard language modelling task as [25] and [26]. Fine-tuning is performed to adapt the pre-trained model to downstream tasks such as question-answering systems and process plan generation. In the question-answering system, given a question, a reference context, and an answer, the objective is to determine whether the answer can be inferred from the reference context.

Model. Considering factors such as model size, open-source availability, and model performance, ProcessGPT will adopt GPT-2 medium as its foundation architecture. The core component of the Transformer as well as GPT-2 is the multi-head attention. Given the input, three linear transformations are applied to generate queries Q, keys K, and values V, followed by the computation of the output as follows:

Process knowledge and plans

Following pre-training, our model is capable of acquiring domain-specific knowledge related to process planning, including fundamental concepts like process specifications and positioning references. Furthermore, fine-tuning can be accomplished through the construction of question-answer datasets to develop a domain-specific question-answering system for process planning, along with the creation of process plan datasets to enable automated process plan generation. In the context of process knowledge and process plans, their correctness can be validated by employing digital twin models.

Digital twin. The physical space in the digital twin model primarily includes a three-axis CNC milling machine and its components such as electric power, cutting force sensors, workpieces, and cutting tools, which are used to complete the CNC milling process of workpieces. The CNC can fulfil the machining requirements for flat surfaces, slots, and curved surfaces. Additionally, it is equipped with pneumatic safety doors and a CNC system that supports network protocols. Specific parameters are presented in Table 3. The geometric, mechanism, and data model of the virtual space are constructed based on this physical space.

Table 3

The key parameters of a three-axis CNC milling machine
Equipment	Model	Key Parameters
CNC Milling Machine	VMC400	Numerical Control System: KND21000MCi
		Machine Size: 1900mm×1500mm×2000mm
		Workbench Size: 250mm×800mm
		Working Stroke: X(400mm), Y(280mm), Z(380mm)
		Positioning Accuracy: ±0.01mm
		Spindle Speed: 6000rpm
		Main Electromotor Power: 2.2kw
		Application: Machining flats, slots, and curved surfaces

5.1.2 Machining execution mechanism of GIPP

The specific aerospace thin-walled component is used to examine the implementation mechanism of GIPP, as depicted in Fig. 10.

In the initial step, the on-site process planner provides pertinent details to ProcessGPT, drawing from the blank workpiece specifications, including material and dimensions (limiting the input to textual data in this context). Subsequently, ProcessGPT generates part processing plans in line with user specifications. It is noteworthy that for standard or typical components within the enterprise, user inputs tend to be more concise compared to those required for novel or specialized parts. Following this, the generated processing plan is fed into the CNC virtual space, culminating in the execution of part machining simulation and real-time state monitoring. This synthesis of the virtual and physical realms facilitates the evaluation and validation of the machining quality of the part. The resultant evaluation and validation data are then integrated back into ProcessGPT to enrich its training dataset, thereby enabling self-learning and evolution. Ultimately, this process yields certified products.

5.2 Potential applications of GIPP

Leveraging the pre-training and fine-tuning capabilities of GAIM, the application of generative AI-based process planning systems bestows notable degrees of flexibility. The system can be customized for specific scenarios by fine-tuning with diverse datasets, encompassing question-answering system-aided process planning, automatic generation of process plans, and an intelligent process AI coach. Subsequent sections will furnish explicit examples of applications pertaining to the preceding three scenarios.

5.2.1 Question-answering system-aided process planning

The question-answering system is designed to offer real-time problem-solving and process-related information to manufacturing enterprises. Illustrated in Fig. 11, the development of a domain-specific intelligent question-answering system involves acquiring a question-answer corpus from manufacturing enterprises, encompassing facets like design, manufacturing, products, and sales, followed by pre-training model implementation. This system is capable of addressing a comprehensive array of inquiries pertaining to product design, manufacturing processes, equipment operations, production workflows, and post-sales services within the manufacturing enterprise. Moreover, through integration with the digital twin, it can furnish responses of heightened reliability.

The process question-answering system features a user-friendly human-machine interface, allowing employees to access it directly on computers or mobile devices. This capability significantly saves time for enterprise staff to obtain information, enabling the manufacturing business to swiftly address issues and thereby enhance production efficiency and quality. Furthermore, it offers enhanced decision-making support for manufacturing personnel by assisting them in comprehending and effectively applying process knowledge. This, in turn, drives additional optimization of production workflows and heightened competitiveness.

5.2.2 Question-answering system-aided process planning

The application scenario of process plans intelligent generation, depicted in Fig. 12, is proposed for aerospace component process planning.

It showcases the use of the GIPP model in the role of a skilled process planner. Firstly, Initially, product designers put forth process planning requirements grounded in 2D drawings or 3D models pertinent to the process planning undertaking. Subsequently, the process requirements are fed into the proficiently trained model, referred to as ProcessGPT. Thirdly, ProcessGPT autonomously generates process plans and provides a recommended list of processing options. Fourth, on-site staff make process determinations guided by the list of recommendations. Fifth, the obtained process plans are input into the digital twin model, enabling real-time interaction between virtual and physical domains to execute the manufacturing process spanning from raw material to finished product. Finally, the validated process plans are amassed as training data, augmenting the model with self-learning and evolving capabilities.

5.2.3 Intelligent process AI coach

Grounded in generative AI and digital twins, GIPP harnesses robust text generation and real-time feedback capabilities, expediting the acquisition of knowledge and skills among process personnel. As shown in Fig. 13, GIPP serves as an AI coach, delivering personalized real-time feedback to individual employees for their questions and providing targeted responses to facilitate knowledge acquisition. Empowered by the digital twin, process staff can actively participate in real-time virtual operations and simulation experiments that are informed by their acquired knowledge. Leveraging a real-time "trial-feedback" mechanism, they can proficiently attain mastery over the precise utilization of diverse production processes, tools, and equipment, thus significantly amplifying their skill levels. Tailored learning plans and feedback are dispensed contingent upon the progress and performance of the process staff's learning journey. This approach guarantees the realization of targeted learning objectives through regularized feedback loops.

5.3 Comparative analysis between GIPP and intelligent CAPP

In this section, we will conduct a comparative analysis between the GIPP and four intelligent CAPPs, namely Expert System-based CAPP (ES-CAPP), Neural Network-based CAPP (NN-CAPP), Agent-based CAPP (Agent-CAPP), and Petri Net-based CAPP (PN-CAPP).

As depicted in Table 4, the comparative analysis primarily involves qualitative assessments of the key characteristics of these four intelligent CAPPs. This approach is employed to highlight the strengths and weaknesses of GIPP. Firstly, ES-CAPP, rooted in expert knowledge, often requires expert participation in constructing rules and knowledge bases. While capable of addressing complex domain-specific challenges, its reliance on human expertise can result in rigidity, limited adaptability, and a deficiency in self-learning capabilities. Secondly, NN-CAPP utilizes extensive data for model training, endowing them with learning and predictive proficiencies. Consequently, it boasts considerable flexibility and the ability to suit various process planning scenarios. However, it mandates substantial training data and its training process lacks transparency. Thirdly, Agent-CAPP simulates human decision-making processes through intelligent agents, capable of mimicking diverse roles and decision behaviors, thereby facilitating flexible process planning. Nevertheless, the simulation process might become overly abstract, necessitating intricate modelling. Finally, PN-CAPP employs Petri Nets to model process planning, providing concurrency and sequencing analysis capabilities. It excels at scrutinizing concurrent process steps and intricate process relationships. However, its modelling requisites are substantial, and its applicability might not extend to all domains.

Table 4

Comparison statistics of six characteristics performances between GIPP and four intelligent CAPP. ‘✓’ means the superior performance of the characteristic.
Intelligent Process Planning	Characteristics
Intelligent Process Planning	Specificity	Self-learning	Flexibility	parallelism	Computability	Reliability
GIPP	✓	✓	✓	✓	-	✓
ES-CAPP	✓	-	-	-	✓	-
NN-CAPP	-	✓	✓	-	-	-
Agent-CAPP	-	-	✓	-	✓	-
Petri-CAPP	-	-	-	✓	✓	-

GIPP, hinging on generative AI, can fine-tune downstream tasks to generate domain-specific content, allowing it to adapt to varying scenarios. Its potential for self-learning and evolution is facilitated through training data revisions. Leveraging the deep neural network architecture of Transformer and its data processing approach, GIPP demonstrates pronounced levels of parallelism. Nevertheless, this entails a significant demand for computational power. Importantly, GIPP, founded on the digital twin model, can submit its generated theoretical process plans to real-time simulation and verification, whereas nearly all CAPP types predominantly output theoretical process plans. As a result, GIPP can be deemed highly reliable.

5.4 Discussion

GIPP is a novel intelligent process planning method based on generative AI and digital twins. From the perspectives of implementation and operation of GIPP, discussions could focus on the following two aspects.

5.4.1 Benefits

This paper proposes an intelligent process planning framework based on generative AI and digital twin, serving as an application case of generative AI in the field of intelligent manufacturing. The constructed test platform provides valuable insights into GIPP, demonstrating its architectural components and revealing the manufacturing execution mechanism of GIPP. Furthermore, the potential applications and a comparative analysis with several intelligent CAPPs highlights the advantages and characteristics of GIPP.

The proposed key methodologies in this framework, namely ProcessGPT modelling, and digital twin-based verification method, empower intelligent process planning systems with effective human-computer interaction, high efficiency and quality generation, and validation and feedback optimization of process knowledge or plans. Therefore, through GIPP's capacities of efficiently harnessing manufacturing data, rapidly generating process knowledge, and ensuring the reliability validation of process knowledge and plans, it can maximize product machining quality and throughput while maintaining the flexibility of process planning. Additionally, leveraging the powerful transfer learning capability of the GAIM's pre-training mechanism enables the application of GIPP in various scenarios. In summary, GIPP offers an innovative strategy to mitigate the existing limitations within intelligent process planning. It illustrates the substantial capability and viability of integrating generative AI with digital twins in the context of process planning.

5.4.2 Challenges ahead

This paper explores several pivotal facets of GIPP. However, several challenges persist in the near future, demanding resolution. From the implementation perspective of GIPP, a significant challenge lies in constructing effective and large-scale training datasets for pre-training the GAIM in the domain of manufacturing. This challenge emanates from two primary reasons. Firstly, a deficiency of large-scale open-source databases in the manufacturing domain, analogous to the medical field's PubMed. Secondly, a dearth of NLP algorithms customized for the manufacturing domain exacerbates the situation. Presently, the predominant characteristic of GAIM lies in its extensive volume of data and massive models. This leads to substantial computational resource requirements and highly expensive training, posing challenges for their application in typical enterprises and research institutions. Hence, another significant challenge is to reduce the size and complexity of GAIM. Fortunately, model compression techniques such as pruning, quantization, and knowledge distillation [79] have been advanced to achieve lightweight GAIM. Nonetheless, striking a balance between model training effectiveness and model lightweighting remains an area that requires further research.

In this paper, we present an intelligent process planning framework that integrates generative AI and digital twins to rectify the deficiencies observed in current process planning methods concerning efficiency, user-friendly human-machine interaction, flexibility, and reliability. Based on the findings presented in this paper, the following conclusions can be drawn.

In response to the current challenges related to low user-friendliness, limited flexibility, and high entry barriers within prevailing process planning systems, this paper explores the feasibility and attributes of integrating generative artificial intelligence (AI) models into intelligent process planning systems. By incorporating digital twin technology, we propose a generative intelligent process planning framework, with the aim of offering fresh perspectives on the application of generative AI in the manufacturing domain.

Within the two supporting technologies proposed, ProcessGPT enables users to input succinct natural language text, generating high-quality knowledge in the realm of process planning or machining process schemes. Digital twin technology offers validation and reliability support for the generated content, mitigating the challenge of low reliability in generative artificial intelligence models. This establishes the technological groundwork for researchers to advance intelligent process planning systems founded on generative artificial intelligence.

The proposed building modules and manufacturing execution mechanism of GIPP provide practical insights for the technology's actual application. The presentation of three potential application scenarios introduces new solutions to overcome the bottlenecks encountered by traditional process planning systems. Comparative analysis between GIPP and intelligent CAPP accentuates the distinctive features and advantages of GIPP.

Future work will focus on three key aspects. Firstly, we plan to construct a general pre-training dataset for the manufacturing domain based on named entity recognition (NER) related methodologies [80]. This dataset will be derived from manufacturing science-related papers and abstracts. Secondly, our efforts will be directed toward researching lightweighting techniques for GAIM to ensure that the laboratory can handle the corresponding computational costs, for instance, employing knowledge distillation techniques [81]. In the end, we aim to establish a high-fidelity and precise digital twin model based on a skin model [82], which will facilitate multi-scale simulation and verification of the actual products manufacturing process.

Funding

This work was supported by the National Natural Science Foundation of China under Grant 52105530 and 52275508; the China National Postdoctoral Program for Innovative Talents [grant number BX2021244]; the Young Talent fund of University Association for Science and Technology in Shaanxi, China [grant number 20210409]; the China Postdoctoral Science Foundation [grant number 2021M692556].

Ethical approval: Not applicable.

Consent to participate: Not applicable.

Consent for publication: Not applicable.

Competing interests: The authors declare no competing interests.

Zhang C, Zhou G, Hu J, Li J (2020) 2.Deep learning-enabled intelligent process planning for digital twin manufacturing cell. Knowl Based Syst 191:105247. https://doi.org/10.1016/j.knosys.2019.105247
Halevi G (2014) Industrial Management-Control and Profit: A Technical Approach. Springer
Gao X, Mou W, Peng Y (2016) 4.An Intelligent Process Planning Method Based on Feature-based History Machining Data for Aircraft Structural Parts. Procedia CIRP 56:585–589. https://doi.org/10.1016/j.procir.2016.10.115
Behandish M, Nelaturi S, De Kleer J (2018) Automated process planning for hybrid manufacturing. Comput Aided Des 102:115–127. https://doi.org/10.1016/j.cad.2018.04.022
Al-wswasi M, Ivanov A, Makatsoris H (2018) A survey on smart automated computer-aided process planning (ACAPP) techniques. Int J Adv Manuf Technol 97:809–832. https://doi.org/10.1007/s00170-018-1966-1
Leo Kumar SP (2017) 7.State of The Art-Intense Review on Artificial Intelligence Systems Application in Process Planning and Manufacturing. Eng Appl Artif Intell 65:294–329. https://doi.org/10.1016/j.engappai.2017.08.005
Xu X, Wang L, Newman ST (2011) 8.Computer-aided process planning – A critical review of recent developments and future trends. Int J Comput Integr Manuf 24:1–31. https://doi.org/10.1080/0951192X.2010.518632
Wu T, He S, Liu J et al (2023) 9.A Brief Overview of ChatGPT: The History, Status Quo and Potential Future Development. IEEE/CAA J Autom Sinica 10:1122–1136. https://doi.org/10.1109/JAS.2023.123618
Li BM, Xie SQ, Xu X (2011) 10.Recent development of knowledge-based systems, methods and tools for One-of-a-Kind Production. Knowl Based Syst 24:1108–1119. https://doi.org/10.1016/j.knosys.2011.05.005
Makatura L, Foshey M, Wang B et al (2023) How Can Large Language Models Help Humans in Design and Manufacturing? arXiv. https://doi.org/10.48550/arXiv.2307.14377. preprint arXiv:2307.14377.
Wang X, Anwer N, Dai Y, Liu A (2023) ChatGPT for design, manufacturing, and education. Procedia CIRP 119:7–14. https://doi.org/10.1016/j.procir.2023.04.001
Kong T, Hu T, Zhou T, Ye Y (2021) Data Construction Method for the Applications of Workshop Digital Twin System. J Manuf Syst 58:323–328. https://doi.org/10.1016/j.jmsy.2020.02.003
Locklin A, Muller M, Jung T et al (2020) Digital Twin for Verification and Validation of Industrial Automation Systems – a Survey. In: 2020 25th IEEE International Conference on Emerging Technologies and Factory Automation (ETFA). IEEE, Vienna, Austria, pp 851–858. https://doi.org/10.1109/ETFA46521.2020.9212051
Talkhestani BA, Jazdi N, Schlögl W, Weyrich M (2018) A concept in synchronization of virtual production system with real factory based on anchor-point method. Procedia Cirp 67:13–17. https://doi.org/10.1016/j.procir.2017.12.168
DebRoy T, Zhang W, Turner J, Babu SS (2017) Building digital twins of 3D printing machines. Scripta Mater 135:119–124. https://doi.org/10.1016/j.scriptamat.2016.12.005
Mukherjee T, DebRoy T (2019) A digital twin for rapid qualification of 3D printed metallic components. Appl Mater Today 14:59–65. https://doi.org/10.1016/j.apmt.2018.11.003
Murphy KP (2022) Probabilistic machine learning: an introduction. MIT press
Gozalo-Brizuela R, Garrido-Merchan EC (2023) ChatGPT is not all you need. A State of the Art Review of large Generative AI models. arXiv preprint. https://doi.org/10.48550/arXiv.2301.04655. arXiv:2301.04655
Wang Y, Pan Y, Yan M et al (2023) A Survey on ChatGPT: AI-Generated Contents, Challenges, and Solutions. arXiv preprint arXiv:2305.18339. https://doi.org/10.48550/arXiv.2305.18339
Wu J, Gan W, Chen Z et al (2023) AI-Generated Content (AIGC): A Survey. arXiv preprint arXiv:2304.06632, 2023. https://doi.org/10.48550/arXiv.2304.06632
Xu M, Du H, Niyato D et al (2023) Unleashing the Power of Edge-Cloud Generative AI in Mobile Networks: A Survey of AIGC Services. arXiv preprint arXiv:2303.16129, 2023. https://doi.org/10.48550/arXiv.2303.16129
Cao Y, Li S, Liu Y et al (2023) A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT. arXiv preprint arXiv:2303.04226, 2023. https://doi.org/10.48550/arXiv.2303.04226
Stefanini M, Cornia M, Baraldi L et al (2021) From Show to Tell: A Survey on Deep Learning-based Image Captioning. IEEE transactions on pattern analysis and machine intelligence, 2022, 45(1): 539–559. https://doi.org/10.1109/TPAMI.2022.3148210
Liang PP, Zadeh A, Morency L-P (2023) Foundations and Trends in Multimodal Machine Learning: Principles, Challenges, and Open Questions. arXiv preprint arXiv:2209.03430, 2022. https://doi.org/10.48550/arXiv.2209.03430
Brown T, Mann B, Ryder N et al (2020) Language models are few-shot learners. Adv Neural Inf Process Syst 33:1877–1901
Radford A, Wu J, Child R et al (2019) Language models are unsupervised multitask learners. OpenAI blog 1:9
Luo R, Sun L, Xia Y et al (2023) BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining. Brief Bioinform 23(6):bbac409. https://doi.org/10.1093/bib/bbac409
Wang H, Liu C, Xi N et al (2023) HuaTuo: Tuning LLaMA Model with Chinese Medical Knowledge. arXiv preprint arXiv:2304.06975. https://doi.org/10.48550/arXiv.2304.06975
Li Y, Li Z, Zhang K et al (2023) ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge. https://doi.org/10.7759/cureus.40895. Cureus
Xiong H, Wang S, Zhu Y et al (2023) DoctorGLM: Fine-tuning your Chinese Doctor is not a Herculean Task. arXiv preprint. https://doi.org/10.48550/arXiv.2304.01097. arXiv:2304.01097
Lewkowycz A, Andreassen A, Dohan D et al (2022) Solving Quantitative Reasoning Problems with Language Models. Adv Neural Inf Process Syst 35:3843–3857
Scarlatos A, Lan A (2023) Tree-Based Representation and Generation of Natural and Mathematical Language. arXiv preprint arXiv:2302.07974, 2023. https://doi.org/10.48550/arXiv.2302.07974
Markel JM, Opferman SG, Landay JA, Piech C (2023) GPTeach: Interactive TA Training with GPT-based Students. In: Proceedings of the Tenth ACM Conference on Learning @ Scale. ACM, Copenhagen Denmark, pp 226–236. https://doi.org/10.48550/arXiv.2302.07974
Dhariwal P, Jun H, Payne C et al (2020) Jukebox: A Generative Model for Music. arXiv preprint arXiv:2005.00341. https://doi.org/10.48550/arXiv.2005.00341
Elizalde B, Deshmukh S, Ismail MA, Wang H (2022) CLAP: Learning Audio Concepts From Natural Language Supervision. In ICASSP 2023–2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 1–5). IEEE. https://doi.org/10.1109/ICASSP49357.2023.10095889
Sohl-Dickstein J, Weiss E, Maheswaranathan N, Ganguli S (2015) Deep unsupervised learning using nonequilibrium thermodynamics. In: International conference on machine learning. PMLR, pp 2256–2265
Touvron H, Lavril T, Izacard G et al (2023) LLaMA: Open and Efficient Foundation Language Models. arXiv preprint arXiv:2302.13971. https://doi.org/10.48550/arXiv.2302.13971
Chowdhery A, Narang S, Devlin J et al (2022) PaLM: Scaling Language Modeling with Pathways. arXiv preprint arXiv:2204.02311. https://doi.org/10.48550/arXiv.2204.02311
Razavi A, Van den Oord A, Vinyals O (2019) Generating diverse high-fidelity images with vq-vae-2. Advances in neural information processing systems 32
Elizalde B, Deshmukh S, Ismail MA, Wang H (2022) CLAP: Learning Audio Concepts From Natural Language Supervision. In ICASSP 2023–2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 1–5). IEEE. https://doi.org/10.1109/ICASSP49357.2023.10095889
Kong Q, Cao Y, Iqbal T et al (2020) PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 28:2880–2894. https://doi.org/10.1109/TASLP.2020.3030497
Devlin J, Chang M-W, Lee K, Toutanova K (2018) Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv 181004805. https://doi.org/10.48550/arXiv.1810.04805
Rombach R, Blattmann A, Lorenz D et al (2022) High-Resolution Image Synthesis with Latent Diffusion Models. In: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, New Orleans, LA, USA, pp 10674–10685
Ahmad N, Haque A, Hasin AA (2001) Current trend in computer aided process planning. In: Proceedings of the 7th Annual Paper Meet and 2nd Intern. Conf. pp 25–27
Niebel BW (1965) Mechanized process selection for planning new designs. ASME paper 737
Yusof Y, Latif K (2013) Computer Aided Process Planning: A Comprehensive Survey. In: Azevedo A (ed) Advances in Sustainable and Competitive Manufacturing Systems. Springer International Publishing, Heidelberg, pp 389–400. https://doi.org/10.1007/978-3-319-00557-7_32
Fletcher CA (2014) The evaluation of a novel haptic machining VR-based process planning system using an original process planning usability method. Heriot-Watt University. http://hdl.handle.net/10399/2797
Kong Y, Li D, Li C et al (2021) A Multi-source Heterogeneous Data Storage and Retrieval System for Intelligent Manufacturing. In: 2021 IEEE International Conference on e-Business Engineering (ICEBE). IEEE, Guangzhou, China, pp 82–87. https://doi.org/10.1109/ICEBE52470.2021.00032
Zhang C, Zhou G, Bai Q et al (2018) HEKM: A High-End Equipment Knowledge Management System for Supporting Knowledge-Driven Decision-Making in New Product Development. In International Design Engineering Technical Conferences and Computers and Information in Engineering Conference (Vol. 51739, p. V01BT02A014). American Society of Mechanical Engineers. https://doi.org/10.1115/DETC2018-85151
Heng J, Wang J, Xiao L, Lu H (2017) Research and application of a combined model based on frequent pattern growth algorithm and multi-objective optimization for solar radiation forecasting. Appl Energy 208:845–866. https://doi.org/10.1016/j.apenergy.2017.09.063
Vaswani A, Shazeer N, Parmar N et al (2017) Attention is all you need. Advances in neural information processing systems 30
Ouyang L, Wu J, Jiang X et al (2022) Training language models to follow instructions with human feedback. Adv Neural Inf Process Syst 35:27730–27744
Li L, Zhang Y, Chen L (2023) Personalized Prompt Learning for Explainable Recommendation. ACM Trans Inform Syst 41(4):1–26. https://doi.org/10.1145/3580488
Yang L, Chen H, Li Z et al (2023) ChatGPT is not Enough: Enhancing Large Language Models with Knowledge Graphs for Fact-aware Language Modeling. arXiv preprint. https://doi.org/10.48550/arXiv.2306.11489. arXiv:2306.11489
Han S, Mao H, Dally WJ (2016) Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding. https://doi.org/10.48550/arXiv.1510.00149. arXiv preprint arXiv:1510.00149
Wu Q, Wang H, Ma X, Fu Y (2022) Distilling Text-Image Foundation Models
Wen Q, Sun L, Yang F et al (2021) Time Series Data Augmentation for Deep Learning: A Survey. In: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence. pp 4653–4660
Liu Y, Ott M, Goyal N et al (2019) RoBERTa: A Robustly Optimized BERT Pretraining Approach. arXiv preprint arXiv:1907.11692. https://doi.org/10.48550/arXiv.1907.11692
Yang Z, Dai Z, Yang Y et al (2019) XLNet: Generalized Autoregressive Pretraining for Language Understanding. Adv Neural Inf Process Syst, 32
Sanh V, Debut L, Chaumond J, Wolf T (2020) DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter. arXiv preprint arXiv:1910.01108. https://doi.org/10.48550/arXiv.1910.01108
Radford A, Narasimhan K, Salimans T, Sutskever I (2018) Improving language understanding by generative pre-training
Thoppilan R, De Freitas D, Hall J et al (2022) LaMDA: Language Models for Dialog Applications. arXiv preprint arXiv:2201.08239. https://doi.org/10.48550/arXiv.2201.08239
Raffel C, Shazeer N, Roberts A et al (2020) Exploring the limits of transfer learning with a unified text-to-text transformer. J Mach Learn Res 21:5485–5551. https://dl.acm.org/doi/abs/10.5555/3455716.3455856
Lewis M, Liu Y, Goyal N et al (2019) BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension. arXiv preprint arXiv:1910.13461. https://doi.org/10.48550/arXiv.1910.13461
Li Z, Wang Z, Tan M et al (2022) DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization. arXiv preprint arXiv:2203.11239. https://doi.org/10.48550/arXiv.2203.11239
Aribandi V, Tay Y, Schuster T et al (2022) ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning. arXiv preprint arXiv:2111.10952. https://doi.org/10.48550/arXiv.2111.10952
Fedus W, Zoph B, Shazeer N (2022) Switch transformers: Scaling to trillion parameter models with simple and efficient sparsity. J Mach Learn Res 23:5232–5270. https://dl.acm.org/doi/abs/10.5555/3586589.3586709
Mikolov T, Karafiát M, Burget L et al (2010) Recurrent neural network based language model. In: Interspeech, Makuhari, pp 1045–1048
Gu Y, Tinn R, Cheng H et al (2022) Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing. ACM Trans Comput Healthcare 3:1–23. https://doi.org/10.1145/3458754
Papanikolaou Y, Pierleoni A (2020) DARE: Data Augmented Relation Extraction with GPT-2. arXiv preprint arXiv:2004.13845. https://doi.org/10.48550/arXiv.2004.13845
Sennrich R, Haddow B, Birch A (2016) Neural Machine Translation of Rare Words with Subword Units. arXiv preprint arXiv:1508.07909. https://doi.org/10.48550/arXiv.1508.07909
Chen M, Tworek J, Jun H et al (2021) Evaluating Large Language Models Trained on Code. arXiv preprint arXiv:2107.03374. https://doi.org/10.48550/arXiv.2107.03374
Zhang C, Zhou G, Xu Q et al (2022) A digital twin defined autonomous milling process towards the online optimal control of milling deformation for thin-walled parts. Int J Adv Manuf Technol 124(7–8):2847–2861. https://doi.org/10.1007/s00170-022-10667-5
Liu S, Bao J, Zheng P (2023) A review of digital twin-driven machining: From digitization to intellectualization. J Manuf Syst 67:361–378. https://doi.org/10.1016/j.jmsy.2023.02.010
Psarommatis F, May G (2023) A literature review and design methodology for digital twins in the era of zero defect manufacturing. Int J Prod Res 61:5723–5743. https://doi.org/10.1080/00207543.2022.2101960
Li J, Zhou G, Zhang C (2022) A twin data and knowledge-driven intelligent process planning framework of aviation parts. Int J Prod Res 60:5217–5234. https://doi.org/10.1080/00207543.2021.1951869
Zhou G, Zhang C, Li Z et al (2020) Knowledge-driven digital twin manufacturing cell towards intelligent manufacturing. Int J Prod Res 58:1034–1051. https://doi.org/10.1080/00207543.2019.1607978
Deng BL, Li G, Han S et al (2020) Model Compression and Hardware Acceleration for Neural Networks: A Comprehensive Survey. Proc IEEE 108:485–532. https://doi.org/10.1109/JPROC.2020.2976475
Lample G, Ballesteros M, Subramanian S et al (2016) Neural Architectures for Named Entity Recognition. arXiv preprint arXiv:1603.01360. https://doi.org/10.48550/arXiv.1603.01360
Hinton G, Vinyals O, Dean J (2015) Distilling the Knowledge in a Neural Network. arXiv preprint. https://doi.org/10.48550/arXiv.1503.02531. arXiv:1503.02531
Schleich B, Anwer N, Mathieu L, Wartzack S (2014) Skin Model Shapes: A new paradigm shift for geometric variations modelling in mechanical engineering. Comput Aided Des 50:1–15. https://doi.org/10.1016/j.cad.2014.01.001

Download PDF

Editorial decision: Minor Revisions Needed
02 May, 2024
Reviewers agreed at journal
11 Dec, 2023
Reviewers invited by journal
23 Nov, 2023
Editor assigned by journal
22 Nov, 2023
First submitted to journal
21 Nov, 2023

You are reading this latest preprint version

Generative AI and digital twin integrated intelligent process planning：A conceptual framework

Status:

Version 1

Abstract

Figures

1 Introduction

2 Background

2.1 Generative AI

2.1.1 Brief Introduction

2.1.2 Domain Application

2.2 CAPP

2.3 Motivation

3 Framework of GIPP

3.1 Characteristics of GIPP

3.2 Definition and framework of GIPP

4 GIPP methodologies

4.1 Process Generative Pre-trained Transformer (ProcessGPT) modelling

4.2 Digital twin-based verification method

5 Application and discussion

5.1 Prototype implementation

5.1.1 Construction of GIPP

5.1.2 Machining execution mechanism of GIPP

5.2 Potential applications of GIPP

5.2.1 Question-answering system-aided process planning

5.2.2 Question-answering system-aided process planning

5.2.3 Intelligent process AI coach

5.3 Comparative analysis between GIPP and intelligent CAPP

5.4 Discussion

5.4.1 Benefits

5.4.2 Challenges ahead

6 Conclusion and future work

Declarations

Funding

References

Status:

Version 1