Following the research methods and steps described in the previous chapter, this chapter presents and analyzes the experimental results. Although the results of some individual experiments are not dramatic, optimizing every link in the chain ultimately yields a substantial overall improvement.
4.1 Feature construction and selection
As shown in Fig. 5, when the random forest model is used to plot the top ten features directly, the features on the importance chart do not match expectations: they are not the original data field names, and their individual importance values are very low. Most of the fields in this study are categorical and were expanded into multiple derived fields by one-hot encoding, so in this state the importance of the original data fields cannot be seen.
As shown in Fig. 6, the importance of each original feature can be obtained by summing the importance values of all derived fields belonging to that original field. The figure shows that SN_NAME has the highest importance, higher than SN_TITLE, even though SN_TITLE is the text actually displayed on the label. Because SN_TITLE may be left empty in the settings, its importance drops, and even after value compensation the effect is limited. SN_NAME, although it may be an alias, correlates more strongly with the target variable for the same object because of users' naming habits. ACTION_TYPE has the lowest importance: analysis shows that the label content of a new part number is often copied from a similar product and lightly edited, so new versions and fresh entries cannot be clearly distinguished, which contributes little to the prediction model. The importance of PN_RANK also falls short of expectations; analysis shows that the label style is mainly determined by the process stage, and the variable's content is not governed by any obvious rule.
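The aggregation step above can be sketched in a few lines. This is a minimal illustration, not the study's code: the column names and the "FIELD_value" naming pattern (as produced by, e.g., pandas `get_dummies` with an underscore separator) are assumptions.

```python
# Hypothetical importances of one-hot-derived columns; names follow the
# assumed "FIELD_value" pattern of the encoder.
encoded_importance = {
    "SN_NAME_labelA": 0.05, "SN_NAME_labelB": 0.04,
    "SN_TITLE_x": 0.03, "ACTION_TYPE_new": 0.01,
}

def aggregate_importance(importance, original_fields):
    """Sum the importance of every derived (one-hot) column under its source field."""
    totals = {f: 0.0 for f in original_fields}
    for col, value in importance.items():
        # longest-prefix match first, so e.g. "SN_NAME" wins over a plain "SN"
        for field in sorted(original_fields, key=len, reverse=True):
            if col == field or col.startswith(field + "_"):
                totals[field] += value
                break
    return dict(sorted(totals.items(), key=lambda kv: kv[1], reverse=True))

print(aggregate_importance(encoded_importance,
                           ["SN_NAME", "SN_TITLE", "ACTION_TYPE"]))
```

Summing, rather than averaging, preserves the total share of importance each original field carries across all of its dummy columns.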
4.2 Common model modeling and comparison
The control group in this experiment is based on statistics from actual practice: on average, about 20% of a form's content is changed at each entry. If the entire form were submitted without modification, the accuracy would therefore be 80%; in reality the figure would be even lower once newly added part numbers are taken into account.
Table 11 Experimental results of each artificial intelligence model

| | All | BOX | CARTON | CB_SN | FCC | PALLET | SN | Others |
|---|---|---|---|---|---|---|---|---|
| Blind guess | 0.112 | 0.175 | 0.252 | 0.217 | 0.154 | 0.188 | 0.222 | X |
| Base | 0.800 | | | | | | | |
| KNN | 0.857 | 0.912 | 0.850 | 0.925 | 0.762 | 0.844 | 0.722 | 0.868 |
| SVM | 0.269 | 0.383 | 0.404 | 0.831 | 0.282 | 0.584 | 0.280 | 0.435 |
| Decision tree | 0.199 | 0.297 | 0.390 | 0.743 | 0.252 | 0.568 | 0.447 | 0.473 |
| Random forest | 0.879 | 0.933 | 0.876 | 0.940 | 0.754 | 0.874 | 0.794 | 0.868 |
| GBDT | 0.764 | 0.865 | 0.803 | 0.920 | 0.730 | 0.853 | 0.686 | 0.882 |
| XGBoost | 0.839 | 0.917 | 0.845 | 0.957 | 0.786 | 0.851 | 0.787 | 0.847 |
| NN | 0.888 | 0.896 | 0.883 | 0.931 | 0.838 | 0.896 | 0.802 | 0.860 |
As shown in Table 11, the columns are the label types: All means all types combined, and Others is the remaining data after excluding the six listed types. The rows are the control group and the various models. Reading down the columns, the accuracy on CB_SN is high regardless of model. CB_SN has the fewest distinct values in its target field, but besides the number of target classes, the amount of data shown in Table 6 must also be considered. Dividing the amount of data of each type by the number of distinct target values and sorting gives: CB_SN (83.47) > CARTON (74.53) > BOX (56.78) > PALLET (55.57) > All (48.65) > SN (28.82) > FCC (23.77).
The higher this ratio, the more data is available per class for training and the higher the accuracy tends to be, although this is not absolute, because sparsity between the data volume and the target field must still be considered. CARTON ranks 2nd and BOX 3rd by this ratio, yet both perform worse under SVM and the decision tree than PALLET, which ranks 4th. Reading across the rows, SVM and the decision tree are generally poor, and GBDT's performance is unsatisfactory; KNN, random forest, XGBoost, and the NN all exceed the control group's accuracy and reach the reference standard, performing well, each with its own strengths and weaknesses.
4.3 Evaluation of the effectiveness of data pre-processing
The feature-importance analysis in the previous step showed that SN_TITLE and SN_NAME are the most important fields, and both leave room for further optimization. After value compensation and fuzzy-data processing, each model is retrained and its accuracy observed. Value compensation: SN_NAME is required but may be an alias, while SN_TITLE is optional and may be empty, so an empty SN_TITLE is filled with the value of the SN_NAME field. Fuzzy processing: there is no mandatory, standardized way to fill in these two fields; as long as users at each station can understand them, symbols and capitalization are unrestricted, so the same object may be written in several different ways depending on user habits. Therefore, all symbols are removed from the two fields, leaving only letters, digits, and Chinese characters, which are then converted to uppercase, so that fuzzily similar entries are merged into consistent data.
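The two steps described above, compensation and fuzzy normalization, can be sketched as follows. This is a minimal illustration under stated assumptions: the record layout and field names follow the study's schema, but the exact cleaning rules in the original system may differ.

```python
import re

def normalize(text):
    """Keep only letters, digits, and CJK characters, then uppercase,
    so variants such as 'sn-01 label' and 'SN 01 LABEL' collapse to one token."""
    if not text:
        return ""
    kept = re.sub(r"[^0-9A-Za-z\u4e00-\u9fff]", "", text)
    return kept.upper()

def preprocess(record):
    """Value compensation: an empty SN_TITLE is filled from SN_NAME;
    then both fields are fuzzily normalized."""
    title = record.get("SN_TITLE") or record.get("SN_NAME", "")
    return {
        "SN_NAME": normalize(record.get("SN_NAME", "")),
        "SN_TITLE": normalize(title),
    }

print(preprocess({"SN_NAME": "sn-01 label", "SN_TITLE": ""}))
```

After this pass, entries that differ only in symbols, spacing, or capitalization map to identical values, which shrinks the effective cardinality of the one-hot-encoded fields.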
Table 12 Experimental results after data processing

| | All | BOX | CARTON | CB_SN | FCC | PALLET | SN | Others |
|---|---|---|---|---|---|---|---|---|
| KNN | +0.013 | -0.003 | +0.004 | +0.009 | +0.012 | +0.016 | +0.027 | +0.006 |
| SVM | +0.218 | +0.183 | +0.143 | +0.097 | +0.143 | +0.018 | +0.093 | +0.048 |
| Decision tree | +0.090 | +0.096 | +0.064 | +0.111 | +0.180 | -0.002 | +0.000 | +0.018 |
| Random forest | +0.001 | -0.026 | -0.005 | +0.003 | +0.022 | +0.020 | +0.006 | +0.025 |
| GBDT | +0.003 | +0.049 | +0.023 | +0.014 | +0.020 | +0.005 | +0.057 | -0.016 |
| XGBoost | +0.035 | +0.011 | +0.014 | -0.029 | -0.030 | -0.002 | -0.014 | -0.011 |
| NN | +0.006 | +0.010 | -0.015 | +0.014 | +0.007 | +0.000 | +0.039 | +0.011 |
As shown in Table 12, the columns are the label types, the rows are the models, and each cell is the change in accuracy. SVM and the decision tree improve markedly, with accuracy gains of up to 21.8%; for KNN, random forest, GBDT, XGBoost, and the NN the impact is less significant. Some models may already have reached their learning limit on the existing data and features, leaving little room for optimization, while models such as random forest can already cope with missing values and fuzzy data on their own.
4.4 Evaluation of the effectiveness of loop training
The experiments in this section attempt to optimize only the neural network model, which has the best overall performance. They explain the purpose of loop testing and then analyze and discuss the experimental results.
4.4.1 Comparison of effectiveness evaluation of test set segmentation methods
Testing of the neural network revealed that the results were overly optimistic and inconsistent with actual application. Analysis showed that the data has temporal characteristics: if the training and test sets are selected randomly, the training set may contain data from the future relative to the test data. Seeing the answers before verification can cause the model to get out of control, overfit, and fail to learn correctly. This experiment was therefore designed to clarify the actual situation with respect to the test-set segmentation method.
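The time-based split described above can be sketched as follows. This is a generic illustration, not the thesis's own code; the `timestamp` field name is an assumption.

```python
def time_split(records, test_ratio=0.2, key="timestamp"):
    """Order records by time and hold out the most recent fraction as the
    test set, so no future rows leak into training."""
    ordered = sorted(records, key=lambda r: r[key])
    cut = int(len(ordered) * (1 - test_ratio))
    return ordered[:cut], ordered[cut:]

# ten dummy time-ordered records
data = [{"timestamp": t, "y": t % 2} for t in range(10)]
train, test = time_split(data, test_ratio=0.2)
print(len(train), len(test))  # 8 2
```

Every training record strictly precedes every test record in time, which is the property a random split cannot guarantee.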
Table 13 Experimental results of test set segmentation method

| | All | BOX | CARTON | CB_SN | FCC | PALLET | SN | Others |
|---|---|---|---|---|---|---|---|---|
| Random segmentation | 0.959 | 0.956 | 0.960 | 0.980 | 0.940 | 0.957 | 0.910 | 0.922 |
| Time segmentation | 0.894 | 0.906 | 0.868 | 0.945 | 0.845 | 0.896 | 0.841 | 0.871 |
As shown in Table 13, the columns are the label types and the rows are the test-set segmentation methods. For every type, accuracy under random segmentation is clearly much higher than under time segmentation: random-segmentation accuracy is always above 90%, with CB_SN even reaching 98%, while time-segmentation accuracy falls between 84% and 95%.
4.4.2 Comparison of effectiveness evaluation of loop training
In the baseline setup, 80% of the data is used for training and only 20% for validation. The data in this study is special: most fields are categorical, the entry interval of each part number is inconsistent, and the data distribution is therefore uneven. Because of the temporal characteristics, the test data cannot be selected randomly, so the diversity of the test set is low. This experiment was designed to solve that problem: to increase test-set diversity, partial data is used segment by segment in a loop, so that more of the data has an opportunity to serve as test data.
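The segment-by-segment idea can be sketched as a walk-forward generator over time-ordered row indices. This is a plausible sketch of the scheme described above, not the thesis's exact implementation.

```python
def rolling_windows(n_rows, n_splits):
    """Yield (train_indices, test_indices) pairs in which each consecutive
    segment of time-ordered rows acts once as the test set, while the
    training set consists of all earlier rows (walk-forward sketch)."""
    seg = n_rows // n_splits
    for k in range(1, n_splits):
        train = list(range(0, k * seg))
        test = list(range(k * seg, min((k + 1) * seg, n_rows)))
        yield train, test

# 12 time-ordered rows split into 3 segments
for train_idx, test_idx in rolling_windows(12, 3):
    print(len(train_idx), len(test_idx))
```

Across all loop rounds, every segment after the first appears once as test data, which spreads the evaluation over more of the timeline than a single fixed 20% hold-out.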
Table 14 Results of the loop training experiment

| | All | BOX | CARTON | CB_SN | FCC | PALLET | SN | Others |
|---|---|---|---|---|---|---|---|---|
| NN Base | 0.894 | 0.906 | 0.868 | 0.945 | 0.845 | 0.896 | 0.841 | 0.871 |
| Rolling 2 | 0.913 | 0.932 | 0.910 | 0.920 | 0.830 | 0.901 | 0.791 | 0.866 |
| Rolling 3 | 0.920 | 0.952 | 0.936 | 0.948 | 0.823 | 0.946 | 0.785 | 0.847 |
| Rolling 4 | 0.911 | 0.971 | 0.928 | 0.943 | 0.837 | 0.909 | 0.773 | 0.893 |
| Rolling 5 | 0.909 | 0.973 | 0.937 | 0.942 | 0.804 | 0.887 | 0.716 | 0.879 |
| Rolling 6 | 0.904 | 0.952 | 0.966 | 0.932 | 0.872 | 0.864 | 0.714 | 0.841 |
| Rolling 7 | 0.897 | 0.944 | 0.943 | 0.920 | 0.894 | 0.843 | 0.833 | 0.851 |
| Rolling 8 | 0.913 | 0.957 | 0.950 | 0.886 | 0.831 | 0.875 | 0.785 | 0.808 |
| Rolling 9 | 0.928 | 0.935 | 0.916 | 0.846 | 0.918 | 0.800 | 0.783 | 0.785 |
| Rolling 10 | 0.912 | 0.938 | 0.944 | 0.885 | 0.848 | 0.866 | 0.705 | 0.763 |
As shown in Table 14, the columns are the label types and the rows are the number of loop segments. In this experiment the total amount of test data is the same as in the control group, which shows that merely dispersing the test set and increasing its diversity can indeed improve the results. Only the SN type scores below the control group, which may be related to the fact that much of its data consists of special cases. The table also shows no significant relationship between the best result and the number of segments; it is presumably related more to the diversity of the test-set distribution.
4.4.3 Advanced comparison of effectiveness evaluation of loop training
To further increase diversity, different ways of using partial data were designed so that more data has the opportunity to serve as test data. To avoid partial data slices becoming so small that the model simply memorizes answers, this experiment uses the All type only. The control group is the basic neural network model together with method one from the previous experiment. In method one, every record is used exactly once, whether for training or testing. In method two, every record is placed in the training set once. In method three, every record is used once in the test set. In method four, every record is likewise used once in the test set, but the data used in each round serves as the next round's training set, and a fixed proportion of the newer data is taken as the test set; some records at the beginning and end may be skipped.
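Method three, in which every record appears once in the test set while training always uses only earlier rows, resembles scikit-learn's `TimeSeriesSplit`. The snippet below shows that analogy on dummy data; it is an illustration, not the thesis's own implementation.

```python
import numpy as np
from sklearn.model_selection import TimeSeriesSplit

# 24 time-ordered dummy samples; with n_splits=3 each of the last three
# 6-row segments appears exactly once as a test fold.
X = np.arange(24).reshape(-1, 1)
for train_idx, test_idx in TimeSeriesSplit(n_splits=3).split(X):
    # every test fold starts strictly after the training rows end
    print(train_idx.min(), train_idx.max(), "->", test_idx.min(), test_idx.max())
```

Note that, as in method four's caveat, the earliest segment never serves as test data, because there would be nothing earlier to train on.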
Table 15 Advanced experimental results of loop training

| Data volume | Training set | Test set | NN Base | Method 1 (Base) | Method 2 | Method 3 | Method 4 |
|---|---|---|---|---|---|---|---|
| 1/2 | 9796 | 2450 | 0.894 | 0.913 | 0.917 | 0.922 | 0.899 |
| 1/3 | 6531 | 1633 | | 0.920 | 0.931 | 0.923 | 0.841 |
| 1/4 | 4898 | 1225 | | 0.911 | 0.945 | 0.942 | 0.864 |
| 1/5 | 3918 | 980 | | 0.909 | 0.913 | 0.923 | 0.862 |
| 1/6 | 3265 | 817 | | 0.904 | 0.958 | 0.951 | 0.893 |
| 1/7 | 2798 | 700 | | 0.897 | 0.899 | 0.962 | 0.767 |
| 1/8 | 2448 | 613 | | 0.913 | 0.941 | 0.928 | 0.795 |
| 1/9 | 2176 | 545 | | 0.928 | 0.917 | 0.919 | 0.791 |
| 1/10 | 1959 | 490 | | 0.912 | 0.918 | 0.926 | 0.796 |
As shown in Table 15, the columns are the per-round training and test data volumes of methods one to three, the control group, and the various methods; the rows are the proportion of data used in each loop round. The table shows that methods two and three outperform the control group, but on the numbers alone neither has a clear advantage over the other, and the research data is insufficient for further verification. Method four is generally lower than the control group; analysis suggests the model develops problems because every round of training reaches back to the earliest data. Observing the learning curves, method two's curve is relatively normal, while method three's curve is suspected of overfitting. Method two with a data volume of 1/6 is therefore selected as the better parameter setting.
4.5 Evaluation of the effectiveness of model optimization
The experiments in this section aim to optimize the neural network model, which has the best overall performance; the results are sorted, discussed, and compared.
4.5.1 Comparison of effectiveness evaluation of adjusting the number of epochs
This experiment focuses on the epoch parameter and observes the changes in the learning curve.
As shown in Fig. 7, Fig. 7a is the learning-curve loss graph and Fig. 7b is the learning-curve accuracy graph. Fig. 7a shows that after its turning point the loss curve rises slowly and no longer falls after epoch 60. Fig. 7b shows that the accuracy curve fluctuates constantly with no obvious overall trend; it peaks at epoch 35 and declines after epoch 60. Fig. 7 also shows a gap between the training and test curves and that convergence is reached quickly: with too many epochs, the neural network model may overfit the answers.
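The behaviour described above, stopping before the validation loss starts to climb, is the standard early-stopping heuristic. Below is a minimal, framework-free sketch of that logic (the loss values are made up for illustration; the thesis's training loop is not shown).

```python
def early_stop_epoch(val_losses, patience=5):
    """Return the epoch with the best validation loss, stopping the scan
    once the loss has failed to improve for `patience` consecutive epochs."""
    best, best_epoch = float("inf"), 0
    for epoch, loss in enumerate(val_losses):
        if loss < best:
            best, best_epoch = loss, epoch
        elif epoch - best_epoch >= patience:
            return best_epoch  # no improvement for `patience` epochs
    return best_epoch

# validation loss falls, bottoms out at epoch 3, then slowly rises
losses = [1.0, 0.8, 0.7, 0.65, 0.66, 0.70, 0.72, 0.75, 0.8]
print(early_stop_epoch(losses, patience=3))  # 3
```

Deep-learning frameworks provide the same idea as a callback (e.g. Keras's `EarlyStopping` with a `patience` argument), which restores the weights from the best epoch rather than the last one.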
4.5.2 Comparison of effectiveness evaluation of adjusting the test set proportion
This experiment adjusts the test-set ratio and observes the changes in accuracy.
Table 16 Test ratio experiment results

| | All | BOX | CARTON | CB_SN | FCC | PALLET | SN | Others |
|---|---|---|---|---|---|---|---|---|
| 0.1 | 0.917 | 0.934 | 0.912 | 0.914 | 0.831 | 0.878 | 0.833 | 0.855 |
| 0.15 | 0.893 | 0.888 | 0.929 | 0.954 | 0.843 | 0.880 | 0.788 | 0.871 |
| 0.2 | 0.894 | 0.906 | 0.868 | 0.945 | 0.845 | 0.896 | 0.841 | 0.871 |
| 0.25 | 0.806 | 0.892 | 0.874 | 0.947 | 0.803 | 0.879 | 0.803 | 0.858 |
| 0.3 | 0.813 | 0.822 | 0.881 | 0.939 | 0.748 | 0.875 | 0.792 | 0.884 |
As shown in Table 16, the columns are the label types and the rows are the test ratios. For the All type, 0.1 > 0.2 > 0.15 > 0.3 > 0.25; for BOX, 0.1 > 0.2 > 0.25 > 0.15 > 0.3; for CARTON, 0.15 > 0.1 > 0.3 > 0.25 > 0.2; for CB_SN, 0.15 > 0.25 > 0.2 > 0.3 > 0.1; for FCC, 0.2 > 0.15 > 0.1 > 0.25 > 0.3; for PALLET, 0.2 > 0.15 > 0.25 > 0.1 > 0.3; for SN, 0.2 > 0.1 > 0.25 > 0.3 > 0.15; for Others, 0.3 > 0.2 = 0.15 > 0.25 > 0.1.
At first glance, the smaller the test ratio, the higher the accuracy. Analysis shows, however, that the high accuracy at small test ratios is an illusion: the smaller the test ratio, the lower the diversity of the test data, and the harder it is to verify correctness. Conversely, if the test ratio is too high, the diversity of the training data becomes too low and accuracy drops sharply, so 0.2 is ultimately the better test ratio.
Table 17 Test ratios in the loop training experiment

| | NN Base | Rolling(1,9) | Rolling(2,6) | Rolling(3,7) |
|---|---|---|---|---|
| 0.1 | 0.917 | 0.963 | 0.970 | 0.834 |
| 0.15 | 0.893 | 0.929 | 0.936 | 0.923 |
| 0.2 | 0.894 | 0.928 | 0.958 | 0.962 |
| 0.25 | 0.806 | 0.892 | 0.932 | 0.940 |
| 0.3 | 0.813 | 0.874 | 0.933 | 0.923 |
As shown in Table 17, because there are many combinations of loop-training methods and data volumes, this experiment takes only the parameter sets that performed well in the earlier loop-training experiments for further trials. The columns are the control group and the loop-training parameters, where the first parameter is the method and the second is the denominator of the data volume; the rows are the test-set ratios. Under NN Base, 0.1 > 0.2 > 0.15 > 0.3 > 0.25; under Rolling(1,9), 0.1 > 0.15 > 0.2 > 0.25 > 0.3; under Rolling(2,6), 0.1 > 0.2 > 0.15 > 0.3 > 0.25; under Rolling(3,7), 0.2 > 0.25 > 0.15 = 0.3 > 0.1. Overall, 0.2 gives the better result. As in the previous experiment, the high accuracy at small test ratios is an illusion caused by low test-set diversity, while too high a ratio starves the training data, so 0.2 remains the better test ratio.
4.5.3 Comparison of effectiveness evaluation of adjusting the activation function
This experiment adjusts the activation function and observes the changes in accuracy.
Table 18 Activation function experiment results

| | All | BOX | CARTON | CB_SN | FCC | PALLET | SN | Others |
|---|---|---|---|---|---|---|---|---|
| softmax | 0.129 | 0.124 | 0.233 | 0.244 | 0.128 | 0.170 | 0.185 | 0.141 |
| sigmoid | 0.882 | 0.880 | 0.874 | 0.926 | 0.794 | 0.876 | 0.829 | 0.831 |
| elu | 0.885 | 0.896 | 0.877 | 0.946 | 0.835 | 0.894 | 0.838 | 0.852 |
| relu | 0.894 | 0.906 | 0.868 | 0.945 | 0.845 | 0.896 | 0.841 | 0.871 |
| selu | 0.891 | 0.919 | 0.887 | 0.951 | 0.855 | 0.894 | 0.826 | 0.874 |
As shown in Table 18, the columns are the label types and the rows are the activation functions. For All, relu > selu > elu > sigmoid > softmax; for BOX, selu > relu > elu > sigmoid > softmax; for CARTON, selu > elu > sigmoid > relu > softmax; for CB_SN, selu > elu > relu > sigmoid > softmax; for FCC, selu > relu > elu > sigmoid > softmax; for PALLET, relu > selu = elu > sigmoid > softmax; for SN, relu > elu > sigmoid > selu > softmax; for Others, selu > relu > elu > sigmoid > softmax. Overall, relu ≈ selu > elu > sigmoid > softmax, but the differences among relu, selu, elu, and sigmoid are small: less than 1% for All, about 4% for BOX, about 2% for CARTON, about 2.5% for CB_SN, about 6% for FCC, about 2% for PALLET, about 1.5% for SN, and about 4% for Others. Since relu also executes roughly 20% faster, relu is the better setting.
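For reference, the hidden-layer activations compared above can be written out directly. The definitions below are standard (the SELU constants come from Klambauer et al., 2017); they illustrate why softmax, which normalizes a whole vector into a probability distribution, is a poor choice for hidden layers even though it is the usual output layer for classification.

```python
import numpy as np

def relu(x):
    return np.maximum(0.0, x)

def elu(x, a=1.0):
    return np.where(x > 0, x, a * (np.exp(x) - 1))

def selu(x, a=1.6733, s=1.0507):
    # scaled ELU: the constants make activations self-normalizing
    return s * np.where(x > 0, x, a * (np.exp(x) - 1))

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def softmax(x):
    e = np.exp(x - np.max(x))  # shift for numerical stability
    return e / e.sum()

x = np.array([-1.0, 0.0, 2.0])
print(relu(x))  # [0. 0. 2.]
```

relu is also the cheapest of the five to evaluate (a single comparison per unit, no exponential), which is consistent with the ~20% speed difference noted above.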
Table 19 Activation functions in the loop training experiment

| | NN Base | Rolling(1,9) | Rolling(2,6) | Rolling(3,7) |
|---|---|---|---|---|
| softmax | 0.129 | 0.278 | 0.365 | 0.763 |
| sigmoid | 0.882 | 0.897 | 0.946 | 0.957 |
| elu | 0.885 | 0.908 | 0.944 | 0.951 |
| relu | 0.894 | 0.928 | 0.958 | 0.962 |
| selu | 0.891 | 0.910 | 0.949 | 0.875 |
As shown in Table 19, because there are many combinations of loop-training methods and data volumes, this experiment takes only the parameter sets that performed well in the earlier loop-training experiments for further trials.
The columns are the control group and the loop-training parameters, where the first parameter is the method and the second is the denominator of the data volume; the rows are the activation functions. Under NN Base, relu > selu > elu > sigmoid > softmax; under Rolling(1,9), relu > selu > elu > sigmoid > softmax; under Rolling(2,6), relu > selu > sigmoid > elu > softmax; under Rolling(3,7), relu > sigmoid > elu > selu > softmax. Overall, relu gives the best result. The differences among relu, selu, elu, and sigmoid are small: about 3% under Rolling(1,9), about 1.5% under Rolling(2,6), and about 9% under Rolling(3,7). Since relu performs better and executes faster, relu is the better setting.
4.5.4 Comparison of effectiveness evaluation of adjusting the pruning ratio
This experiment adjusts the pruning (dropout) ratio and observes the changes in accuracy.
Table 20 Experimental results of pruning ratio

| | All | BOX | CARTON | CB_SN | FCC | PALLET | SN | Others |
|---|---|---|---|---|---|---|---|---|
| 0.1 | 0.893 | 0.899 | 0.887 | 0.943 | 0.842 | 0.892 | 0.835 | 0.855 |
| 0.15 | 0.885 | 0.903 | 0.875 | 0.948 | 0.837 | 0.894 | 0.829 | 0.858 |
| 0.2 | 0.887 | 0.893 | 0.881 | 0.951 | 0.837 | 0.894 | 0.832 | 0.855 |
| 0.25 | 0.894 | 0.906 | 0.868 | 0.945 | 0.845 | 0.896 | 0.841 | 0.871 |
| 0.3 | 0.889 | 0.905 | 0.875 | 0.946 | 0.837 | 0.892 | 0.835 | 0.858 |
As shown in Table 20, the columns are the label types and the rows are the pruning ratios. For the All type, 0.25 > 0.1 > 0.3 > 0.2 > 0.15; for BOX, 0.25 > 0.3 > 0.15 > 0.1 > 0.2; for CARTON, 0.1 > 0.2 > 0.15 = 0.3 > 0.25; for CB_SN, 0.2 > 0.15 > 0.3 > 0.25 > 0.1; for FCC, 0.25 > 0.1 > 0.15 = 0.2 = 0.3; for PALLET, 0.25 > 0.15 = 0.2 > 0.1 = 0.3; for SN, 0.25 > 0.1 = 0.3 > 0.2 > 0.15; for Others, 0.25 > 0.15 = 0.3 > 0.1 = 0.2. Overall, the results for different pruning ratios differ little: less than 1% for All, about 1% for BOX, about 2% for CARTON, about 1% for CB_SN, about 1% for FCC, about 0.5% for PALLET, about 1% for SN, and about 1.5% for Others. Pruning is applied to prevent a neural network with too many neurons from overfitting by memorizing answers. Across types, the gap between the best and worst setting is only about 1%, with no clear advantage either way; the model in this study is not overfitting, so little can be gained from this parameter. Overall, 0.25 performs best.
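The mechanism being tuned here can be sketched as inverted dropout, the variant most frameworks implement: during training a `rate` fraction of neurons is zeroed and the survivors are rescaled, so nothing needs to change at inference time. This is a generic illustration, not the thesis's code.

```python
import numpy as np

def dropout(activations, rate, rng):
    """Inverted dropout: zero a `rate` fraction of activations and divide
    the rest by (1 - rate) so the expected activation is unchanged."""
    mask = rng.random(activations.shape) >= rate
    return activations * mask / (1.0 - rate)

rng = np.random.default_rng(0)
a = np.ones(10_000)
out = dropout(a, rate=0.25, rng=rng)
print(out.mean())  # close to 1.0 in expectation
```

Because each forward pass sees a different random sub-network, no single neuron can memorize an answer on its own, which is the overfitting defence discussed above.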
Table 21 Pruning ratios in the loop training experiment

| | NN Base | Rolling(1,9) | Rolling(2,6) | Rolling(3,7) |
|---|---|---|---|---|
| 0.1 | 0.893 | 0.915 | 0.957 | 0.957 |
| 0.15 | 0.885 | 0.928 | 0.954 | 0.955 |
| 0.2 | 0.887 | 0.926 | 0.958 | 0.958 |
| 0.25 | 0.894 | 0.915 | 0.959 | 0.961 |
| 0.3 | 0.889 | 0.921 | 0.955 | 0.960 |
As shown in Table 21, because there are many combinations of loop-training methods and data volumes, this experiment takes only the parameter sets that performed well in the earlier loop-training experiments for further trials. The columns are the control group and the loop-training parameters, where the first parameter is the method and the second is the denominator of the data volume; the rows are the pruning ratios. Under NN Base, 0.25 > 0.1 > 0.3 > 0.2 > 0.15; under Rolling(1,9), 0.15 > 0.2 > 0.3 > 0.1 = 0.25; under Rolling(2,6), 0.25 > 0.2 > 0.1 > 0.3 > 0.15; under Rolling(3,7), 0.25 > 0.3 > 0.2 > 0.1 > 0.15. Overall, 0.25 gives the best result. The differences among pruning ratios are small: about 1% under Rolling(1,9) and about 0.5% under both Rolling(2,6) and Rolling(3,7). Pruning is applied to prevent a neural network with too many neurons from overfitting by memorizing answers; here the gap between the best and worst setting is only about 1%, with no clear advantage either way. The model in this study is not overfitting even after loop training, so little can be gained from this parameter; 0.25 remains the better setting.
4.5.5 Comparison of effectiveness evaluation of adjusting the network depth
After transformation, the input layer of the data in this study has a dimension of about 1,000 and the output layer a dimension of about 256. This experiment adjusts the number of hidden layers and observes the changes in accuracy. Because the amount of data in this study is limited, too deep a network would have too many parameters and overfit, so at most three hidden layers are tested.
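The parameter-count concern above can be made concrete. The sketch below uses the approximate input (1,000) and output (256) dimensions quoted in the text; the hidden width of 512 is an assumption for illustration, since the thesis does not state it here.

```python
def parameter_count(input_dim, hidden_dim, output_dim, n_hidden):
    """Weights + biases of a fully connected network with `n_hidden`
    equally sized hidden layers."""
    total, prev = 0, input_dim
    for _ in range(n_hidden):
        total += prev * hidden_dim + hidden_dim  # weights + biases
        prev = hidden_dim
    total += prev * output_dim + output_dim  # output layer
    return total

for depth in (1, 2, 3):
    print(depth, parameter_count(1000, 512, 256, depth))
```

Each extra hidden layer adds roughly 262k parameters at this width, so with a training set of only a few thousand rows the parameter count quickly dwarfs the data, which is why depths beyond three were not tried.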
Table 22 Network depth experiment results

| | All | BOX | CARTON | CB_SN | FCC | PALLET | SN | Others |
|---|---|---|---|---|---|---|---|---|
| 1 | 0.889 | 0.889 | 0.887 | 0.926 | 0.790 | 0.890 | 0.838 | 0.844 |
| 2 | 0.894 | 0.906 | 0.868 | 0.945 | 0.845 | 0.896 | 0.841 | 0.871 |
| 3 | 0.892 | 0.908 | 0.883 | 0.960 | 0.833 | 0.899 | 0.823 | 0.852 |
As shown in Table 22, the columns are the label types and the rows are the number of hidden layers. For the All type, 2 > 3 > 1; for BOX, 3 > 2 > 1; for CARTON, 1 > 3 > 2; for CB_SN, 3 > 2 > 1; for FCC, 2 > 3 > 1; for PALLET, 3 > 2 > 1; for SN, 2 > 1 > 3; for Others, 2 > 3 > 1. Overall, the differences between depths are small: less than 1% for All, about 2% for BOX, about 2% for CARTON, about 3.5% for CB_SN, about 5.5% for FCC, about 1% for PALLET, about 2% for SN, and about 3% for Others. Deeper networks help with more complex problems, but excessive depth can cause overfitting. The table shows that across types the best and worst depths differ little, with no clear advantage either way. The problem addressed in this research is not especially complicated, and the neural network learns well with the basic structure, so little can be gained from this parameter. Overall, two hidden layers perform best.
Table 23 Network depth in the loop training experiment

| | NN Base | Rolling(1,9) | Rolling(2,6) | Rolling(3,7) |
|---|---|---|---|---|
| 1 | 0.889 | 0.913 | 0.958 | 0.964 |
| 2 | 0.894 | 0.915 | 0.959 | 0.961 |
| 3 | 0.892 | 0.919 | 0.954 | 0.960 |
Because there are many combinations of loop-training methods and data volumes, this experiment takes only the parameter sets that performed well in the earlier loop-training experiments for further trials. As shown in Table 23, the columns are the control group and the loop-training parameters, where the first parameter is the method and the second is the denominator of the data volume; the rows are the number of hidden layers. Under NN Base, 2 > 3 > 1; under Rolling(1,9), 3 > 2 > 1; under Rolling(2,6), 2 > 1 > 3; under Rolling(3,7), 1 > 2 > 3. Overall, two hidden layers perform slightly better, but the differences are very small: about 0.5% under each of Rolling(1,9), Rolling(2,6), and Rolling(3,7).
Deeper networks help with more complex problems, but excessive depth can cause overfitting. The gap between the best and worst depth is only about 0.5%, with no clear advantage either way. The problem addressed in this study is not especially complicated, and even after loop training the neural network learns well with the basic structure, so little can be gained from this parameter; two hidden layers remain the better setting.