Development of a reliable data-driven model may include four tasks:
- Mathematical Structure Definition
- Parameter Identification
- Overfitting Avoidance
- Cross Validation
Up to three separate data sets (modelling, validation and test data) were used to perform the listed tasks for each problem defined in section 2. Generally speaking, the purpose of these four tasks is to minimise the error, ‘the discrepancy between the real output, from an experiment, and the estimated output by the model’. Depending on whether the error is calculated with the modelling, validation or test data, it is called the modelling, validation or test error, respectively. Eq. (3) mathematically defines the error in this research [23]:
$$E=\frac{\sum\limits_{i=1}^{n_d}{\left(\hat{y}_i-y_i\right)^2}}{n_d}. \quad (3)$$
where y is an output, nd is the number of samples in the data set used to calculate the error, and ^ denotes estimated values. The four aforementioned tasks of data-driven modelling are briefly introduced in the following:
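As a concrete illustration, the error of Eq. (3) can be computed in a few lines of Python; this is a sketch, as the paper itself provides no code:

```python
import numpy as np

def mean_squared_error(y_hat, y):
    """Error of Eq. (3): mean of squared deviations between the
    estimated outputs (y_hat) and the measured outputs (y)."""
    y_hat = np.asarray(y_hat, dtype=float)
    y = np.asarray(y, dtype=float)
    n_d = y.size  # number of samples in the data set used for the error
    return np.sum((y_hat - y) ** 2) / n_d

# Example: error over three samples
print(mean_squared_error([1.1, 1.9, 3.2], [1.0, 2.0, 3.0]))  # ≈ 0.02
```

Computed with the modelling, validation or test data, the same formula yields the modelling, validation or test error, respectively.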
Mathematical Structure Definition
In some models, the mathematical structure is not certain from the beginning. For instance, in a neuro-fuzzy network (or, in short, a fuzzy model), the number of rules can be defined from the modelling data through subtractive clustering; similarly, in exact RBFNs, the size of the model depends on the modelling data.
Parameter Identification
The parameters of a data-driven model with a known mathematical structure are identified using the modelling data. Methods of parameter identification generally minimise the modelling error and fall into two categories: single-step and iterative methods. Some models, e.g. linear and RBFN models, use single-step identification methods such as non-recursive least square of error (LSE) [24]. In iterative methods, e.g. those based on error propagation [25], the parameters are tuned step by step to minimise the modelling error (also known as the training error, as detailed in Appendix A of [2]).
Overfitting Avoidance
Overfitting refers to an excessive focus on decreasing the modelling error, which diminishes the generality of data-driven models [26, 27]. In iterative parameter identification, e.g. for MLPs, FCCs and neuro-fuzzy networks, at each iteration the error is calculated for both the modelling and the validation data sets, although the latter is not used for parameter identification. A discrepancy in the trends of these two errors (typically, an increase in the validation error alongside an ongoing decrease in the modelling error) is considered a sign of overfitting and triggers the stop of parameter identification [2]. In models with single-step parameter identification, e.g. RBFNs, some specific parameters are identified with the validation data rather than the modelling data to avoid overfitting [4].
Cross Validation
Any data-driven model should fulfil the requirements of cross validation. In this paper, one-round cross validation, also known as hold-out, was employed, which requires that the estimation error of the model calculated with the test data (used neither in parameter identification nor in overfitting avoidance) is acceptable [28]. In short, the test error should be reasonably low for a model to be cross validated. It should be noted that the validation data were not used to perform cross validation.
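The hold-out scheme described above can be sketched as follows; the data-set size and the 60/20/20 proportions are illustrative assumptions, not values from this research:

```python
import numpy as np

# A minimal sketch of the one-round (hold-out) scheme: three disjoint data
# sets, with the test set kept apart from both parameter identification and
# overfitting avoidance. Sizes and proportions are illustrative assumptions.
rng = np.random.default_rng(0)
idx = rng.permutation(100)          # indices of 100 samples, shuffled
modelling_idx = idx[:60]            # parameter identification (and structure definition)
validation_idx = idx[60:80]         # overfitting avoidance only
test_idx = idx[80:]                 # cross validation only, never seen in training
print(len(modelling_idx), len(validation_idx), len(test_idx))  # 60 20 20
```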
Six types of data-driven models were developed in this research to tackle the problems detailed in section 2. In the following subsections, a brief explanation of each model is presented, with a focus on the four aforementioned tasks of data-driven modelling and on the correct use of the modelling and the validation data. All the developed models have a single output, y, and n inputs, ui, i = 1,…,n.
3.1. Linear Models
In these models, the output is a linear combination of the inputs:
$$y=\sum\limits_{i=1}^{n}{{\mathbf{A}}_i u_i}+{\mathbf{A}}_{n+1}. \quad (4)$$
Nothing needs to be done to define the mathematical structure of this model (i.e. task 1 in the list at the beginning of section 3), as the structure is evident. The model parameters (elements of A) were identified with the single-step LSE method [24]. Overfitting was disregarded in the development of (4) (i.e. task 3 was not performed); thus, both the modelling and the validation data were used for parameter identification.
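The single-step identification of (4) can be sketched as follows, on synthetic data, with NumPy's least-squares solver standing in for the LSE method of [24]:

```python
import numpy as np

# Identify A for the linear model of Eq. (4), y = sum_i A_i u_i + A_{n+1},
# with a single-step least-squares solve. Data are synthetic (noiseless).
rng = np.random.default_rng(1)
U = rng.uniform(size=(50, 2))                    # 50 samples, n = 2 inputs
y = 3.0 * U[:, 0] - 1.5 * U[:, 1] + 0.5          # known ground-truth coefficients

U_aug = np.hstack([U, np.ones((50, 1))])         # column of 1s carries the bias A_{n+1}
A, *_ = np.linalg.lstsq(U_aug, y, rcond=None)    # single-step LSE
print(np.round(A, 3))                            # ≈ [ 3.  -1.5  0.5]
```

Because the synthetic data are noiseless, the solver recovers the ground-truth coefficients exactly (up to floating-point precision).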
3.2. Multi-layer Perceptrons (MLPs)
The employed MLPs have one hidden layer with m neurons and activation function φ:
$$y=\sum\limits_{j=1}^{m}{{\mathbf{B}}_j\,\phi\left(\sum\limits_{i=1}^{n}{{\mathbf{C}}_{ij}u_i}+{\mathbf{D}}_j\right)}+{\mathbf{D}}_{m+1}, \quad (5)$$
where
$$\phi(x)=\frac{2}{1+\exp(-2x)}-1. \quad (6)$$
MLPs, presented by (5) and (6), are universal approximators; that is, the model has a proven capability to model any system when sufficient data are available [29, 30].
In this research, m = 2n + 1 (7), based on the recommendation of [25]. Considering (7), the mathematical structure is known. The Nguyen-Widrow algorithm was used to suggest initial values for the parameters [31]. Then, error back propagation with the Levenberg-Marquardt algorithm [32] was utilised to minimise the modelling error iteratively and thereby identify the MLP parameters. At each iteration, the validation error was calculated; parameter identification stopped once the trends of the modelling and the validation errors became discrepant, i.e. once overfitting happened. Even with parameter initialisation algorithms, some initial values may cause the parameter identification method to become trapped in a local minimum of the modelling error, leading to a low-accuracy model [33]. Consequently, parameter identification was repeated with different initial parameters, and the model with the lowest validation error was chosen in the end.
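The discrepant-trend stopping rule can be illustrated with a small helper function. The `patience` threshold and the list-based interface are illustrative assumptions, since the text does not specify how many iterations of rising validation error trigger the stop:

```python
def should_stop(modelling_errors, validation_errors, patience=3):
    """Early-stopping sketch: stop when the validation error has risen for
    `patience` consecutive iterations while the modelling error kept
    falling -- the discrepant-trend sign of overfitting."""
    if len(validation_errors) <= patience:
        return False
    recent_val = validation_errors[-(patience + 1):]
    recent_mod = modelling_errors[-(patience + 1):]
    val_rising = all(b > a for a, b in zip(recent_val, recent_val[1:]))
    mod_falling = all(b < a for a, b in zip(recent_mod, recent_mod[1:]))
    return val_rising and mod_falling

# Both errors still falling together: keep identifying parameters
print(should_stop([5, 4, 3, 2, 1], [5, 4, 3, 2.5, 2]))      # False
# Modelling error falls while validation error climbs: stop
print(should_stop([5, 4, 3, 2, 1], [5, 4, 4.2, 4.5, 5.0]))  # True
```

The restart strategy of the paragraph above then simply keeps, among the restarted runs, the model whose final validation error is lowest.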
3.3. Fully Connected Cascade (FCC) Networks
The employed FCC networks are very similar to the MLPs, with extra parameters (the elements of E) which connect the inputs directly to the output:
$$y=\sum\limits_{j=1}^{m}{\bar{\mathbf{B}}_j\,\phi\left(\sum\limits_{i=1}^{n}{\bar{\mathbf{C}}_{ij}u_i}+\bar{\mathbf{D}}_j\right)}+\sum\limits_{i=1}^{n}{{\mathbf{E}}_i u_i}+\bar{\mathbf{D}}_{m+1}. \quad (8)$$
FCC networks have shown their merit in solving some non-engineering benchmarks [34]. The number of hidden layer neurons, m, was considered the same as for the MLPs, as the recommendation of (7) is also valid for FCC networks [34]. Parameter identification, overfitting avoidance and evasion of local minima of the modelling error in FCC networks were handled in the same way as for the MLPs.
3.4. Neuro-fuzzy Networks
Linear Sugeno-type fuzzy models, which are convertible to neuro-fuzzy networks [35], were used in this research. Such fuzzy models have k rules, each with n membership functions (one per input). For the jth rule and the ith input, the Gaussian membership function of (9) was employed to produce a membership grade, μij, from the input ui [36]:
$$\mu_{ij}=\exp\left(-\frac{\left(u_i-\mathbf{F}_{ij}\right)^2}{2\mathbf{G}_{ij}^{2}}\right). \quad (9)$$
The product of the membership grades of a rule was considered as the weight of that rule, a number between zero and one. The output of the whole model is the weighted average of the rule outputs [36]:
$$y=\frac{\sum\limits_{j=1}^{k}{\left(\overbrace{\left(\sum\limits_{i=1}^{n}{\mathbf{H}_{ij}u_i}+\mathbf{I}_j\right)}^{j\text{th rule output}}\prod\limits_{i=1}^{n}{\mu_{ij}}\right)}}{\sum\limits_{j=1}^{k}{\underbrace{\prod\limits_{i=1}^{n}{\mu_{ij}}}_{j\text{th rule weight}}}}. \quad (10)$$
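As an illustration, the forward pass of (9) and (10) can be evaluated for a single sample as follows; this is a sketch of the model equations, not the authors' code, and the array shapes are assumptions:

```python
import numpy as np

def sugeno_output(u, F, G, H, I):
    """Evaluate Eqs. (9)-(10) for one sample. u holds n inputs; F and G
    (n x k) hold the Gaussian centres and widths; H (n x k) and I (k,)
    hold the linear rule-output parameters."""
    mu = np.exp(-(u[:, None] - F) ** 2 / (2.0 * G ** 2))   # Eq. (9), n x k grades
    w = np.prod(mu, axis=0)                                # rule weights in (0, 1]
    rule_out = u @ H + I                                   # linear Sugeno rule outputs
    return np.sum(w * rule_out) / np.sum(w)                # Eq. (10), weighted average

# A rule centred exactly on the input dominates the output
u = np.array([0.0, 1.0])
F = np.array([[0.0, 5.0], [1.0, 5.0]])   # rule 1 centred on u, rule 2 far away
G = np.ones((2, 2))
H = np.array([[1.0, 1.0], [1.0, 1.0]])
I = np.array([2.0, -2.0])
print(sugeno_output(u, F, G, H, I))      # ≈ 3.0, i.e. rule 1's output 0 + 1 + 2
```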
Neuro-fuzzy models, presented by (9) and (10), are universal approximators [37]. The mathematical structure of the fuzzy model, e.g. the number of rules (k), was defined through subtractive clustering with use of the modelling data; the utilised subtractive clustering algorithm is similar to the one detailed in subsection 2.3 of [38].
The parameters were identified using an iterative method. At each iteration, the gradient descent error back propagation algorithm was used to adjust the elements of F and G, and LSE was used to adjust the elements of H and I [39, 40]. The validation error, calculated at every iteration, was used to stop the parameter identification procedure and thus avoid overfitting, in the same way as for the MLPs, detailed in subsection 3.2.
3.5. Radial Basis Function Networks
RBFNs, which are universal approximators too [41], are presented as the combination of (11) and (12). They receive an array of inputs rather than the inputs of a single data sample; each data sample has n inputs. An RBFN can estimate the outputs of at most w data samples, where w is the number of data samples used to develop the model. If the inputs of a smaller number of data samples, z, are fed into the model, the first z columns of O and L are used.
$$\mathbf{O}_{ik}=\exp\left(-\left(S\underbrace{\sum\limits_{j=1}^{n}{\left(\mathbf{J}_{ij}-\mathbf{U}_{jk}\right)^2}}_{\text{distance between input and weight arrays}}\right)^{2}\right). \quad (11)$$
$$\hat{\mathbf{Y}}_{1\times w}=\mathbf{K}_{1\times w}\,\mathbf{O}_{w\times w}+\mathbf{L}_{1\times w}. \quad (12)$$
Eq. (12) indicates that greater elements of O are more influential on the network's output. In addition, (11) shows that (i) the range of the elements of O is [0, 1] and (ii) if the ith row of J is identical to the kth column of U, then Oik reaches its maximum, 1.
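A sketch of the forward pass of (11) and (12), following the paper's notation, is given below; the function name and the array-based interface are illustrative assumptions:

```python
import numpy as np

def rbfn_output(U_in, J, K, L, S):
    """Evaluate Eqs. (11)-(12). U_in is n x z (z input samples), J is w x n,
    K and L are 1 x w rows, S is the spread. Only the first z columns of L
    are used when z < w, as stated in the text."""
    z = U_in.shape[1]
    # Eq. (11): summed squared differences between each J row and each input
    # column, scaled by S and squared again inside the exponential
    d2 = ((J[:, :, None] - U_in[None, :, :]) ** 2).sum(axis=1)   # w x z
    O = np.exp(-(S * d2) ** 2)
    return K @ O + L[:, :z]                                      # Eq. (12), 1 x z

# If a row of J coincides with an input column, the matching O element is 1
J = np.array([[0.0, 0.0], [2.0, 2.0]])   # w = 2 weight rows, n = 2 inputs
K = np.array([[2.0, 1.0]])
L = np.array([[0.5, 0.5]])
U_in = np.array([[0.0], [0.0]])          # one sample identical to J's first row
print(rbfn_output(U_in, J, K, L, S=1.0)) # ≈ [[2.5]]: 2*1 + 1*~0 + 0.5
```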
In RBFN modelling, the arrays Jw×n, K and L and the scalar S, namely the ‘spread’, should be identified. At the model development stage, where the modelling data were used, (13) was used instead of (12); the ^ is unnecessary in (13), since no estimation happens during model development:
$$\mathbf{Y}_{1\times w}={\left[\mathbf{K}\;\;\mathbf{L}\right]_{1\times 2w}}{\left[\begin{array}{c}\mathbf{O}\\ \mathbf{I}\end{array}\right]_{2w\times w}}. \quad (13)$$
In exact RBFNs, J = UMT (14), where UMT is the transpose of the array of all inputs of the modelling data. Hence, w equals the number of modelling data samples, and the mathematical structure is known from the beginning. For instance, for the second problem of section 2, UMT has the size 2×30. In order to maximise the effect of the elements of O on the output, all of them, calculated with (11) and the inputs of the modelling data, were considered to be 1. Here is a pseudo-algorithm of exact RBFN modelling (to find J, K, L and S using the input and output arrays of the modelling data, UM and YM):
1. Set J = UMT.
2. Set Ow×w = ones(w×w).
3. Form and solve (13) with YM and O from step 2 to find K1×w and L1×w.
4. Find S, by trial and error, so as to minimise the validation error of the developed RBFN (anti-overfitting step).
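Steps 1 to 3 of this pseudo-algorithm can be sketched as follows on synthetic modelling data; step 4 (tuning S against the validation error) is problem-specific and omitted. The minimum-norm least-squares solve is one plausible way, assumed here, to handle the underdetermined system that (13) becomes once O is forced to ones:

```python
import numpy as np

# Steps 1-3 of exact RBFN modelling on synthetic data (UM: n x w, YM: 1 x w).
rng = np.random.default_rng(2)
n, w = 2, 5
UM = rng.uniform(size=(n, w))
YM = rng.uniform(size=(1, w))

J = UM.T                                  # step 1: J = UM^T (w x n)
O = np.ones((w, w))                       # step 2: O elements considered to be 1
M = np.vstack([O, np.eye(w)])             # [O; I] of Eq. (13), 2w x w
KL, *_ = np.linalg.lstsq(M.T, YM.T, rcond=None)   # step 3: solve Eq. (13)
K, L = KL[:w].T, KL[w:].T                 # split the 1 x 2w row into K and L

# Eq. (13) is reproduced exactly on the modelling data
print(np.allclose(np.hstack([K, L]) @ M, YM))     # True
```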
An alternative to exact RBFN modelling is efficient RBFN modelling, which may produce RBFNs with fewer parameters. In this research, unlike in exact RBFNs, which employ the transpose of the inputs array of the modelling data as J, in efficient RBFN modelling some columns of UM were selected and transposed to form J [42]. Hence, the number of rows of J, w, is smaller than or equal to the number of columns of UM, named wmax in this paper.
Prior to selecting the UM columns to be used as J rows, S and a target error, Et, should be defined. For each pair of S and Et, every single column of UM was transposed and tried as a single-row J. Then, the corresponding RBFN was created using K and L calculated with (13). The column of UM leading to the smallest modelling error was selected, transposed and used as the first row of J. Afterwards, the remaining columns of UM were tested to find the one whose transpose, added to J, led to the largest drop in the modelling error; the transpose of that column was then added to J. This continued until the modelling error reached Et. Thus, the mathematical structure of efficient RBFNs is defined with use of the modelling data. In this research, the entire process of finding J was repeated for different pairs of S and Et, and the validation error was calculated for each pair.
Here is a pseudo-algorithm of efficient RBFN modelling:
1. J = null, Urem = UM, Uopt = null, E = VEX = 1000 (a large number), TJ = null (temporary weight matrix).
2. Choose a large S and a target modelling error, Et.
3. Set w = 1.
4. Set k = 1.
5. Add the transpose of the kth column of Urem to J to form TJ.
6. Calculate O from (11) with Urem, TJw×n and S defined at steps 5 and 2.
7. Solve (13) to find K and L (YM and O are available from the modelling data and step 6).
8. Find the modelling error, ME. The model needs to be run more than once, as w < wmax.
9. If ME < E, then E = ME and Uopt = Uk.
10. k = k + 1.
11. If k ≤ (wmax − w + 1), then go to step 5.
12. Remove Uopt from Urem and add it to J.
13. w = w + 1.
14. If E > Et, then go to step 4.
15. Find the validation error, VE.
16. If VE < VEX, then VEX = VE, SX = S and EtX = Et.
17. If VEX is unacceptable, go to step 2.
The choice of S and Et was performed using a full-space search with zooming (use of a smaller step size) in low-error areas.
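The greedy column selection at the core of the pseudo-algorithm (growing J one transposed UM column at a time until the modelling error reaches Et) can be sketched as below. The helper names are assumptions, and a single scalar bias is used in place of the paper's full L row, as a simplification:

```python
import numpy as np

def modelling_error(J, UM, YM, S):
    """Fit the output weights by least squares for the current J, then
    return the mean squared modelling error of Eq. (3). Distances and the
    spread S follow Eq. (11); a scalar bias replaces the full L row."""
    d2 = ((J[:, :, None] - UM[None, :, :]) ** 2).sum(axis=1)  # w x wmax
    O = np.exp(-(S * d2) ** 2)
    M = np.vstack([O, np.ones((1, O.shape[1]))])              # bias row
    KL, *_ = np.linalg.lstsq(M.T, YM.T, rcond=None)
    resid = KL.T @ M - YM
    return float((resid ** 2).mean())

def efficient_rbfn_J(UM, YM, S, E_t, max_rows=None):
    """Greedy selection: repeatedly add the UM column whose transpose gives
    the largest drop in modelling error, until the error reaches E_t."""
    wmax = UM.shape[1]
    chosen, J = [], np.empty((0, UM.shape[0]))
    while len(chosen) < (max_rows or wmax):
        errs = {k: modelling_error(np.vstack([J, UM[:, k]]), UM, YM, S)
                for k in range(wmax) if k not in chosen}
        k_best = min(errs, key=errs.get)
        chosen.append(k_best)
        J = np.vstack([J, UM[:, k_best]])
        if errs[k_best] <= E_t:
            break
    return J

# Illustrative run on synthetic modelling data (n = 2 inputs, wmax = 8 samples)
rng = np.random.default_rng(3)
UM = rng.uniform(size=(2, 8))
YM = rng.uniform(size=(1, 8))
J = efficient_rbfn_J(UM, YM, S=1.0, E_t=1e-3, max_rows=4)
print(J.shape)   # at most (4, 2): up to four transposed UM columns
```

Repeating this construction over a grid of S and Et pairs, and scoring each resulting network by its validation error, reproduces the outer loop of the pseudo-algorithm.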
Step 4 of the exact RBFN modelling pseudo-algorithm and steps 14–17 of the efficient RBFN modelling pseudo-algorithm use the validation data to tackle overfitting. Use of the modelling data at these steps would leave the model with no generalisation, and use of the test data would violate the conditions of cross validation.
3.6. Section Summary
Table 1 summarises the tasks performed in the development of each model and the data used for each task. MD and VD refer to the modelling and the validation data, respectively. The last two columns refer to avoidance of overfitting through different strategies: (1) stopping parameter identification in the case of discrepant trends of the modelling and validation errors, used for MLP, FCC and neuro-fuzzy networks, and (2) identifying some parameters with the validation data to improve the generality of the models (dual identification), used for RBFNs.
Table 1. Development stages for different models and their associated data
| Model  | Structure Definition | Parameter Identification | Overfitting Avoidance: Stop Process | Overfitting Avoidance: Dual Identification |
|--------|----------------------|--------------------------|-------------------------------------|--------------------------------------------|
| Linear |                      | MD + VD                  |                                     |                                            |
| MLP    |                      | MD                       | VD                                  |                                            |
| FCC    |                      | MD                       | VD                                  |                                            |
| Fuzzy  | MD                   | MD                       | VD                                  |                                            |
| RBFN   | MD                   | MD                       |                                     | VD                                         |