Glass Tube Manufacturing Process

Open Live Script

This example shows linear model identification of a glass tube manufacturing process. The experiments and the data are discussed in:

V. Wertz, G. Bastin and M. Heet: Identification of a glass tube drawing bench. Proc. of the 10th IFAC Congress, Vol 10, pp 334-339 Paper number 14.5-5-2. Munich August 1987.

The output of the process is the thickness and the diameter of the manufactured tube. The inputs are the air-pressure inside the tube and the drawing speed.

The problem of modeling the process from the input speed to the output thickness is described below. Various options for analyzing data and determining model order are discussed.

Experimental Data

We begin by loading the input and output data, saved as an iddata object:

loadthispe25.mat

The data are contained in the variableglass:

glass

glass = Time domain data set with 2700 samples. Sample time: 1 seconds Outputs Unit (if specified) Thickn Inputs Unit (if specified) Speed

Data has 2700 samples of one input (Speed) and one output (Thickn). The sample time is 1 sec.

For estimation and cross-validation purpose, split it into two halves:

ze = glass(1001:1500);%Estimation datazv = glass(1501:2000,:);%Validation data

A close-up view of the estimation data:

plot(ze(101:200))%Plot the estimation data range from samples 101 to 200.

Figure contains 2 axes objects. Axes object 1 with title Thickn contains an object of type line. This object represents untitled1. Axes object 2 with title Speed contains an object of type line. This object represents untitled1.

Preliminary Analysis of Data

Let us remove the mean values as a first preprocessing step:

ze = detrend(ze); zv = detrend(zv);

The sample time of the data is 1 second, while the process time constants might be much slower. We may detect some rather high frequencies in the output. In order to affirm this, let us first compute the input and output spectra:

sy = spa(ze(:,1,[])); su = spa(ze(:,[],1)); clf spectrum(sy,su) axis([0.024 10 -5 20]) legend({'Output','Input'}) gridon

Figure contains an axes object. The axes object with ylabel Power (dB) contains 2 objects of type line. These objects represent Output, Input.

Note that the input has very little relative energy above 1 rad/sec while the output contains relatively larger values above that frequency. There are thus some high frequency disturbances that may cause some problem for the model building.

We compute the impulse response, using part of the data to gain some insight into potential feedback and delay from input to output:

Imp = impulseest(ze,[],'negative',impulseestOptions('RegularizationKernel','SE')); showConfidence(impulseplot(Imp,-10:30),3) gridon

Figure contains an axes object. The axes object with title From: Speed To: Thickn contains 2 objects of type line. One or more of the lines displays its values using only markers This object represents Imp.

我们看到一个延迟大约12样本的冲动response (first significant response value outside the confidence interval), which is quite substantial. Also, the impulse response is not insignificant for negative time lags. This indicates that there is a good probability of feedback in the data, so that future values of output influence (get added to) the current inputs. The input delay may be calculated explicitly usingdelayest:

delayest(ze)

ans = 12

The probability of feedback may be obtained usingfeedback:

feedback(ze)%compute probability of feedback in data

ans = 100

Thus, it is almost certain that there is feedback present in the data.

We also, as a preliminary test, compute the spectral analysis estimate:

g = spa(ze); showConfidence(bodeplot(g)) gridon

Figure contains 2 axes objects. Axes object 1 with title From: Speed To: Thickn, ylabel Magnitude (dB) contains an object of type line. This object represents g. Axes object 2 with ylabel Phase (deg) contains an object of type line. This object represents g.

We note, among other things, that the high frequency behavior is quite uncertain. It may be advisable to limit the model range to frequencies lower than 1 rad/s.

Parametric Models of the Process Behavior

Let us do a quick check if we can pick up good dynamics by just computing a fourth order ARX model using the estimation data and simulate that model using the validation data. We know that the delay is about 12 seconds.

m1 = arx(ze,[4 4 12]); compare(zv,m1);

Figure contains an axes object. The axes object with ylabel Thickn contains 2 objects of type line. These objects represent Validation data (Thickn), m1: 11.14%.

A close view of simulation results:

compare(zv,m1,inf,'Samples',101:200)

Figure contains an axes object. The axes object with ylabel Thickn contains 2 objects of type line. These objects represent Validation data (Thickn), m1: 12.62%.

有明显的困难处理的高frequency components of the output. That, in conjunction with the long delay, suggests that we decimate the data by four (i.e. low-pass filter it and pick every fourth value):

ifexist('resample','file')==2% Use "resample" command for decimation if Signal Processing Toolbox(TM)% is available.zd = resample(detrend(glass),1,4,20);else% Otherwise, use the slower alternative - "idresamp"zd = idresamp(detrend(glass),4);endzde = zd (1:50 0);zdv = zd(501:尺寸(zd,'N'));

Let us find a good structure for the decimated data. First compute the impulse response:

Imp = impulseest(zde); showConfidence(impulseplot(Imp,200),3) axis([0 100 -0.05 0.01]) gridon

We again see that the delay is about 3 samples (which is consistent with what we saw above; 12 second delay with sample time of 4 seconds in zde). Let us now try estimating a default model, where the order is automatically picked by the estimator.

Mdefault = n4sid(zde); compare(zdv,Mdefault)

Figure contains an axes object. The axes object with ylabel Thickn contains 2 objects of type line. These objects represent Validation data (Thickn), Mdefault: 32.73%.

The estimator picked a 4th order model. It seems to provide a better fit than that for the undecimated data. Let us now systematically evaluate what model structure and orders we can use. First we look for the delay:

V = arxstruc(zde,zdv,struc(2,2,1:30)); nn = selstruc(V,0)

nn =1×32 2 3

ARXSTRUC also suggests a delay of 3 samples which is consistent with the observations from the impulse response. Therefore, we fix the delay to the vicinity of 3 and test several different orders with and around this delay:

V = arxstruc(zde,zdv,struc(1:5,1:5,nn(3)-1:nn(3)+1));

Now we callselstrucon the returned matrix in order to pick the most preferred model order (minimum loss function, which is shown in the first row of V).

nn = selstruc(V,0);%choose the "best" model order

SELSTRUC could be called with just one input to invoke an interactive mode of order selection (nn = selstruc(V)).

Let us compute and check the model for the "best" order returned in variablenn:

m2 = arx(zde,nn); compare(zdv,m2,inf,compareOptions('Samples',21:150));

Figure contains an axes object. The axes object with ylabel Thickn contains 2 objects of type line. These objects represent Validation data (Thickn), m2: 33.44%.

The modelm2is about same asMdefaultis fitting the data but uses lower orders.

Let us test the residuals:

resid(zdv,m2);

Figure contains 2 axes objects. Axes object 1 with title AutoCorr, ylabel e@Thickn contains 2 objects of type line. One or more of the lines displays its values using only markers This object represents m2. Axes object 2 with title XCorr (Speed) contains 2 objects of type line. One or more of the lines displays its values using only markers This object represents m2.

The residuals are inside the confidence interval region, indicating that the essential dynamics have been captured by the model. What does the pole-zero diagram tell us?

clf showConfidence(iopzplot(m2),3) axis([ -1.1898,1.3778,-1.5112,1.5688])

Figure contains an axes object. The axes object with title From: Speed To: Thickn contains 4 objects of type line. One or more of the lines displays its values using only markers This object represents m2.

From the pole-zero plot, there is am indication of pole-zero cancellations for several pairs. This is because their locations overlap, within the confidence regions. This shows that we should be able to do well with lower order models. Try a [1 1 3] ARX model:

m3 = arx(zde,[1 1 3])

m3 = Discrete-time ARX model: A(z)y(t) = B(z)u(t) + e(t) A(z) = 1 - 0.115 z^-1 B(z) = -0.1788 z^-3 Sample time: 4 seconds Parameterization: Polynomial orders: na=1 nb=1 nk=3 Number of free coefficients: 2 Use "polydata", "getpvec", "getcov" for parameters and their uncertainties. Status: Estimated using ARX on time domain data "zde". Fit to estimation data: 35.07% (prediction focus) FPE: 0.4437, MSE: 0.4384

Simulation of modelm3compared against the validation data shows:

compare(zdv,Mdefault,m2,m3)

Figure contains an axes object. The axes object with ylabel Thickn contains 4 objects of type line. These objects represent Validation data (Thickn), Mdefault: 32.73%, m2: 34.77%, m3: 33.04%.

The three models deliver comparable results. Similarly, we can compare the 5-step ahead prediction capability of the models:

compare(zdv,Mdefault,m2,m3,5)

Figure contains an axes object. The axes object with ylabel Thickn contains 4 objects of type line. These objects represent Validation data (Thickn), Mdefault: 31.91%, m2: 34.74%, m3: 33.04%.

As these plots indicate, a reduction in model order does not significantly reduce its effectiveness is predicting future values.