Chemometrics

Calibration allows the user to relate instrumental measurements to the sample of interest. Multivariate calibration allows for the analysis of several measurements from several samples or specimens. This compares to univariate calibration, which involves the use of a single instrumental measurement to determine a single analyte. Either method may contribute to the two-step procedure where 1) data is calibrated and 2) predictions based on the calibration are made.

In calibration, indirect measurements are made from samples where the amount of the analyte has been pre-determined, usually by an independent assay or technique. These measurements, along with the pre-determined analyte levels, comprise a group known as the calibration set. This set is used to develop a model that relates the amount of sample to the measurements by the instrument. In some cases, the construction of the model is simple due to a certain relationship, such as Beer's Law in the application of UV spectroscopy. Unlike spectroscopy, other cases can be much more complex, and it is in these cases where construction of the model is the time-consuming step. Once the model is constructed, it can predict analyte levels based on measurements of new samples.

Another advantage of multivariate calibration (other than abiltiy to include multiple measurements and samples) is that it can be used to separate samples from interferences without the need of highly selective measurements for the analyte (Thomas, 1994) . In the case of HPLC, certain overlapping or anomalous peaks can be systematically separated or deleted from the data set based on certain linear combinations of measurements derived from one of several multivariate calibration techniques.

As alluded to earlier, the multivariate calibration set contains multiple measurements from multiple sources of samples and pre-determined analyte amounts. Next would be the prediction step for new sample levels, and this uses a model that provides the basis for the evaluation of a linear combination of the measurements (Equation1). Calibration techniques (used in the calibration step) differ in determining coefficient values for the preceding equation (or a similar equations). Three of these methods to be discussed are multiple linear regression, principle components regression, and partial least squares.

Multiple linear regression (MLR): As stated earlier, models constructed from spectroscopy are relatively simple due to linear combinations of the instrumental measurements which makes the model a correlation-based model. Models for a broader range of conditions (i.e., measurements from several wavelengths) have been constructed in order to separate overlapping peaks elicited from the analyte plus other unknown components or conditions. These methods for separating "outliers" are based upon the following equation:

xi = b0 + b1 * yi1 + b2 * yi2 + ... + bq * yiq + ei

where xi = analyte level of the i th specimen, yij = j th instrumental measurement with the i th specimen, b = model parameters, and ei = error associated with yi.

Next, using Equation1 above, the analyte levels of new specimens can be predicted when estimated bj is substituted for aj.

MLR does not require knowledge of the amount of interference or other samples in the calibration set. But the maximum amount of measurements (q) used in this method is restricted to approximately 2 to 10 in most cases; therefore, selecting an appropriate set of instrumental measurements is paramount (Thomas, 1994). For example, in critical care environments, use of Vis-Near-IR spectroscopy noninvasively monitors oxygen saturation in arterial blood (Mendelson, 1983). One wavelength in the Vis spectrum monitors blood pulsatile volume and oxygen saturation; the other in the Near-IR measures pulsatile blood volume only. Since the interference in this case is the blood volume, the information from the two wavelengths can be used to filter out the interference and provide a measurement of the oxygen content.

Principle Components Regression (PCR) and Partial Least Squares (PLS): These methods belong to a class of statistical methods known as continuum regression. Models for contiuum regression techniques are described by:

xi = b0 + b1 * ti1 + b2 * ti2 + ... + bh * tiq + ei

where tik is the k th score affiliated with the i th specimen. Every score has a linear combination of the instrumental measurements, such that

where gammakj are coefficients. The calibration set data determines these, along with the estimated model parameters (b) and the metaparameter, h, which is the model size and is usually much smaller than number of measurements, q (Thomas, 1994). The prediction step follows where estimated x and estimated b are correlated in the following equation:

x = b0 + b1 * t1 + b2 * t2 + ... + bh * th

where

The prediction step equation is similar to Equation1, where estimated x is simply a linear combination of the measurements affiliated with the new specimen.

The difference between PCR and PLS is the coefficients and how they are obtained: in PCR, only the calibration set measurements are used to acquire the coefficients, while in PLS, the measurements and the analyte values are used (Thomas, 1994) . For information on how to calculate and obtain these coefficients, detailed references are provided (Martens, 1989; Hoskuldsson, 1988).

Briefly, in is important to mention that before methods such as PCR or PLS are utilized, data pretreatment is usually necessary. The most prevalent and easiest type of transformation is centering the data. This method is executed for both the analyte values and the instrumental measurements. It tends to make subsequent calculations less prone to round-off and overflow problems, and it can reduce the size of the model by one factor (Thomas, 1994). Another data manipulation method is where the instrumental measurements are differentially weighted. Each centered measurement's importance is determined by a certain measurement-dependent weight. The centered and/or weighted data is then used to construct the calibration model. More information on these techniques is available in the "Thomas, 1994" article below.

Thomas, E.V. Analytical Chemistry, 1994, Vol. 66, pp. 797.

Thomas, p. 799.

Mendelson, Y. Ph.D. Dissertation, Case Western University, Cleveland, OH, 1983.

Thomas, p. 799.

Thomas, p. 800.

Martens, H.; Naes, T.; Multivariate Calibration; Wiley: Chichester, England, 1989.

Hoskuldsson, A. J. Chemom., 1988, Vol. 2, pp. 211-228.

Thomas, p. 800.

Return to automation page.

Return to Photodiode detector page.

Return to Users Guide Page

Exit to the University of Kentucky College of Pharmacy