Validation and Verification of Analytical Methods
Validation and Verification of Analytical Methods
Recommendation of the Board of the Czech Society for Clinical Biochemistry. The recommendation was approved on 16th November 2004.
Authors: Friedeck B., prongl L., Kratochvla J.
CONTENT
1. Aim and purpose of recommendation......................................................................................................................... 2 2. The terms validation and verification (ISO 9000: 2000, Eurachem Guide. The Fitness for Purpose of Analytical Methods 1998) ........................................................................................................................................................................... 2 3. Why is validation/verification necessary? .................................................................................................................. 2 4. Who performs validations?......................................................................................................................................... 2 5. When is validation/verification performed? ............................................................................................................... 3 6. Extent and intensity of validation and verification ..................................................................................................... 4
Verification of qualitative tests performed with the use of IVD products with CE marking ............... 4 Verification of quantitative tests performed with the use of IVD MD products with CE marking...... 4 Validation plan of measurement methods of other quantitative tests................................................... 5
General validation and verification procedure............................................................................................................ 5 List of problems closely related to validation/verification.......................................................................................... 6 Experimental and statistical method of validation and verification ............................................................................ 6
Repeatability, reproducibility............................................................................................................... 6 Bias methods........................................................................................................................................ 7 Comparability of methods.................................................................................................................... 8 Linearity and working interval of measurement .................................................................................. 8 Limit of detection and limit of quantification .................................................................................... 10 Recovery, analytical specificity and interference............................................................................... 10
Validation of the measurement quality control programmes .................................................................................... 12 Summary of validation and verification principles................................................................................................... 12 Evidence of validation methods used before this publication................................................................................... 12 Terminology ............................................................................................................................................................. 13 References ................................................................................................................................................................ 14
Page 1 of 15
2. The terms validation and verification (ISO 9000: 2000, Eurachem Guide. The Fitness for Purpose of Analytical Methods 1998)
Validation is confirmation that the requirements for specifically intended use or specific applications were met through objective evidence. Verification means confirmation that the analytical characteristics data provided by a manufacturer, a laboratory or a reference institution were reached through objective evidence in the given laboratory with the use of a specific measuring system. Validation confirms that the measurement method/measuring system/IVD MD product (in vitro diagnostic - medical device) is capable of meeting the requirements set on it. In other words, it confirms that the level of measurement is sufficient, the measurement procedures are correct and the calibration was properly done. Verification means that the measurement method/measuring system/IVD MD product is fully functional in a specific laboratory. In this recommendation, the term "verification" will only be used in relation to the products marketed in accordance with the principles of the Directive IVD 98/79 EC. In other cases, the term "validation" will be used. Objective data mean the results of planned laboratory experiments on the one hand, and the data from manufacturing documents, research laboratories, and certification procedures on the other hand. Analytical characteristics that form the subject matter of validation/verification form the base of validation/verification plan.
Page 2 of 15
Manufacturers who market their products in the USA must also validate their products within the extent required by the document "Guidance for Industry: Bioanalytical Method Validation", elaborated by the following institutions: US Department of Health and Human Services, Food and Drug Administration,
Center for Drug Assessment. This document which was issued in 2001 is available on http://www.fda.gov/cder/guidance/index.htm Professional organizations of analytical chemists The aim of AOAC (the Association of Official Analytical Chemists) is to contribute to the worldwide credibility of analytical results (i.e. of analytical chemistry and analytical chemists as well). This association has developed a number of validation, verification and certification programmes and cooperates with such institutions like FDA, thus targeting directly laboratory medicine as well. Web site: www.aoac.org LGC- Laboratory of the Government Chemists (www.lgc.co.uk) Among others, it also operates VAM programme (Validated Analytical Measurement) available on http://www.vam.org.uk Laboratories However, final responsibility for adequate validation of measurements is borne by the laboratory itself. If validation data come from the manufacturers' documents and are supported by experiments in laboratories of manufacturers and professional organizations and in calibration/reference laboratories, then the laboratory must continuously complement and verify such data. Following means are available to reach this aim: continuous education of the staff; regular quality controls (intra- and inter-laboratory comparison); elaboration and revision of SOPs and their validation during which the main emphasis is placed on assessment of results of inter-laboratory comparisons and studies; continuous observation of documents and information provided by reference institutions and organizations, such as JCTML (Join Committee for Traceability in Laboratory Medicine), LGC, and AOAC; observance of standard operation procedures meeting the requirements of the Directive IVD 98/79 EC.
The trend of modern analytical measurement is clear: Validated kits and measuring systems should be used and laboratories should concentrate mainly on assessments of measurements uncertainty, quality control, education, continuous observation of information, and implementation of new findings in practical laboratory activity.
Page 3 of 15
Page 4 of 15
- the product must be used in a strict conformity with its intended use declared by the manufacturer; - the laboratory must observe the procedure specified by the manufacturer (it must not modify this procedure in any case and in any way); - the laboratory must interpret the results of these tests in accordance with the international recommendations (if available); - instead of extensive validation, the laboratory performs basic verification of such products' properties, using a simple verification plan. The aim of laboratory verification of IVD methods is to confirm that the achieved values of analytical and performance characteristics conform to the values declared by the manufacturer and that they can be achieved in the given laboratory under common conditions of routine operation. Thus, it is not the IVD product itself that is the subject matter of verification but the ability to implement the measurement procedure in a specific laboratory in a given time and space. A reliably estimated value of a measurement uncertainty should be the decisive result of verification. Verification plan of IVD MD products (minimum): A) Precision (repeatability, reproducibility) B) Recovery as quantification of bias value These characteristics are necessary to assess the measurement uncertainty. If needed, the laboratory can verify other analytical characteristics as well. If the IVD MD analytical system is modified in any way, we must perform the "cross" verification by means of which precision and recovery (and then the measurement uncertainty) of the measuring system before and after modification is assessed. We must prove that the modification did not cause any significant deterioration of these analytical characteristics. In addition, analytical characteristics of measurement, results of interlaboratory comparisons and external quality control must be documented to prove that the modification did not cause a breach of conformity. 6.3. Validation plan of measurement methods of other quantitative tests Methods developed by a laboratory or methods adopted from another laboratory (in house) belong to this category. In such a case, much more extensive and comprehensive validation plan must be used. A) Precision (repeatability, reproducibility) B) Recovery bias C) Linearity/Measuring interval D) Limit of detection/quantification E) Interference F) Comparison with another method
Page 5 of 15
All above given items must form an integrated, continuously solved and monitored complex in the laboratory .The solution of partial problems without a link to the others is irrelevant.
Eurachem Guide (The Fitness for Purpose 1998) mentions the possibility to calculate repeatability and reproducibility using 10 measurements. Statistical assessment Mean value Standard deviations of repeatability and reproducibility Variation coefficients of repeatability and reproducibility
Page 6 of 15
9.2. Bias methods Aim The estimation of systematic error of measurement by means of analysis of either reference or suitable control material. To perform such an estimation, the random error of measurement must be minimized by means of sufficient number of repeated measurements. Bias value: records the traceability of measurements in the laboratory; quantifies the systematic part of the combined uncertainty of measurement results Samples Matrix certified reference materials that have the values of analytes obtained by reference methods of measurement are the ideal samples for determination of the measurement bias. Such samples are not available for a routine laboratory. In practice, a laboratory compares its results with the results of other laboratories using the same control materials. Such a comparison can be performed by means of control materials used previously in programmes of external quality assessment (NCCLS EP9-A2, NCCLS EP15A, Eurachem Guide. The Fitness for Purpose of Analytical Methods, 1998, FDIS VIM 3ed., 2004). Such materials are measured under repeatability conditions. The bias value reached under the repeatability conditions is suitable for the assessment of uncertainty of measurement. Replicates, concentrations 10 measurements under repeatability conditions (in one series), 2 reference (control materials - CM) of two different concentrations. One concentration of CM within the reference interval and one concentration of CM above its upper limit or below its lower limit. Reference values According to the measurement traceability status and regarding the possible influencing of bias results by matrix effects, the following values are used: a) Reference values acquired by means of a reference method in an external reference/calibration/expert laboratory - RMP or AV. Use of these values is subject to the condition that the difference between them and the mean results of participants does not exceed the amount of extended uncertainty RMP/AV (k = 2). Thus the relation: |RMP (AV) - ALTM| Uref (k = 2) must be true b) The mean of all results of participants of interlaboratory comparisons (after the exclusion of outliers), i.e. results of ALTM type. These values are used if the above given relation between RMP/AV and ALTM is not true. c) The average of results of participants who use the same method of measurement. These values are used in cases depending on the used measurement method/measuring system, i.e. values of ConV type. In such a case, the bias can only be assessed if the given group consists of 10 participants at least. Procedure and calculation (Eurachem Guide. The Fitness for Purpose of Analytical Methods, Harmonised guidelines for the recovery information in analytical measurement): Ten sample replicates are measured (x1 to x10) under repeatability conditions. We get a set of 10 results xi (x1 to x10). Arithmetic mean AM is calculated using the formula: AM = ( xi ) / 10 Repeatability value SD and CV (%) is calculated. Bias value is calculated in form of recovery Rm . Rm = AM / or Rm (%) = (AM / ) . 100 where is the reference value (one of the above given possible values).
Page 7 of 15
9.3. Comparability of methods Aim Assessment of bias, type of systematic error and correlation by comparing a validated method with a comparative one. Reference measurement method is an ideal comparative method. In routine practice, it is not usually available. Therefore, a new routine method/procedure/analytical system is compared with another routine method/procedure/system which was validated previously. If POCT method/system is validated, we use routine laboratory methods as comparative ones. IVD-MD products The manufacturer is obliged to provide the laboratory with data about traceability and comparability of its method (ISO 17511, ISO 18153). The manufacturer must prove the comparability by a graph and numerical assessment of regression curve or by means of Bland-Altman diagram. This duty is stipulated in standards (EN 375, ISO 18112-1). Samples For in-house methods,the use of at least 40 samples of native biological materials from patients is recommended. The samples must be chosen carefully to cover uniformly the whole working measuring interval (J.O.Westgard: Basic Method Validation 2003). Careful choice of limits from the point of view of suitable concentrations is considered much more essential than increase of their number. Replicates Measurement of individual samples in duplicates is recommended. The experiment may be performed in one day , however, 5 days by 8 samples each are recommended. Results recording It is recommended to record the measured data in a table (such as Excel). The results of comparative methods are always recorded in the "x" column and results of validated method in the "y" column. Graphical data recording a) Bland-Altman diagram, where the "x" axis shows the means of both methods and the "y" axis shows differences of both methods (in % preferably). b) Regression graph, where the "x" axis shows localized value of comparative method. Use of Passing-Bablok nonparametrical regression is preferable to linear regression (OLR). Assessment of systematic difference between the validated method and the comparative one From the equation y = q + k . x the value of bias (b) is determined according to: b=y-x This procedure is only recommended when correlation coefficient r 0.975 is reached. It is Pearson's correlation coefficient where the values of the slope (k) and section (q) can be considered correct. It can be concluded from the graphical record of Bland-Altman diagram, the graphical record regression analysis, and from the value of the regression analysis slope whether the proportional part systematic measurement error is present or not. It can be concluded from the graphical record of Bland-Altman diagram, the graphical record regression analysis, and from the value of the regression analysis section whether the constant part systematic error is present or not. of of of of
9.4. Linearity and working interval of measurement Aim Checking the validity of linearity interval/working interval of measurement under the conditions of a specific laboratory. This interval defines the range within which the validity of precision and bias values declared by the manufacturer and checked by validation experiments can be safely expected.
Page 8 of 15
IVD-MD systems According to IVDD (98/79 EC) and in accordance with the standards EN 375 and ISO 18112-1, the manufacturer is obliged to provide the laboratory with validated data about linearity and working interval of measurement. In IVD-CE products intended for immunoanalytical measurements, working interval is characterized by data obtained by means of a dilution experiment and quantified by recovery values (R %). Material for linearity validation of in house methods Commercially available kits of reference materials intended directly for this purpose can be successfully used . Such kits can be used within the appropriate programmes of external quality assessment (such as "Linearity" programme, SEKK, in the Czech Republic). In case of analytes with no such material available, measurement by means of a kit of human sera is recommended for linearity checking. If verification of the working interval of measurement does not provide linear relation between the analytical signal and concentration, dilution experiment is used. Preparation of the set of samples for linearity validation (J.O.Westgard, Basic Method Validation, 2003) A pool of patients' samples (H) of extremely high analyte concentration is prepared. The concentration should be close to the upper limit of linearity/working interval declared by the manufacturer; however, it must not exceed it. A pool of patients' samples (L) of extremely low analyte concentration is prepared. The concentration should be close to the limit of quantification. If a "zero" calibrator is available, it is used as the "L" sample. By mixing the both pools, other three samples are prepared (see the table). Table of samples for validation of linearity/working interval: Sample 1 Sample 2 Sample 3 Sample 4 Sample 5 L 3L + 1H 2L + 2H 1L + 3H H
Precision and trueness of volumes measurement are the critical points of this experiment. Preparation of samples for dilution experiment Five (or more) samples with decreasing analyte concentrations are prepared from biological material whose concentration is close to the upper limit of the working interval. By means of choosing more samples with suitable initial concentrations, the whole working interval of measurement can be covered (or its clinically most important part at least). Example (measurement of TSH concentration in blood serum): Dilution 1 3 5 9 17 Measured value - mean 19.2 7.1 3.88 2.25 1.27 Theoretical value 20 6.7 4.0 2.2 1.18 R (%) 96 106 97 102 108
Replicates Each sample is measured in triplicate. The sequence of individual samples should be randomized.
Page 9 of 15
Assessment In linearity validation, the dependence of the means of measured values (y axis) on theoretically known values (x axis) is assessed. Linear regression is used for assessment. In case of validation by means of a dilution experiment, recoveries of individual samples (R %) are assessed according to the equation: R % = (result of measurement/theoretical value) . 100 Procedure recording Graph of dependence of measured values (y axis) on theoretical values (x axis); Slope, section and correlation coefficient Percentage of recovery (R %)
9.5. Limit of detection and limit of quantification Material Sample blank or "zero" calibrator. IVD-MD systems The manufacturer provides the user with the value of the detection limit. The value of quantification limit is not always provided. If the limits of quantification in IVD-MD products are available, the (confusing) term "functional sensitivity" is often used. Procedure of determination of the limit of detection (LoD) Ten blank samples are measured in a series. The value 3 SDblank is calculated. LoD value is calculated as concentration corresponding to 3 . SDblank (by means of routinely used calibration).
Procedure of determination of the limit of quantification (LoQ) The limit of quantification is calculated according to the equation LoQ = 3 . LoD
Reproducibility is determined in several samples (ten are recommended) by means of measurement in ten different days. The lowest reached value of sample concentration in which better than required reproducibility is reached, will be used as the limit of quantification. In clinical laboratories, the target reproducibility should be CV = 20 %. The limit of detection belongs to key analytical characteristics of qualitative methods using nominal and/or ordinal measuring scales. The limit of quantification belongs to key analytical characteristics of quantitative methods of measurement. It is sometimes wrongly named "functional sensitivity". 9.6. Recovery, analytical specificity and interference Aim Recovery of measurement obtained by adding the measured analyte to a sample of measured material quantifies the bias of measurement (see above). The values of measurement recovery obtained after adding an interferent to a sample of measured biological material provide information about analytical specificity of the validated method by means of assessment of presence or absence of a significant proportional systematic error of measurement. In separation methods, recovery of measurement forms the basic information about efficiency of the compound isolation from the sample matrix. In these cases, recovery is determined by adding a suitable amount of analyte in a biological material sample.
Page 10 of 15
Procedure and calculation of recovery The simplest procedure is to measure (certified) reference material or control material validated in EQA programme. Ten measurements are recommended under repeatability conditions and two reference (control) materials on two different clinically significant levels. The procedure and assessment of recovery R was described above in the section about bias measurement. If there is no commutable certified reference material available during the validation (which is quite common), addition of the analyte to the native patients' material is used instead of it. If the aim of the experiment is to assess the analytical specificity of measurement and the level of interference, then the method of the assessed interferent addition is used. Number of patients' samples In most cases, a few (approximately 5) samples suffice. Analyte addition It is recommended to add nonmatrix reference material, usually its water solution, by means of adding a small volume of it directly to the samples of biological material (such as patients' samples). The concentration in patients' samples and the concentration of the added analyte must be chosen so that the resulting concentration in measured samples was: within the linearity interval/working interval; within the range of clinical decision limits, if possible. Interferent addition It is carried out according to the same principles as those specified at recovery determination. Instead of the analyte, an interferent is added. These are the basic tested interferents: haemoglobin (added in form of erythrocytes haemolysate); bilirubin (added in form of a standard solution);
triacylglycerides (added in form of commercially available lipidic suspensions). Volume of the added analyte - interferent It must be as least as possible so as not to dilute the patients' samples matrix significantly (mixing of 9 parts of the patient's sample with one part of the added analyte sample is recommended). The base solution of the added analyte must be duly concentrated (tenfold addition in the example given above) and very high precision of volumes measurement must be ensured. Balances and volume-measuring instruments linked to reference must be used for substances weighing and volume measurement. Preparation of samples for recovery measurement For each patient's sample used for the experiment, a partial sample with the addition (sample A: 9 parts of the original sample + 1 part of the addition) and a partial sample with dilution (sample B: 9 parts of the original sample + 1 part of saline solution or zero calibrator) is prepared. Samples A and B are measured in duplicates. Calculation Concentration of the addition in the patient's samples is calculated according to the equation: Ar = concentration of the addition . volume of the addition / (volume of the addition + volume of the sample) Mean values of A and B samples duplicates are measured. Difference of A and B is calculated and the mean value of their duplicates Dr is determined. Recovery as ratio Dr/Ar is calculated. Percentage of recovery is calculated: R = (Dr/Ar) . 100 [%]
Interference In IVD-CE products, the basic interference data form a part of the manufacturer's working documents. Such basic data include information about haemoglobin, bilirubin, lipids and drugs interference, as well as data about cross reactions.
Page 11 of 15
Records The graph of dependence of theoretical concentration values on the measured values. Interferences occurrence can be detected well by the breach of linearity, i.e. by the change of the graph's slope which shows the presence of proportional systematic error caused by the interferent. Numerical values of recovery (R and R %).
Quality control must be considered a continuation of validation/verification procedures. The results of validation/verification experiments, especially precision, bias measurement, and criteria of suitability for intended use (control limits) are the data that the quality control programme is based on. The standard ISO/FDIS 15198: 2004 deals only with validation of programmes of internal quality control. Control materials for implementation of external assessment of quality are not a part of IVD MD directives.
Page 12 of 15
13. Terminology
The following terminology is based on the electronically published text "Metrological Terminology in an Analytical Laboratory", SEKK 2003. The terms are defined in a short version sufficient for the given purpose. The text of the mentioned CD-ROM is based mainly on normative texts of ISO 3534-1, ISO 57287-1 and VIM (3rd edition). Analytical interference The systematic error of measurement caused by the analytical interferent. A part of the sample interfering with the measured value. It need not be a Analytical interferent source of the measuring system signal but causes either increase or decrease of its indicated value. A sample that does not contain the analyte of interest or has a concentration at Blank sample least an order of magnitude less than the lowest level of interest. The difference of the quantity value obtained by measurement and the true Error of measurement value (reference value) of the measurand. A part of an error of discrete and variable character. The difference of the Random error quantity value obtained by measurement and the average that would ensue from an infinitive number of replicated measurements of the same measurand carried out under repeatability conditions. It can neither be influenced nor corrected mathematically; it can be both of positive and negative value. It causes a measurement results variability which can be characterized by precision of measurement quantified in form of standard deviation or variation coefficient on the basis of statistical analysis of a series of independent measurements. The influence of a random error on the measurement result can be decreased by an increase of number of measurements. The difference of average that would ensue from an infinite number of Systematic error replicated measurements of the same measurand carried out under repeatability conditions and the true value of the measurand. It is such a part of error which remains constant in a series of test (measurement) results, or which changes in a predictable way. European Directive (IVDD) 98/79 EC defining the conditions and IVD MD (In Vitro requirements necessary to market laboratory diagnostics and devices in EU Diagnostic Medical countries (transferred in legislation of individual EU countries in form of a Devices) law on 1st June 2000). The material used for the purposes of an operational quality control both for Control material internal quality control and external quality assessment. It undergoes the same measurement procedure (or its part) as analyzed samples aiming at effective monitoring of the analytical quality of measurement. Linearity of calibration The ability to provide results that are directly proportional to the relation ("linearity" in concentration; the range of content, amount or concentration values in which the analytical signal is a linear function of content, amount or concentration short) values. Calibration is an operation that establishes the relation, obtained by the reference to one or more measurement standards that exists under specified conditions between the indications of a measuring system and the measurement result that would be obtained using the measuring system The limit of detection of a specific analytical procedure is given by the lowest Limit of detection amount of an analyte in a sample that can be detected with stated probability, although perhaps not quantified as an exact value with desired uncertainty level. Limit of quantification The limit of quantification of a specific analytical method is the lowest amount of an analyte in a sample that can be quantitatively determined with (lower LoQ) stated acceptable precision and trueness with required uncertainty level. The set of activities aimed to determine the value of a quantity. The process Measurement of experimentally obtaining information about the magnitude of the quantity. The generic description of a logical sequence of generally described Measurement method
Page 13 of 15
Precision
Specificity (analytical)
Bias
Recovery
Validation Verification
operations - activities used in a measurement. The complex set of all constituent parts enabling performance of individual specified measurements. A detailed description of measurement activities according to one or more measurement principles and to a given measurement method. The closeness of agreement between the average that would ensue from an infinitive number of quantity values obtained under specified measurement conditions and the true value of the measurand. The closeness of agreement between quantity values (independent results of a test) obtained under specified conditions. These conditions include repeatability or reproducibility. The ability of a measurement procedure to determine only that measurand which is to be measured. The lower the interference is, the higher is the specificity. The difference of the mean quantity value obtained from a great number of measurements and the reference value (true value) of the measurand systematic error. The ratio of the measurement result to the certified - reference value. The indicator of the constant systematic error of a measurement or a systematic error caused by the interferent or an incomplete extraction of the analyte from the matrix. A confirmation through the examination of a given item and the provision of an objective evidence that it fulfils the requirements for a stated intended use. A confirmation through the examination of a given item and the provision of an objective evidence that it fulfils the specified requirements.
14. References
ISO 21748: 2003 Guide to the use of repeatability, reproducibility and trueness estimates in measurement uncertainty estimation NCCLS EP-15A. User demonstration of performance for precision and accuracy. 2001 Final CLIA rules. Part V. Method validation process and procedures. 2003. Dostupn na http://www.westgard.com NCCLS EP6-P2. Assessment of linearity in measurement. 2002 Plzk Z, Koruna I, Friedeck B, Kratochvla J.: Metrologick terminologie v analytick laboratoi. CD ROM publikace SEKK Pardubice 2003 The fitness for purpose of analytical methods. Eurachem 1998. Dostupn na www.eurachem.ul.pt Vyhlka 345 MPO ze dne 11.7.2002 Sb. 345/2002, str. 7280. stka 125 Westgard JO a spol. Basic Method Validation. Westgard QC, Madison, 2003 NCCLS EP9-A2 Method comparison and bias estimation using patient samples. 2002 Guidance for industry. Bioanalytical method validation. U.S. department of Health and Human Services, Food and Drug Administration, Center for Drug Assessment and research, 2001 ISO /FDIS 18112-1. Clinical laboratory testing and in vitro diagnostic test systems-In vitro diagnostic medical devices- Information supplied by the manufacturer. Part 1: In vitro diagnostic reagents for professional use. 2004. SN EN 375. Informace pedvan vrobcem s diagnostickmi inidly in vitro pro odborn pouit. 2001 EN 13612. Performance assessment in vitro diagnostics medical devices. 2002 SN EN ISO 13485. Systmy jakosti - Zdravotnick prostedky. Zvltn poadavky na pouvn ISO 9001. 2000 ISO/FDIS 15198. Clinical laboratory medicine. In vitro diagnostic medical devices. Validation of user quality control procedures by the manufacturer. 2004.
Page 14 of 15
IUPAC. Harmonised guidelines for the use of recovery information in analytical measurement. Pure and Applied Chem, 1999, 71, 337-348. VAM Bulletin. An LGC publication. Focus on method validation. Autumn 2003,17-22. International Vocabulary of basic and general terms in metrology (VIM). Final draft, April 2004.
Page 15 of 15