A rose by any other name would smell as sweet…

is a quo­ta­tion by William Shake­speare from his play Romeo and Juliet meant to say that the names of things do not mat­ter, only what things are. In the play Romeo and Juliet, the line is said by Juliet in ref­er­ence to Romeo’s house, Mon­tague which would imply that his name means noth­ing and they should be together.

Does nomenclature really matter?

Does nomen­cla­ture really matter?

Why do we lawyers have prob­lems con­nect­ing and talk­ing with you scientists?

Why do you sci­en­tists seem so obtuse and need­lessly pedan­tic to us?

In part, it may be a nomen­cla­ture issue. I sug­gest that if we are going to try to open lab­o­ra­to­ries and make them be trans­par­ent and for lawyers and the judi­ciary to exam­ine their processes to ensure against unjust con­vic­tions, then we need to mind our nomen­cla­ture. I sug­gest that we use either the Inter­na­tional Con­fer­ence on Har­mo­niza­tion (ICH) and The Inter­na­tional Union of Pure and Applied Chem­istry (IUPAC) Gold Book definitions.

No place in foren­sic test­ing is this need to use the cor­rect nomen­cla­ture more impor­tant than when we dis­cus the valid­ity of a given method when it comes to any sort of test­ing. A small com­po­nent of valid­ity is metrol­ogy. We have dis­cussed metrol­ogy before here at the www.TheTruthAboutForensicScience.com blog as well as www.PADUIBlog.com as well.

The goal of all mea­sure­ment is to try to cap­ture the true value or the actual value of that which we are mea­sur­ing. How­ever, we can never, never do so. We can only attempt to design a method of mea­sure­ment where we have set up a process where we have deter­mined what level of risk we are will­ing to accept that we are wrong. Mea­sure­ment is the study of accept­able risk. What level of risk is accept­able that we could be wrong in our mea­sure? You see we are always wrong. It is a ques­tion of how much are we will­ing to risk that we are wrong and how wrong are we will­ing to be. Uncer­tainty Mea­sure­ment (UM), if prop­erly done, is the imper­fect embod­i­ment of the expres­sion of that risk.

You write as to “accu­racy.” Accu­racy (strictly in a ICH and IUPAC way) is a par­tic­u­lar type of assess­ment of a mea­sure­ment. Accu­racy is more prop­erly known as “bias.” Bias is the mea­sure of how closely the results are to the true value. It is char­ac­ter­ized by per­haps a high Stan­dard Devi­a­tion, but may or may not have a low aver­age devi­a­tion from the true (actual) value.

Then there is pre­ci­sion. Pre­ci­sion is an entirely dif­fer­ent type of ani­mal. They are inter-related and depen­dent vari­ables, but they are entirely dif­fer­ent con­cepts. Pre­ci­sion is more prop­erly known as “cal­i­bra­tion.” Pre­ci­sion is best defined as a mea­sure of how closely the results can be to one another. It is char­ac­ter­ized by a low Stan­dard Devi­a­tion, but may or may not have a high aver­age devi­a­tion from the true (actual) value. Pre­ci­sion is made up of repeata­bil­ity, inter­me­di­ate pre­ci­sion, and repro­ducibil­ity. Repeata­bil­ity is char­ac­ter­ized as the abil­ity day-in and day-out using the test, using the same method on the same instru­men­ta­tion on the same unknown arrives at the same result. Inter­me­di­ate pre­ci­sion is an expres­sion of with-in lab­o­ra­tory vari­a­tion: dif­fer­ent days, dif­fer­ent ana­lyst, etc. Repro­ducibil­ity is defined as the abil­ity of a test or exper­i­ment to be accu­rately repro­duced, or repli­cated, by some­one else work­ing inde­pen­dently. Pre­ci­sion should be inves­ti­gated using homo­ge­neous, authen­tic sam­ples over the long term.

There are three graph­i­cal rep­re­sen­ta­tions that best and most sim­ply show these concepts.

(This graphical representation is the best one to describe these intersecting and dependent features as to a single measuring event)

(This graph­i­cal rep­re­sen­ta­tion is the best one to describe these inter­sect­ing and depen­dent fea­tures as to a sin­gle mea­sur­ing event)

Multiple meaasure explanation of metrology

(This graph­i­cal rep­re­sen­ta­tion is the best one to show the inter­sect of these depen­dent fea­tures over mul­ti­ple mea­sures. As we can see from this depic­tion the goal of min­i­miz­ing risk of bias and cal­i­bra­tion error is a mov­ing tar­get. You adjust one and the other may suf­fer. It is also quite costly to min­i­mize both simul­ta­ne­ously. It is expo­nen­tially eas­ier and cheaper to cor­rect for cal­i­bra­tion error than bias error.)

(This graphical representation is from Ted Vosk’s presentation at the AAFS meeting. I am unsure as to where he got it. This graphical representation again looks at an individual measure and shows the difference between Type I error and Type II error. Type I error can be termed, by and large, as a function of bias; whereas, Type II error is, by and large, a function of calibration.)

(This graph­i­cal rep­re­sen­ta­tion is from Ted Vosk’s pre­sen­ta­tion at the AAFS meet­ing. I am unsure as to where he got it. This graph­i­cal rep­re­sen­ta­tion again looks at an indi­vid­ual mea­sure and shows the dif­fer­ence between Type I error and Type II error. Type I error can be termed, by and large, as a func­tion of bias; whereas, Type II error is, by and large, a func­tion of calibration.)

 

This post is inspired by a com­bi­na­tion of two events. First a com­ment by oppos­ing coun­sel, and sec­ond sev­eral com­ments by sev­eral “old school” foren­sic sci­ence prac­ti­tion­ers at this year’s Amer­i­can Acad­emy of Foren­sic Sci­ence (AAFS) annual meet­ing that I attended. In a con­tested hear­ing oppos­ing coun­sel argued that there was “no such thing as metrol­ogy.” He said this despite the fact that he heard from two metrol­o­gists about the well-established sci­ence, and its appli­ca­tion. The sec­ond set of com­ments came from many at the AAFS meet­ing when Ted Vosk, Esquire who is a friend and col­league of mine was pre­sent­ing at the meet­ing. He was lec­tur­ing at a part of a pre­sen­ta­tion on Uncer­tainty Mea­sure­ment (UM) report­ing in the foren­sic arena. In reac­tion to his words, some peo­ple com­mented that UM report­ing was a “waste of time” or a “use­less exer­cise.” One per­son com­mented that if it were to be done “where would I stop the fig­ur­ing of UM.”

I actu­ally think it is a sim­ple case.

Quite frankly, I don’t under­stand what all of the hub-bub is about in not report­ing UM.

What is the confidence of your measure? Are you a house cat who thinks he is a lion?
What is the con­fi­dence of your mea­sure? Are you a house cat who thinks he is a lion?

Whereas my good friend Ted Vosk, Esquire made a very good, very con­vinc­ing and very impas­sioned plea to the analyst’s sense of jus­tice and sci­ence, I am going to try to be more prac­ti­cal. I am going to make an appeal to your log­i­cal bias.

Here is my open address to all of those involved in foren­sic sci­ence (regard­less of whether you are employed by a pros­e­cu­tor or a defender) in terms of UM reporting:

An opinion letter to all those in forensic science laboratories today
An opin­ion let­ter to all those in foren­sic sci­ence lab­o­ra­to­ries today

Dear Foren­sic Scientist,

I know you are not the robot that you claim to be when you pre­form some form of sci­ence. I under­stand that you are a real life human being. As such, you have bias. And you know what? Here’s a dirty lit­tle secret: it is totally accept­able that you do. You can­not not have bias. You have no choice in the mat­ter. It is fail­ing to acknowl­edge your bias that is dan­ger­ous. If you acknowl­edge you have bias, then you can take steps to mit­i­gate it and try your best to not allow it inap­pro­pri­ately to influ­ence your process, your pro­ce­dure, your per­for­mance, your inter­pre­ta­tion, your opin­ion and your conclusion.

Your bias could be as extreme as that you want one side to win. Your bias could be that you want to defend your inter­pre­ta­tion or your opin­ion. Your bias could be that you want to defend your data. Your bias could be that you want to defend your pro­fes­sion. Your bias could be that you want to defend what you do or did.

To that end, I appeal to your bias with this. Logically.

1. You are not the finder of facts.

2. You are not sup­posed to be an advocate.

3. To do oth­er­wise, you are an edi­tor of facts.

4. When you present your mea­sure as an absolute value, the old adage of “A half truth+ A half truth= A full lie” applies.

But you still say “Why present UM at all or why should I present it unless it is near a crit­i­cal value?”

Well, it’s sim­ple. These days, crim­i­nal defense lawyers win by expos­ing the whole truth when you chose not to present the whole truth. When you present a mea­sure, whether it is a qual­i­ta­tive or quan­ti­ta­tive mea­sure, as an absolute and there­fore free of any sort of doubt or error, you know sci­en­tif­i­cally this is wholly wrong.

A half truth+ A half truth= A full lie.

My col­leagues are slowly learn­ing the whole sci­en­tific truth. When you show half truth, we show the whole truth truth. We show the truth in the lim­i­ta­tion of the assay per­formed, the truth about the lim­i­ta­tion of your knowl­edge and expe­ri­ence, and the truth that you made assump­tions or inter­pre­ta­tions or judg­ment calls along the way.

No mat­ter how much you try to jus­tify on re-direct this ini­tial lack of full dis­clo­sure of the whole truth, you will likely lose. Also, it fre­quently doesn’t mat­ter if on re-direct exam­i­na­tion if you have the UM ready to report. You have been exposed. There is doubt.

While the sim­ple truth is in some tightly con­trolled and truly val­i­dated meth­ods, the demand for hon­est and com­plete report­ing in the expanded UM in both the quan­ti­ta­tive mea­sure and the qual­i­ta­tive mea­sure (using accept­able metro­log­i­cally accept­able meth­ods such as the prop­a­ga­tion of errors method or Monte Carlo analy­sis) may actu­ally show that there is no pos­si­ble way that the value could be below the crit­i­cal mea­sure, in my view, I say “Good,” and “So be it.” If you can legit­i­mately and sta­tis­ti­cally prove (not just sim­ply a stated value) that it takes 6, 7, 8 or 100 sigma to get below the crit­i­cal value, then you have noth­ing to fear do you? But do you know you are in control?

If it is true, then that is what belongs in the court­room and noth­ing else.

To do oth­er­wise is a sci­en­tific sin (Vosk’s point) and will make you seem decep­tive because you know what? You are (my point).

True sci­ence is not your pri­vate parochial sand­box that you need to “pro­tect” us from, but rather it is for all of us to share in the joy of unbi­ased dis­cov­ery of the truth at the tem­ple of empircism.

With true sin­cer­ity as a true admirer of val­i­dated science,

Justin J. McShane, Esquire

 

Conclusion to the twelve part ISO 17025 introduction

Scan this QR mark into your cell phone to get bonus information on Lord Kelvin

Scan this QR mark into your cell phone to get bonus infor­ma­tion on Lord Kelvin

In a series of posts, I am going to intro­duce the reader to the exis­tence of ISO 17025 and its impor­tance.  I am going to intro­duce it in bite-sized bits for easy diges­tion.  Just like all mat­ters of learn­ing, knowl­edge is incre­men­tal over time and builds upon pre­vi­ous exposure.

So far we have answered the fol­low­ing questions:

In today’s post we seek to tie all of the other 11 posts together into some­thing mean­ing­ful to the Practitioner.

While it is impor­tant to note for the crim­i­nal law Prac­ti­tioner that ISO 17025 is com­ing to a lab­o­ra­tory near you, it pro­vides only a use­ful frame­work from which min­i­mum stan­dards of sci­en­tif­i­cally accept­able pol­icy, pro­ce­dure, and instruc­tions result in the over­ar­ch­ing Qual­ity Man­age­ment Sys­tem. These are min­i­mum safe­guards that are rec­og­nized per ISO and cer­tainly do not con­sti­tute or endorse a lab­o­ra­tory to pro­duce a foren­si­cally accept­able result at the end of its imple­men­ta­tion of the ISO 17025 qual­ity man­age­ment system.

(Con­sider this: Who wants a doc­tor who only meets “min­i­mum stan­dards” or a lawyer who only meets “min­i­mum standards.”)

Some of the most strik­ing inad­e­qua­cies of ISO 17025 and ASCLD/LAB’s inter­pre­ta­tion of it con­cerns the def­i­n­i­tion of “cus­tomer” as described in our ear­lier post as well as when and if  Uncer­tainty Mea­sure­ment (UM) should be reported as detailed as dis­cussed previously.

With the "customer' being interpreted as the proscuting authority UM will not likely be reported

With the “cus­tomer’ being inter­preted as the pros­e­cut­ing author­ity UM will not likely be reported

Another sys­temic short­com­ing of ISO 17025 and ASCLD/LAB’s inter­pre­ta­tion of it comes in the basics of true Bayesian-based expanded UM report­ing. While they address for the first time the need to be uni­form and con­sis­tent in the approach towards test­ing and cal­i­bra­tion among lab­o­ra­to­ries, within a lab­o­ra­tory, and even down to the ana­lyst, they focus exclu­sively on the quan­tifi­ca­tion of an unknown. It leaves untouched and unad­dressed, except indi­rectly in the method val­i­da­tion require­ments, the impor­tance of the qual­i­ta­tive mea­sure­ment and its valid­ity. Nowhere in ISO 17025 is the need to be selec­tive and spe­cific in a reported qual­i­ta­tive mea­sure­ment report directly addressed. This rush to be able to express UM in terms of quan­tifi­ca­tion seems to be plac­ing the cart before the prover­bial horse . If one focuses on the quan­ti­ta­tive mea­sure but first can­not be cer­tain that the method employed is one that results in a qual­i­ta­tive mea­sure that is both selec­tive and spe­cific to the exclu­sion of all other pos­si­ble meth­ods and con­clu­sions in terms of a qual­i­ta­tive result, then there can be no real value in the mea­sure itself. In other words, the key ques­tion of “Does the method exclu­sively and uniquely mea­sure what we need mea­sured?” remains unaddressed.

William Thom­son, 1st Baron Kelvin (26 June 1824 – 17 Decem­ber 1907) once wrote,“[When you can mea­sure what you are speak­ing about, and express it in num­bers, you know some­thing about it; but when you can­not mea­sure it, when you can­not express it in num­bers, your knowl­edge is of a mea­gre and unsat­is­fac­tory kind; it may be the begin­ning of knowl­edge, but you have scarcely in your thoughts advanced to the state of Sci­ence, what­ever the mat­ter may be.”

The days of ISO 17025 are shortly com­ing upon us. If the defense bar is prop­erly pre­pared, then we can pro­vide to the trier of fact and the cit­i­zen among us who has entrusted us with his lib­erty that final and all-important last check to the unfet­tered power of the great Leviathan that is the gov­ern­ment.  It is up to us, armed with knowl­edge, to defend the cit­i­zen among us who has been accused of a crime.  It is through our own igno­rance that we can insure that per­sonal tragedy and injus­tice results.  As Albert Ein­stein once penned “As far as the laws of math­e­mat­ics refer to real­ity, they are not cer­tain, as far as they are cer­tain, they do not refer to reality.”

 

Scan this QR mark into your phone to get bonus information on Control Charting

Scan this QR mark into your phone to get bonus infor­ma­tion on Con­trol Charting

In a series of posts, I am going to intro­duce the reader to the exis­tence of ISO 17025 and its impor­tance.  I am going to intro­duce it in bite-sized bits for easy diges­tion.  Just like all mat­ters of learn­ing, knowl­edge is incre­men­tal over time and builds upon pre­vi­ous exposure.

So far we have answered the fol­low­ing questions:

In this post we exam­ine how ASCLD/LAB Inter­na­tional con­flicts with ISO 17025 and hon­est sci­en­tific report­ing of Uncer­tainty Mea­sure­ment (UM) in foren­sic science.

Sec­tion 5.3 (Accom­mo­da­tions and envi­ron­men­tal con­di­tions) requires a full doc­u­men­ta­tion into the envi­ron­men­tal and other test­ing accom­mo­da­tions of the lab­o­ra­tory to iden­tify poten­tial sources of error and vari­ance to val­i­date that the ana­lyt­i­cal devices and the per­son­nel involved in the area where the ana­lyt­i­cal devices are used are suf­fi­ciently free from envi­ron­men­tally caused error and yields valid test­ing results suit­able for its intended pur­pose. Although it is up to the lab­o­ra­tory to estab­lish a pol­icy, pro­ce­dure, and instruc­tions to meet Sec­tion 5.3, these require­ments include a full account­ing into such hereto­fore pos­si­bly ignored sources of poten­tial error that includes, but is not lim­ited to, “bio­log­i­cal steril­ity, dust, elec­tro­mag­netic dis­tur­bances, radi­a­tion, humid­ity, elec­tri­cal sup­ply, tem­per­a­ture, and sound and vibra­tion lev­els.” It also requires a lab­o­ra­tory to mon­i­tor, con­trol and record these envi­ron­men­tal con­di­tions that may change within the lab­o­ra­tory and may influ­ence the qual­ity of the results.

Per­haps the most fun­da­men­tal change in the way ISO 17025 lab­o­ra­to­ries will be con­duct­ing their test­ing and cal­i­bra­tion ser­vices if they seek and obtain ISO 17025 accred­i­ta­tion comes with the imple­men­ta­tion of Sec­tion 5.4 (Test and cal­i­bra­tion meth­ods and method val­i­da­tion)[i].

Uncertainty Measurments in Forensic Science
Uncer­tainty Mea­sur­ments in Foren­sic Science

It is beyond the scope of this blog post to include the con­tri­bu­tions of Theodore “Ted” Vosk in this work con­cern­ing ISO 17025[ii]. I have also blogged on basic metrol­ogy and Uncer­tainty Mea­sure­ment (UM). They include:

How­ever, it is within this sub­sec­tion, Sec­tion 5.4.6, that we find the encap­su­la­tion of the need to report, under cer­tain cir­cum­stances, Uncer­tainty Mea­sure­ments (UM)[iii]. Specif­i­cally 5.10.3.1© reads that UM shall be included in a Test Report to the cus­tomer “when it is rel­e­vant to the valid­ity or appli­ca­tion of the test results, when a customer’s instruc­tion so requires, or when the uncer­tainty affects com­pli­ance to a spec­i­fi­ca­tion limit”.

A schematic of a basic definition of Uncertainty Measurment (UM)
A schematic of a basic def­i­n­i­tion of Uncer­tainty Mea­sur­ment (UM)

While a tremen­dous amount of empha­sis could be placed and should be placed on UM and whether or not the laboratory’s ver­sion of UM is fre­quen­tist or Bayesian in nature, Sec­tion 5.4 pro­vides addi­tional use­ful infor­ma­tion in the require­ment that the method to be employed must indeed be validated.

Although there is a com­mon mis­per­cep­tion among lawyers and even among lab­o­ra­tory man­agers that ISO 17025 pro­vides a method of val­i­da­tion spe­cific to the foren­sic sci­ence dis­ci­plines, this is not the case. Instead the require­ment of 5.4.5 is for the lab­o­ra­tory to have doc­u­men­ta­tion that includes specif­i­cally how it is deter­mined that a given method is to be applied and that its instruc­tions as well as its pro­ce­dures are in fact val­i­dated as pro­mul­gated and used.  This is to include lim­i­ta­tions on the assay to be performed.

This could be very use­ful for the crim­i­nal law prac­ti­tioner, for exam­ple, in the case of solid drug dose exam­i­na­tion and deter­mi­na­tion. It is pos­si­ble that within the laboratory’s own doc­u­ments there could be a dam­ag­ing admis­sion of its inabil­ity to deter­mine and dis­crim­i­nate between posi­tional iso­mers and chi­ral com­pounds[iv]. Dur­ing the val­i­da­tion process, although not specif­i­cally out­lined in ISO 17025, at a min­i­mum, the fol­low­ing should be addressed by the laboratory:

  • matrix effects,
  • sam­ple homogeneity,
  • speci­ficity,
  • demon­strated range of linearity,
  • pre­ci­sion,
  • inter­fer­ing substances,
  • sta­bil­ity of targets,
  • pop­u­la­tion dis­tri­b­u­tion, and
  • mea­sure­ment uncertainty.

It is accept­able per ISO 17025 to use reli­able, pub­lished, and com­mer­cially avail­able infor­ma­tion to estab­lish each para­me­ter so long as after the imple­men­ta­tion of the val­i­dated process, it is effec­tively mon­i­tored while it remains in place. If there is devi­a­tion from the reli­able, pub­lished, and com­mer­cially avail­able infor­ma­tion upon which the method relies, then it is required that the lab­o­ra­tory rec­og­nize that the pre­vi­ous method was pro­duc­ing inap­pro­pri­ate results and there­fore embark upon a new process of val­i­da­tion that will insure that the process employed and the method­ol­ogy is one that is indeed suit­able for its intended purpose.

Of addi­tional prac­ti­cal use to us is Sec­tion 5.6 and specif­i­cally Sec­tion 5.6.2.1.1. It holds that the lab­o­ra­to­ries, when they con­struct a cal­i­bra­tion curve or do other types of cal­i­bra­tion of the instru­ment that can con­tribute to the uncer­tainty, must prop­erly doc­u­ment the mea­sure­ment trace­abil­ity of those ref­er­ence stan­dards to the clas­sic mea­sure­ment item (i.e., the Inter­na­tional Sys­tem of Units (SI)) or in the case of items that can­not be strictly made in SI units such as in the case of drugs and DNA pro­files, these ref­er­ence mate­ri­als are to be trace­able to an appro­pri­ate mea­sure­ment standard.

There is a dis­tinc­tion between ref­er­ence stan­dards and ref­er­ence mate­ri­als as out­lined in Sec­tion 5.6.2. While one can­not cer­tify street cocaine (and hence would be a ref­er­ence mate­r­ial), one can cer­tify Cer­ti­fied Ref­er­ence Mate­ri­als (CRM’s or SRMs) (ref­er­ence stan­dards). ASCLD/LAB in its inter­pre­ta­tion and grant­ing of ISO 17025 accred­i­ta­tion takes Sec­tion 5.6 to an addi­tional safe­guard step in that it requires that what­ever cal­i­bra­tion ser­vice provider is used by a lab­o­ra­tory must be ISO 17025 accredited.

Another poten­tial source of uncer­tainty that is addressed by ISO 17025 is the dis­tinc­tion that is made regard­ing the equip­ment itself. Sec­tion 5.5 requires the lab­o­ra­tory to have a method to iden­tify and clas­sify its instru­ments that are used through­out the process. There is a dis­tinc­tion between class 1, class 2, and class 3 instru­ments that is impor­tant for the prac­ti­tioner to be aware of so as to be able to deter­mine whether or not the very best sci­en­tific process was employed and whether or not the best cal­i­bra­tion of the equip­ment was under­taken. Per ASCLD/LAB’s inter­pre­ta­tion of ISO 17025, Class 3 instru­ments are the only type of instru­ments whose cal­i­bra­tion ser­vice providers do not need to be ISO 17025 com­pli­ant per ASCLD/LAB. It is required of the lab­o­ra­to­ries to not only state that the cal­i­bra­tion ser­vice providers are ISO 17025 accred­ited but that they be able to prove through doc­u­men­ta­tion the com­pe­tence, the trace­abil­ity, and the mea­sur­ing abil­ity of the ser­vice provider espe­cially if it is not ISO 17025 accredited.

Per­haps the sin­gle biggest area of poten­tial uncer­tainty and one of the most use­ful to expose remains an unde­clared poten­tial source for erro­neous results. It is encap­su­lated and addressed in Sec­tion 5.7 (Sam­pling). In all ana­lyt­i­cal mea­sure­ments the ana­lyt­i­cal device very sel­dom weighs and/or mea­sures the entire sam­ple as it organ­i­cally exists[v]. There­fore, only a very small part of the whole, called an aliquot or ali­quant, is actu­ally tested by any ana­lyt­i­cal device.  As a result, it becomes cru­cial and vitally nec­es­sary for the lab­o­ra­tory to ensure homo­gene­ity in the aliquot or ali­quant tested. Right now, shock­ingly, most lab­o­ra­to­ries do not have a writ­ten pol­icy or pro­ce­dure or instruc­tions that addresses this.

Sampling versus sample selection
Sam­pling ver­sus sam­ple selection

It is this cru­cial dif­fer­ence between sam­pling ver­sus sam­ple selec­tion that needs to be exposed by all of us. In essence, what hap­pens in the lab­o­ra­tory when an aliquot is pre­pared is to exer­cise a mas­sive amount of truly sub­jec­tive dis­cre­tion by select­ing a “pinch of this” or a “sec­tion of that” from the whole unknown sam­ple sub­mit­ted for exam­i­na­tion. It is clear that by doing such, even with a pol­icy, pro­ce­dure and instruc­tion in place, mas­sive rep­re­sen­ta­tion errors with respect to non-colloidal mix­tures can occur. Sam­ple selec­tion in the case of trace evi­dence, for exam­ple, per ISO 17025 Sec­tion 5.7 would require a writ­ten pol­icy, pro­ce­dure, and instruc­tion that is uni­ver­sally enforced, imple­mented and mon­i­tored by the lab­o­ra­tory down to the tech­ni­cians at the bench as to which hairs or fibers out of many or what part of a stain to swath and exam­ine, for exam­ple. This is an exam­ple of sam­ple selec­tion. This is to be dis­tin­guished from sam­pling itself wherein there must be a writ­ten pol­icy, pro­ce­dure, and instruc­tions to make sure that homo­gene­ity of a sam­ple, in fact, exists. A fine exam­ple of this would be blood and blood alco­hol sam­pling. With­out assur­ance of homo­gene­ity in such a sam­ple ran­dom sam­pling error is intro­duced and inac­cu­rate results may be reported. Per ISO 17025 and ASCLD/LAB, there must be rig­or­ous train­ing as well as a plan and pro­ce­dure in place for sam­ple selec­tion as well as sam­pling. If one were to obtain the pol­icy, for exam­ple, of either sam­pling or sam­ple selec­tion, then there could and should be some very use­ful lan­guage con­tained that admits to this very fun­da­men­tal source of sub­jec­tive­ness and iden­ti­fies sam­pling and sam­ple selec­tion as a large poten­tial source of error.

From strictly a sci­en­tific aspect, per­haps one of the dis­ap­point­ments in the pro­mul­ga­tion of ISO 17025 is in Sec­tion 5.8. Sec­tion 5.8 con­cerns the proper han­dling of spec­i­mens, which in our appli­ca­tion is the seized evi­dence. In only a few words, it states gen­er­al­i­ties of how the evi­dence is to be han­dled and how the items are tested. There is very lit­tle guid­ance and require­ments as to this in the ISO 17025 doc­u­ment. There only needs to be a pro­ce­dure in place per ISO 17025. ASCLD/LAB has right­fully taken the posi­tion that this is a cru­cial part of the crime lab­o­ra­tory analy­sis and there­fore ded­i­cates an addi­tional two and a half pages of require­ments in its “Inter­na­tional” program.

In Sec­tion 5.9 we find that there must be a pro­ce­dure as to the assur­ing of the qual­ity of the reported results. While not directly offer­ing or even sug­gest­ing such a method, one pos­si­ble process and method­ol­ogy that could be used is prop­erly called “con­trol chart­ing.” Con­trol chart­ing is a graph­i­cal and empir­i­cal sta­tis­ti­cal tool used to detect exces­sive process vari­abil­ity to try to iden­tify spe­cific assign­a­ble causes that can be cor­rected. It serves to deter­mine whether a process is in a state of sta­tis­ti­cal con­trol; that is, the extent of vari­a­tion of the out­put of the process does not exceed that which is expected based on the nat­ural sta­tis­ti­cal vari­abil­ity of the process[vi].  Con­trol chart­ing is a great way to iden­tify the source of sta­tis­ti­cal out­liers where a machine can get pulled, an envi­ron­ment checked or an inap­pro­pri­ate oper­a­tor stopped or re-trained[vii].

labeling of a control chart
label­ing of a con­trol chart

In Sec­tion 5.10.1 we can find lan­guage that has a great pos­si­bil­ity of abuse.  Per Sec­tion 5.10.1:

The results of each test, cal­i­bra­tion, or series of tests or cal­i­bra­tions car­ried out by the lab­o­ra­tory shall be reported accu­rately, clearly, unam­bigu­ously and objec­tively, and in accor­dance with any spe­cific instruc­tions in the test or cal­i­bra­tion methods.

The results shall be reported, usu­ally in a test report or cal­i­bra­tion cer­tifi­cate, and shall include all the infor­ma­tion requested by the cus­tomer and nec­es­sary for the inter­pre­ta­tion of the test or cal­i­bra­tion results and all infor­ma­tion required by the method used.

The sim­ple fact that the even­tual Test Report that is issued to the “cus­tomer” must be one that is unam­bigu­ous con­notes to the author of this blog that 5.4.6 and 5.10.3.1© requires the report­ing of UM. ASLCD/LAB does not think so unless cer­tain cri­te­ria are met. To present any mea­sure as an absolute with­out acknowl­edg­ing UM, does pre­cisely that—makes the report­ing ambigu­ous. The coun­ter­ar­gu­ment that is offered by ASCLD/LAB and other lab­o­ra­tory man­agers is the oppo­site in that, if UM report­ing were placed into every report when­ever a mea­sure is made, it would con­fuse the trier of fact.  As such, they argue that a restric­tion on the raw data is war­ranted unless it is around a crit­i­cal mea­sure and that the unnec­es­sary report­ing of UM would invite ambi­gu­ity in the over­all result. In essence, ASCLD/LAB instead wants to be the judge of the facts, which is, of course imper­mis­si­ble as that role is specif­i­cally reserved to the trier of fact.

Why do forensic scientists get to ignore the facts and present their own version of it in the courtroom?
Why do foren­sic sci­en­tists get to ignore the facts as they sci­en­tif­i­cally exist and present their own ver­sion of it in the courtroom?

Regard­less of ASCLD/LAB’s inter­pre­ta­tion of this sec­tion, there is hope in that in Sec­tion 5.10.1 and in 5.10.4, we find lan­guage that states clearly that it is a require­ment that any infor­ma­tion not listed on the Test Report “shall be read­ily avail­able in the lab­o­ra­tory” and there­fore should be acces­si­ble to all.


[i] Typ­i­cal foren­sic appli­ca­tions and dis­ci­plines that involve mea­sure­ment sci­ence include: tox­i­col­ogy includ­ing a Blood Alco­hol Con­tent (BAC) test­ing; mass deter­mi­na­tion; drug purity; and dis­tance to muz­zle just to name a few. There are also other mea­sur­ing dis­ci­plines that report mea­sure­ment and there­fore fall under the new require­ments of ISO 17025 and include: trig­ger pull; bar­rel lengths; atomic absorp­tion spec­tropho­tom­e­try (AAS) and induc­tively cou­pled plasma-atomic emis­sion spec­trom­e­try (ICP-AES) analy­sis of gun­shot residue; refrac­tive index­ing; micro­scopic dimen­sional analy­sis; and DNA.

[ii] The lead­ing crim­i­nal defense attor­ney who has been suc­cess­fully lit­i­gat­ing the lack of UM report­ing in ana­lyt­i­cal mea­sure­ments when pre­sented in the court­room is Attor­ney Theodore “Ted” Vosk of Wash­ing­ton State.  Attor­ney Vosk has pub­lished on the con­cepts of UM.   Foren­sic Metrol­ogy: A Primer on Sci­en­tific Mea­sure­ment for Lawyers, Judges, and Foren­sic Sci­en­tists; Edited by Ted Vosk, Ash­ley F. Emery; http://www.taylorandfrancis.com/books/details/9781439826195/

[iii] Crit­i­cal quan­ti­ta­tive UM con­cerns will address the fol­low­ing concepts:

  • The iden­ti­fi­ca­tion and eval­u­a­tion of all sources of poten­tial error,
  • The iden­ti­fi­ca­tion of sig­nif­i­cance of iden­ti­cal uncer­tain­ties must be eval­u­ated in the uncer­tainty bud­get, and
  • The estab­lish­ing and close mon­i­tor­ing of results near crit­i­cal values

[iv] Iso­mers are com­pounds that pos­sess the same empir­i­cal for­mula, but are dif­fer­ent in struc­ture.  Cathine, which is not a con­trolled sub­stance, will likely be mis-identified as its diastere­omer, Phenyl­propanolamine, which is a con­trolled sub­stance, when solely a Gas Chro­mato­graph with Mass Spec­trom­e­ter (GC-MS) is uti­lized. While metham­phet­a­mine is a sched­ule II con­trolled sub­stance, the l-enantiomer of metham­phet­a­mine is found in the Vick’s Inhaler, which is a prod­uct exempted from con­trol.  Sim­i­larly, γ-Hydrox­y­bu­tyrate (GHB) and γ-Buty­ro­lac­tone (GBL) can be indis­tin­guish­able on some GC-MS. γ-Hydrox­y­bu­tyrate is a Sched­ule I con­trolled sub­stance that can­not be pos­sessed legally. γ-Buty­ro­lac­tone, on the other hand, may be pos­sessed and only becomes ille­gal if “intended for human consumption”.

[v] For exam­ple, in solid drug dose test­ing using Gas Chro­matog­ra­phy with Mass Spec­trom­e­try as the detec­tor, this issue becomes very patent.  In the lab, the tech­ni­cian starts with the whole sam­ple, then a small por­tion of the whole is removed that one hopes in rep­re­sen­ta­tive of the whole.  Next, a sol­vent is typ­i­cally applied such as methanol, ethanol or dichloromethane to deriv­a­tive the sam­ple that results in dilu­tion of the orig­i­nal item. An autosam­pler is employed that takes one micro­liter which is one mil­lionth of a liter of this deriva­tized or chem­i­cally altered and diluted sam­ple to inject it into the injec­tor port.  Typ­i­cally, a split injec­tor con­fig­u­ra­tion is used that results in a very, very small part of the micro­liter mak­ing it to the col­umn with the remain­der being vented out, not to be ana­lyzed. Of this very, very small amount that makes it onto the col­umn to be sep­a­rated into hope­fully unique ana­lytes, only 1% or 2% of this sep­a­rated mate­r­ial is ion­ized in the Mass Spec­trom­e­ter to be fur­ther frag­mented.  With the typ­i­cal scan level of between 40–400 times per sec­ond results that in a con­di­tion that in order for the ana­lyte to be detected at all a time frame of only 1/360th occurs where all of these con­di­tions can be met is to record a result at all.

[vi] Every process has some inher­ent vari­abil­ity due to ran­dom fac­tors over which there is no con­trol and which can­not be elim­i­nated economically.

[vii] As can be the case, a lab’s stated but unproven error of +/- x stated to xx%  of con­fi­dence (really a pre­dic­tive inter­val) can grossly under­state real­ity as later empir­i­cally estab­lished once con­trol chart­ing and other mea­sures per ISO 17025 are imple­mented. Once the con­trols and processes per ISO 17025 are in place, labs may be able to iden­tify sources of pro­found error.  Then after the imme­di­ate triage is com­pleted to iden­tify and end the source of the error, then train­ing and per­son­nel could be selec­tively fired or up-trained to reduce error and there­fore tight­en­ing the cov­er­age fac­tor thereby caus­ing it to be 8, 9, 10 or more sigma to get to the crit­i­cal measure.

 

Ok.  Any­one who knows me, has read this blog for a while or has seen me speak knows I love to use movies as trans­fer­able con­cepts to try to explain sup­pos­edly com­pli­cated sci­en­tific concepts.

What can the 1985 clas­sic movie “Fletch” teach us about method validation?

Just like when “Gordo” said in the above clip “Awww, come on guys, it’s so sim­ple. Maybe you need a refresher course. [leans arm on hot engine part] Hey! It’s all ball bear­ings nowadays..”,

We say to the lab peo­ple “Awww, come on guys, it’s so sim­ple. Maybe you need a refresher course… Hey! It’s all val­i­da­tion nowadays.”

Method val­i­da­tion has received con­sid­er­able atten­tion in the lit­er­a­ture and from indus­trial com­mit­tees and reg­u­la­tory agencies.

  • The U.S. FDA CGMP (1) request in sec­tion 211.165 (e) meth­ods to be val­i­dated: The accu­racy, sen­si­tiv­ity, speci­ficity, and repro­ducibil­ity of test meth­ods employed by the firm shall be estab­lished and doc­u­mented. Such val­i­da­tion and doc­u­men­ta­tion may be accom­plished in accor­dance with Sec. 211.194(a). These require­ments include a state­ment of each method used in test­ing the sam­ple to meet proper stan­dards of accu­racy and reli­a­bil­ity, as applied to the tested prod­uct. The U.S. FDA has also pro­posed an indus­try guid­ance for Ana­lyt­i­cal Pro­ce­dures and Meth­ods Val­i­da­tion (2).
  • ISO/IEC 17025 includes a chap­ter on the val­i­da­tion of meth­ods (3) with a list of nine val­i­da­tion para­me­ters. The ICH (4) has devel­oped a con­sen­sus text on the val­i­da­tion of ana­lyt­i­cal pro­ce­dures. The doc­u­ment includes def­i­n­i­tions for eight val­i­da­tion char­ac­ter­is­tics. ICH also devel­oped a guid­ance with detailed method­ol­ogy (5).
  • The U.S. EPA pre­pared a guid­ance for method’s devel­op­ment and val­i­da­tion for the Resource Con­ser­va­tion and Recov­ery Act (RCRA) (6). The AOAC, the EPA and other sci­en­tific orga­ni­za­tions pro­vide meth­ods that are val­i­dated through multi-laboratory studies.

Here is the whole point and my logic proof…

1.       I assume that there is a truly sci­en­tif­i­cally val­i­dated and robust method for the analy­sis under­taken in the foren­sic lab.  (To be sure of the use­ful­ness of their analy­sis, we want to exam­ine the lab’s val­i­da­tion stud­ies so as to not make this basic assump­tion.  Oth­er­wise, what’s the point…  With­out this basic prin­ci­ple sat­is­fied that there is a valid method, the lab should not be per­form­ing tests)

2.       Uni­ver­sally, if one fol­lows the val­i­dated method and only if one reli­giously fol­lows the val­i­dated method with no devi­a­tion, we get a valid result that is true (or more cor­rectly put as valid and as true a result as sci­en­tif­i­cally possible)

3.       The con­verse of that is not 100% true.  If we devi­ate from the val­i­dated method in some way or in any way, we MAY get a true result or we MAY not.  We don’t know with­out study­ing the impact of that devi­a­tion.  We do know that the devi­a­tion has caused the result to be not a val­i­dated result.  Not an invalid result, but rather a result that is not val­i­dated accord­ing to the valid method.

4.       As most of the val­i­da­tion stud­ies begin with “Take the filled 10mL grey tube top….”  This would con­note that the method was cre­ated using pre­cisely that a “filled 10mL grey tube top”.  It is the raw mate­r­ial, if you will.  It is like a con­di­tion precedent.

5.       If you do not have the con­di­tion prece­dent (i.e., the 10mL grey tube top), then you are miss­ing a part of the val­i­dated con­di­tions and you have by def­i­n­i­tion a devi­a­tion from the val­i­dated method.

6.       Ergo, you have a result that is not val­i­dated and may or may not be valid and may or may not be true.

To learn more about method val­i­da­tion, I would com­mend to you http://www.labcompliance.com/tutorial/methods/default.aspx

The tuto­r­ial is based upon the book Val­i­da­tion and Qual­i­fi­ca­tion in Ana­lyt­i­cal Lab­o­ra­to­ries, pub­lished by Infoma in 2007 which I highly rec­om­mend as the eas­i­est and best resource that I have come across that explains the impor­tance of method val­i­da­tion, the method of estab­lish­ing a val­i­dated method, the con­se­quences of method val­i­da­tion and the con­se­quences of devi­at­ing from the val­i­dated method.

DEFINITIONS USED IN THIS POST:

Invalid: proven to be not valid, not the same as “not val­i­dated” as “not val­i­dated” means that it may be valid or it may be invalid.

True: the value that char­ac­ter­izes a quan­tity per­fectly in the con­di­tions that exist when that quan­tity is con­sid­ered. It is an ideal value, which could be arrived at only if all causes of uncer­tainty (Type A error and Type B error) are eliminated.

Type A error: a method of eval­u­a­tion by sta­tis­ti­cal analy­sis of a series of observations.

Type B error: any­thing that is not Type A error; a method of eval­u­a­tion by any means other than sta­tis­ti­cal analy­sis of a series of observations.

Valid: doc­u­mented proof that the process under­taken is suit­able for its intended use and achieves the intended reported result cor­rectly and uniquely as free from pos­si­ble from issues of pre­ci­sion and bias as possible.

 

The Power of an assay-Bayes style

If you haven’t noticed a theme in these blog posts, it is really quite simple:

  1. Expos­ing lim­i­ta­tions of assays.
  2. Show­ing the inter­pre­tive parts of foren­sic science

I also admit that I love metrol­ogy as it should be applied to foren­sic sci­ence.

As I have writ­ten before:

I have blogged before on metrol­ogy and Uncer­tainty Mea­sure­ment (UM).  Even the mighty Inter­na­tional Stan­dard, ISO 17025, in Sec­tion 5.4.6 only applies the con­cept of UM to the quan­ti­ta­tive mea­sure and hence results not a true expres­sion of the Bayesian-based expanded uncer­tainty bud­get.  In order to do so, it must also must be an expres­sion of the uncer­tainty in the qual­i­ta­tive mea­sure as well to be truly metro­log­i­cally responsible.

In another post I asked and exam­ined the key ques­tion of:

What is the use of express­ing uncer­tainty in the quan­ti­ta­tive result if we are not as sci­en­tif­i­cally sure as pos­si­ble of what we in fact measuring?

The impor­tance of iden­ti­fy­ing and estab­lish­ing the Type I and Type II error in Bayesian-based metrol­ogy is that if done cor­rectly, it can result in dis­cov­er­ing the true “power” of the assay.

Type I and Type II errors are inversely related:  As one increases, the other decreases.  The Type I, or α (alpha), error rate is usu­ally set in advance by the researcher.  The Type II error rate for a given test is harder to know because it requires esti­mat­ing the dis­tri­b­u­tion of the alter­na­tive hypoth­e­sis, which is usu­ally unknown.

Power is the prob­a­bil­ity that a test will reject the null hypoth­e­sis when it is, in fact, false.  You can see from the below fig­ure, power is sim­ply 1 minus the Type II error rate (β).  High power is desir­able.  Like β, power can be dif­fi­cult to esti­mate accu­rately, but increas­ing the sam­ple size always increases power.

Power of the assay

Power of the assay

Boil­ing point.…

Assume that we were to use boil­ing point to dis­tin­guish between unknown sub­stances.  The boil­ing points are nor­mally dis­trib­uted with a mean of 480 degrees and a stan­dard devi­a­tion of 5 degrees, and the boil­ing points  of a coun­ter­feit sub­stance are nor­mally dis­trib­uted with a mean of 465 degrees  and a stan­dard devi­a­tion of 7 degrees. Assume also that 90% of the sub­stances are gen­uine, hence 10% are coun­ter­feit.
What is the prob­a­bil­ity that a ran­domly cho­sen gen­uine sub­stance has a boil­ing point more than 475 grains?
What is the prob­a­bil­ity that a ran­domly cho­sen coun­ter­feit sub­stance has a boil­ing point more than 475 grains?
What is the prob­a­bil­ity that a ran­domly cho­sen sub­stance has a boil­ing point more than 475 degrees  and is gen­uine?
What is the prob­a­bil­ity that a ran­domly cho­sen sub­stance has a boil­ing point more than 475 degrees  and is coun­ter­feit?
What is the prob­a­bil­ity that a ran­domly cho­sen sub­stance which has a boil­ing point more than 475 degrees is genuine?

These and more are the ques­tions that need to be answered in foren­sic sci­ence.  What is the power of your assay?  Stop report­ing ana­lyt­i­cal chem­istry as sin­gle num­ber and as an absolute!

 

As I have blogged before every assay has its lim­i­ta­tions.  There is no such thing as a per­fect test.

I have blogged before on metrol­ogy and Uncer­tainty Mea­sure­ment (UM).  Even the mighty Inter­na­tional Stan­dard, ISO 17025, in Sec­tion 5.4.6 only applies the con­cept of UM to the quan­ti­ta­tive mea­sure and to the quan­ti­ta­tive mea­sure alone.   Hence, this stan­dard results not in a true expres­sion of the Bayesian-based expanded uncer­tainty bud­get, but instead only part of it.  In order to do so, to report full Bayesian-based expanded uncer­tainty bud­get, it must also must be an expres­sion of the uncer­tainty in the qual­i­ta­tive mea­sure as well to be truly metro­log­i­cally responsible.

I pon­der openly here with all of you the fol­low­ing:  what is the use of express­ing uncer­tainty in the quan­ti­ta­tive result if we are not as sci­en­tif­i­cally sure as pos­si­ble of what we in fact measuring?

Can any of you answer that for me?

In forensic science, we put the cart before the horse
In foren­sic sci­ence, we put the cart before the horse

It is really the story of speci­ficity ver­sus selectivity.

The Inter­na­tional Union of Pure and Applied Chem­istry (IUPAC), which is the world author­ity on chem­i­cal nomen­cla­ture, ter­mi­nol­ogy, stan­dard­ized meth­ods for mea­sure­ment, atomic weights and other crit­i­cally eval­u­ated data and oth­ers have defined the dif­fer­ence between these often con­fused terms as follows:

A spe­cific reac­tion or test is one that occurs only with the sub­stance of inter­est, while a selec­tive reac­tion or test is one that can occur with other sub­stances but exhibits a degree of pref­er­ence for the sub­stance of inter­est.  Few reac­tions are spe­cific, but many “exhibit selectivity”.

Other com­mon def­i­n­i­tions include:

Selec­tiv­ity gives an indi­ca­tion of how strongly the result is affected by other com­po­nents in the sample.

and also

Selec­tiv­ity refers to the extent to which the method can be used to deter­mine par­tic­u­lar ana­lytes in mix­tures or matri­ces with­out inter­fer­ences from other com­po­nents of sim­i­lar behavior.

A selec­tive test may be not a spe­cific test due to cross-reactivity, inter­fer­ence, or codetermination.

There has been some remark­able research in try­ing to quan­tify and pre­cisely express the uncer­tainty in the qual­i­ta­tive measure.

Some attempts have been made in the lit­er­a­ture to quan­tify selec­tiv­ity (and even speci­ficity). These include:

  1. Mas­sart et al. dis­cussed both qual­i­ta­tive and quan­ti­ta­tive aspects of selec­tiv­ity and speci­ficity.  This approach involves quan­tifi­ca­tion of a sen­si­tiv­ity fac­tor matrix, K, involv­ing n sen­sor responses for m components.
  2. Otto and Wegschei­der com­pared dif­fer­ent pro­ce­dures to obtain fig­ures of merit for the judg­ment of the selec­tiv­ity of meth­ods for mul­ti­com­po­nent analysis.
  3. Lor­ber et al. have used the con­cept of net ana­lyte sig­nal and selec­tiv­ity defined in terms of loss of sig­nal due to spec­tral over­lap in mul­ti­vari­ate cal­i­bra­tions with some degree of success.

In the Court­room, where cer­tainty and rel­e­vant proof is demanded, we need to insist not just on Uncer­tainty Mea­sure­ment in terms of quan­ti­ta­tive mat­ters, but also in qual­i­ta­tive matters.

____

Hat tip to Josh Lee, Esquire for his con­tri­bu­tion to this blog post.

 

In our prior post, CRMs:  Ref­er­ence mate­ri­als and stan­dards, we saw how the library is impor­tant in ana­lyt­i­cal chem­istry.  It enables the ana­lyst to com­pare the unknown to a known to arrive at an opin­ion that is often expressed as a con­clu­sion.  We saw that there are many dif­fer­ent com­mer­cial sources for these libraries.  In fact, some sys­tems allow the ana­lyst to exer­cise his/her dis­cre­tion in employ­ing his/her own stan­dards as opposed to the com­mer­cially avail­able ones that are CRMs.

Elec­tron Ion­iza­tion (EI) based Mass Spec­trom­e­try has been so fan­tas­ti­cally removed from the base of the sci­ence, that it has been sim­pli­fied to accom­mo­date the masses.  In the mod­ern crime lab, it has been reduced to sim­ple computer-assisted pat­tern recog­ni­tion.

Sadly gone are the days of acid-based chem­istry to elu­ci­date mass spec­tral pat­terns.  Instead, just let the com­puter do it.  Com­put­ers are never, wrong are they?

This is what one of the NIST 08 library screen­shot looks like:

Science has been reduced to computer-assisted pattern recognition

Sci­ence has been reduced to computer-assisted pat­tern recognition

But what if it is not a trace­able source?  What if it is not a per­fect match mean­ing 100% prob­a­bil­ity?  As we can see from this screen­shot there is a list of probabilities.

What a minute!  You mean there is a possibility that it is not the analyte of interest?

What a minute! You mean there is a pos­si­bil­ity that it is not the ana­lyte of interest?

There is a judg­ment call that is made even in GC/MS.

At what probability percentage, does the analyst get to call it a specific compound to the exclusion of all others?

At what prob­a­bil­ity per­cent­age, does the ana­lyst get to call it a spe­cific com­pound to the exclu­sion of all others

Why are state sci­en­tists allowed to come in and present a con­clu­sion that is really an opin­ion that an unknown is def­i­nitely a known sub­stance?  Why aren’t court’s forc­ing the truth be pre­sented?  Why the cover up?

 

Before we had blogged on the impor­tance of metrol­ogy and uncer­tainty report­ing as well as advanc­ing (or rather inch­ing) towards ISO 17025.  Part of the integrity of any sort of test­ing of unknowns has to do with the verac­ity of the sources of knowns against which the unknowns are com­pared.  For if we do not have cer­tainty in our knowns and our ref­er­ences, then there can­not be any con­fi­dence in our con­clu­sions of the analy­sis of unknowns.

In a well-run foren­sic lab this need for cer­ti­fied and true stan­dards or knowns is solved by using some­thing that is called Cer­ti­fied Ref­er­ence Mate­ri­als (CRMs).  The Inter­na­tional Stan­dards Orga­ni­za­tion (ISO) has estab­lished guides to gov­ern CRMs.  These include:

  • ISO Guide 35: 1989 — Cer­ti­fi­ca­tion of ref­er­ence mate­ri­als — Gen­eral and sta­tis­ti­cal principles
  • ISO Guide 31: 2000 — Con­tents of cer­tifi­cates & labels of ref­er­ence materials
  • ISO Guide 33: 1989 — Uses of cer­ti­fied ref­er­ence materials
  • ISO Guide 34: 2000 — Gen­eral require­ments for the com­pe­tence of ref­er­ence mate­r­ial producers

NIST pro­vides a ser­vice that includes over 1300 Stan­dard Ref­er­ence Mate­ri­als ® or SRMs.  There is also the NIST 08 library which replaces the NIST 05 library.

NIST 08 is not just a mass spec­tral library. It con­tains these components:

  • (UPDATED) Elec­tron ion­iza­tion (EI) mass spec­tral library — 220,460 spec­tra of 192,108 unique com­pounds, with iden­ti­fi­ca­tions and usu­ally chem­i­cal struc­tures. You may search names of com­pounds online.
  • (UPDATED) MS/MS library — 14,802 spec­tra of 5,308 pre­cur­sor ions (3,898 cations and 1,410 anions).
  • (UPDATED) Gas Chro­matog­ra­phy (GC) data library — 224,038 Kovats reten­tion index val­ues for 21,847 com­pounds in the EI library, now on both polar and non-polar columns. Includes reten­tion indices with GC col­umn con­di­tions and lit­er­a­ture citations.
  • (UPDATED) NIST MS Search soft­ware — soft­ware for search­ing (iden­ti­fy­ing) com­pounds from their mass spec­tra and for brows­ing mass spec­tral libraries. Also includes MS inter­pre­ta­tion pro­grams for ana­lyz­ing mass spec­tra on the basis of chem­i­cal struc­ture, mol­e­c­u­lar for­mula, iso­topic pat­terns, and more.
  • (UPDATED) AMDIS soft­ware — soft­ware for decon­vo­lut­ing gas/liquid chromatograms
  • (UPDATED) Doc­u­men­ta­tion — Approx­i­mately 50 page printed and elec­tronic man­ual on setup and basic usage. Addi­tional infor­ma­tion is in the help files.

Libraries are for­mat­ted the binary for­mat suit­able for use alone or by the NIST MS Search soft­ware (and AMDIS). Addi­tional instrument-specific for­mats (e.g., Agi­lent Chem­Sta­tion) are avail­able sep­a­rately to per­mit library search­ing directly within the GC/MS or LC/MS data system.

NIST offers Standard Reference Materials

NIST offers Stan­dard Ref­er­ence Materials

Another source of CRMs are:

  1. COMAR, which has nearly 11,ooo CRMs from about 220 pro­duc­ers in 25 countries,
  2. Sigma Aldrich, which has well over 500 CRMs,
  3. AAFS Drug Library 2010 which con­tains 2700 pure drug spectra,
  4. Mass Spec­tral and GC Data of Drugs, Poi­sons, Pes­ti­cides, Pol­lu­tants and Their Metabo­lites, which is com­monly referred to as Pfleger 2007
 

While a lot has been made about the National Insti­tute of Stan­dards and Tech­nol­ogy (NIST) trace­able stan­dards and NIST’s own report­ing of uncer­tainty.  There is a large mis­take that has devel­oped even among well-credentialed sci­en­tists in that NIST-based and reported expres­sions of Uncer­tainty Mea­sure­ment (UM) are the be all and end all of the con­ver­sa­tion and con­sti­tutes Bayesian-based expanded uncer­tainty reporting.

It is not.

This mis­con­cep­tion is not unlike Plato’s alle­gory of the cave as writ­ten in The Repub­lic:

NIST traceable based uncertainty is only the beginning of the expanded Bayesian based metrology

NIST trace­able based uncer­tainty is only the begin­ning of the expanded Bayesian based metrology

In The Repub­lic the ancient Greek philoso­pher Plato uses the Alle­gory of the Cave to explain the lim­i­ta­tions of per­cep­tion ver­sus objec­tive real­ity.  In this con­struct he asks the reader to imag­ine a group of peo­ple who have lived chained in a cave all of their lives, fac­ing a blank wall with­out the abil­ity to turn around or change the direc­tion of their gaze.  Behind them a fire rages.  These pris­on­ers watch year in and year out the shad­ows pro­jected on the wall.  They see the flut­ter­ing images as time goes by as peo­ple and objects pass in front of a fire behind them.  Nat­u­rally, they are lim­ited in their world view and as such they begin to ascribe forms to these shad­ows and make cer­tain assump­tions about the world within which they live.  Some­times they are right, but most times they are wrong due to their extremely lim­ited abil­ity to directly view objec­tive real­ity.  Accord­ing to Plato, the shad­ows are as close as the pris­on­ers get to see and expe­ri­ence objec­tive real­ity. Later the pris­on­ers are released from their bondage to expe­ri­ence for the first time the world free of lim­i­ta­tions and the nar­row­ness of their world view.  Their ear­lier incor­rect per­cep­tions are revealed.  Plato then explains how the reader is like a pris­oner who is freed from the cave and comes to under­stand that the shad­ows on the wall are not the extent of real­ity at all, as the pris­oner and the reader now can per­ceive the true form of real­ity rather than the mere shad­ows seen by the prisoners.

As we exam­ined before together, metrol­ogy and the hon­est report­ing of expanded uncer­tainty is cru­cial in pre­sent­ing the truth and the lim­i­ta­tions of any ana­lyt­i­cal or even non-analytical measure.

Here is the truth straight from NIST:

NIST Tech­ni­cal Note 1297, Guide­lines for Eval­u­at­ing and Express­ing the Uncer­tainty of NIST Mea­sure­ment Results, 1994 Edition.

In addi­tion to the uncer­tainty indi­cated by NIST, other uncer­tain­ties are inher­ent in the instru­ment, asso­ci­ated with the method or pro­to­col in using the instru­ment, with the oper­a­tor of the instru­ment, and with the phys­i­cal envi­ron­ment (pres­sure, tem­per­a­ture, humid­ity, etc.) in which the mea­sure­ments are made. Thus, the mea­sure­ments made with the cal­i­brated instru­ments or arti­facts by orga­ni­za­tions out­side of NIST have total uncer­tainty bud­gets asso­ci­ated with them, only one com­po­nent of which is the uncer­tainty reported to them by NIST.