Daubert standardLegal test requiring judges to screen scientific evidence for reliability before allowing it in court, replacing simple general acceptance with multi-factor analysis. failures have allowed junk science to persist in American courtrooms for more than three decades. In 1993, the U.S. Supreme Court ruled in Daubert v. Merrell Dow Pharmaceuticals that federal judges must serve as “gatekeepers” for scientific evidence, screening out unreliable expert testimony before it reaches a jury[s]. The decision replaced a 70-year-old rule and promised a new era of scientific rigor in the courts. That promise has gone largely unfulfilled in criminal cases, where unreliable forensic methods continue to contribute to wrongful convictions at alarming rates.
Daubert Standard Failures Begin With the Gatekeepers
Before 1993, courts relied on the Frye standardLegal rule from 1923 requiring scientific evidence to be generally accepted in its field before admission in court, replaced by Daubert in 1993., established in a 1923 case involving a polygraph test. Under Frye, scientific evidence was admissible if it was “generally accepted” in its field. The Supreme Court’s Daubert decision replaced that single criterion with a more demanding five-factor test. Judges now had to evaluate whether a technique had been tested, whether it had been peer-reviewed, its known error rate, whether standards existed for its application, and whether it was widely accepted in the scientific community[s].
The problem is that most judges lack the scientific training to perform this evaluation. Research published in the journal Cureus found that about half of all judges have had no formal education in handling scientific evidence, yet 91% felt comfortable in their gatekeeper role[s]. That confidence gap between actual competence and self-assessment has allowed forensic methods with no scientific foundation to pass through the courthouse doors.
The Criminal Court Double Standard
Daubert standard failures are starkest when comparing how courts treat scientific evidence in civil versus criminal proceedings. In civil cases, judges routinely scrutinize the methodology behind expert claims and exclude testimony that falls short. In criminal cases, the same scrutiny rarely applies. Defendants in criminal trials face resource inequality, limited access to independent forensic experts, and judges who tend to admit forensic evidencePhysical evidence collected from a crime scene and analyzed scientifically to establish facts or reconstruct events; includes biological materials, trace evidence, and physical objects examined by forensic specialists. by relying on precedent rather than conducting rigorous inquiry[s].
The consequences of this double standard are measured in years of lost freedom and, in the worst cases, lives.
The FBI Hair Analysis Scandal
In April 2015, the FBI, the Department of Justice, the Innocence Project, and the National Association of Criminal Defense Lawyers released findings from a massive review of microscopic hair comparisonForensic technique of comparing hair samples under a microscope to link crime scene evidence to suspects, now known to lack scientific foundation. testimony. The results were staggering: in 268 cases where FBI examiners provided testimony used to inculpate a defendant at trial, erroneous statements appeared in 257 of them, a 96% error rate[s].
Twenty-six of 28 FBI analysts had provided flawed testimony or submitted lab reports with erroneous statements. The errors spanned cases in 41 states. Among the affected cases, at least 35 defendants received the death penalty, and errors were confirmed in 33 of those. Nine had already been executed[s].
Peter Neufeld, co-founder of the Innocence Project, called the FBI’s three-decade use of microscopic hair analysis “a complete disaster”[s]. The review itself was prompted by the 2009 National Academy of Sciences report, which noted that no scientifically accepted statistics existed for the frequency of particular hair characteristics in the population[s].
Bite Marks: Junk Science Still in Use
Among the most striking Daubert standard failures is the continued admissibility of bite-mark evidence. The practice rests on three claims: that human dentition is as unique as DNA, that skin can reliably record that uniqueness, and that forensic dentists can determine whose teeth made a given mark. None of these claims are supported by any research[s].
Steven Mark Chaney spent 28 years in a Texas prison after forensic dentists testified that a bite mark on a murder victim matched his teeth. One examiner called it a “one-in-a-million” match. Chaney had nine alibi witnesses. The Texas Court of Criminal Appeals eventually ruled that the bite-mark evidence “no longer proved anything,” and Chaney was declared actually innocent in 2018[s].
Bite-mark evidence has led to at least 35 wrongful convictions and indictments. The Texas Forensic Science Commission recommended a moratorium on its use unless the method could be demonstrated reliable. That demonstration has never materialized, yet some courts still admit it[s].
Arson Science and a Possible Wrongful Execution
Cameron Todd Willingham was executed in Texas in 2004 for allegedly setting a fire that killed his three young daughters. The conviction rested almost entirely on testimony from fire investigators who claimed to have found more than 20 indicators of arson. Five independent experts later reviewed the evidence and concluded that none of the scientific analysis used to convict him was valid[s].
In 2009, Dr. Craig Beyler, hired by the Texas Forensic Science Commission, found that “a finding of arson could not be sustained.” He described the key testimony from the fire marshal at Willingham’s trial as “hardly consistent with a scientific mind-set and more characteristic of mystics or psychics”[s]. Days before Beyler was set to present his findings, Governor Rick Perry replaced key members of the commission panel.
A Systemic Problem
These cases are not isolated incidents. Misapplied forensic science has contributed to more than half of all wrongful convictions identified by the Innocence Project and nearly a quarter of all wrongful conviction cases documented since 1989[s]. The affected disciplines range well beyond hair and bite marks, touching fingerprint analysis, arson investigation, bloodstain pattern analysis, shaken baby syndrome diagnosis, and dog scent evidence.
In each field, the pattern is similar: practitioners exaggerated the connection between crime scene evidence and a suspect, mischaracterized exculpatory results as inconclusive, or downplayed the limitations of their method[s].
The 2009 National Academy of Sciences report, Strengthening Forensic Science in the United States: A Path Forward, laid bare the scope of the problem. The committee found that most forensic methods, with the exception of DNA analysis, lacked consistent and repeatable scientific foundations. It called for Congress to establish an independent National Institute of Forensic Science to oversee standards and research[s]. Congress never created that body.
Why Reform Has Stalled
The persistence of Daubert standard failures stems from a structural conflict between science and law. Science advances by abandoning falsified hypotheses. Law advances by respecting precedent. Once a court admits a forensic technique, that ruling becomes the basis for admitting it again, creating a self-reinforcing cycle that can persist even after the underlying science has been discredited[s].
Chris Fabricant of the Innocence Project has argued that the United States should have something comparable to the FDA for forensic techniques: an independent body that evaluates methods before they are used to make decisions about liberty. “We do that with aspirin; we do it with toilet paper. We don’t do it with forensic sciences,” he told The Intercept[s].
Texas has been a rare bright spot. The state’s Forensic Science Commission and its “junk science writ,” Texas Statute 11.073, allow prisoners convicted on discredited scientific evidence to seek new trials. That statute led directly to Chaney’s exonerationThe official act of clearing someone of criminal charges, typically after new evidence proves their innocence. and has opened the door for others[s]. Most states have no equivalent mechanism.
The Daubert standard was a genuine improvement over the anything-goes era that preceded it. But the gap between the standard as written and the standard as applied in criminal courtrooms remains wide. Until courts treat forensic science with the same skepticism they bring to civil litigation, and until an independent body validates forensic methods before they reach the witness stand, Daubert standard failures will continue to cost innocent people their freedom.
Daubert standardLegal test requiring judges to screen scientific evidence for reliability before allowing it in court, replacing simple general acceptance with multi-factor analysis. failures represent one of the most consequential gaps in American evidentiary law. When the Supreme Court decided Daubert v. Merrell Dow Pharmaceuticals, Inc., 509 U.S. 579 (1993), it replaced the Frye “general acceptance” test with a multi-factor reliability inquiry and assigned trial judges the role of gatekeepers under Federal Rule of Evidence 702[s]. The Daubert trilogy, completed by General Electric Co. v. Joiner, 522 U.S. 136 (1997), and Kumho Tire Co. v. Carmichael, 526 U.S. 137 (1999), extended this gatekeeping function to all expert testimony, not just scientific evidence[s]. Three decades of application have revealed that this framework, while theoretically sound, has systematically failed in criminal proceedings.
The Daubert Framework and Its Five Factors
Justice Blackmun’s majority opinion articulated a flexible inquiry centered on five non-exclusive factors: whether the theory or technique can be and has been tested; whether it has been subjected to peer review and publication; its known or potential error rate; the existence and maintenance of standards controlling its operation; and whether it has attracted widespread acceptance within a relevant scientific community[s].
The Court emphasized that “the inquiry is a flexible one, and its focus must be solely on principles and methodology, not on the conclusions that they generate”[s]. This distinction between methodology and conclusions has proven difficult for judges without scientific training to operationalize. Research published in Cureus found that approximately half of all federal judges have had no formal education in evaluating scientific evidence, yet 91% reported feeling comfortable in their gatekeeper role[s].
Daubert Standard Failures in Criminal Versus Civil Litigation
The asymmetric application of the Daubert framework across civil and criminal proceedings constitutes the most significant of all Daubert standard failures. In civil litigation, Daubert motions are routinely filed and frequently granted, excluding expert testimony that fails to meet methodological standards. In criminal cases, defense counsel rarely file Daubert challenges, and when they do, courts overwhelmingly deny them. The structural explanations are well-documented: resource inequality between prosecution and defense, discovery rulesA legal doctrine holding that the statute of limitations on a lawsuit begins running only when the injured party discovers — or reasonably should have discovered — that harm occurred and who caused it. that disadvantage defendants, and judicial reluctance to exclude forensic evidencePhysical evidence collected from a crime scene and analyzed scientifically to establish facts or reconstruct events; includes biological materials, trace evidence, and physical objects examined by forensic specialists. that has been accepted for decades through precedent[s].
The practical effect is that Daubert operates as a rigorous screen in civil cases, where the primary risk is monetary liability, and a permissive rubber stamp in criminal cases, where liberty and life are at stake.
The Empirical Record: Hair, Bite Marks, and Arson
The FBI’s 2015 Microscopic Hair ComparisonForensic technique of comparing hair samples under a microscope to link crime scene evidence to suspects, now known to lack scientific foundation. Analysis Review provides the most comprehensive quantitative evidence of Daubert standard failures. The joint review by the DOJ, FBI, Innocence Project, and NACDL examined cases worked prior to 2000. Of 268 cases in which FBI examiners provided inculpating testimony at trial, 257 (96%) contained erroneous statements. Twenty-six of 28 FBI analyst-examiners provided flawed testimony or reports. Among 35 capital cases reviewed, errors appeared in 33. Nine defendants had already been executed[s].
The scope extends beyond the FBI laboratory. The Bureau trained several hundred state and local examiners through annual two-week courses that incorporated the same scientifically flawed language used by FBI analysts[s]. The NACDL noted that the NAS had found “no scientifically accepted statistics exist about the frequency with which particular characteristics of hair are distributed in the population”[s].
Bite-mark analysis exemplifies how scientifically illiterate case law propagates through the legal system. The foundational case, People v. Walter Marx (1975), admitted bite-mark evidence despite the trial judge conceding that the technique had not been tested through scientific research. The Marx court’s reasoning was subsequently adopted by state supreme courts nationwide and extended by analogy to other pattern-matching forensic disciplines, including firearms, tool marks, shoe prints, and tire treads[s].
The case of Steven Mark Chaney illustrates the downstream consequences. Convicted in 1987 on bite-mark testimony described as a “one-in-a-million” match, Chaney was exonerated in 2018 by the Texas Court of Criminal Appeals under Texas Statute 11.073, which allows post-conviction relief when scientific evidence used at trial has been discredited[s]. Bite-mark evidence has produced at least 35 documented wrongful convictions and indictments.
Arson investigation followed a similar trajectory. Cameron Todd Willingham was executed in 2004 based on fire investigation testimony that five independent experts subsequently determined was entirely invalid[s]. The Texas Forensic Science Commission’s independent expert, Dr. Craig Beyler, found that the testimony was “more characteristic of mystics or psychics” than scientific methodology[s].
The NAS Report and Its Unfulfilled Mandate
The 2009 National Academy of Sciences report, Strengthening Forensic Science in the United States: A Path Forward, documented the full extent of the problem. The committee found that the forensic science system had “serious problems” requiring a national overhaul and that most forensic methods, excluding DNA analysis, lacked consistent and repeatable scientific foundations[s]. Its central recommendation, the creation of an independent National Institute of Forensic Science, was never enacted by Congress.
Misapplied forensic science has contributed to more than half of wrongful convictions identified through the Innocence Project and nearly a quarter of all wrongful conviction cases since 1989[s]. The affected disciplines span microscopic hair analysis, bite marks, arson investigation, fingerprint analysis, bloodstain pattern analysis, and shaken baby syndrome diagnosis.
Structural Barriers to Reform
The persistence of Daubert standard failures reflects a fundamental tension between scientific epistemology and legal precedent. Scientific knowledge advances through continuous falsification and revision of hypotheses. Legal doctrine advances through stare decisisLegal doctrine requiring courts to follow precedents set by previous decisions, creating consistency but also perpetuating past errors., where prior judicial acceptance of a forensic method shields it from re-examination even after the scientific community has repudiated it[s].
Chris Fabricant of the Innocence Project has identified additional structural problems. Most forensic science disciplines were developed by and for law enforcement, not through independent scientific inquiry. Crime laboratories typically answer to prosecutors or police departments, creating institutional incentives that favor conviction over accuracy[s].
Texas offers one legislative model for addressing these Daubert standard failures. The state’s junk science writ (Texas Statute 11.073) provides a post-conviction mechanism for prisoners convicted on evidence that subsequent scientific developments have discredited. The Texas Forensic Science Commission, operating outside the adversarial process, has investigated individual cases and issued moratorium recommendations on discredited techniques[s]. Few other states have replicated either innovation.
The Daubert framework remains the most sophisticated judicial tool available for evaluating expert testimony. Its failures are not failures of design but of application: criminal courts have consistently declined to apply the same rigor that civil courts bring to the same standard. Addressing this gap requires legislative action to create independent forensic science oversight, statutory mechanisms for post-conviction review based on discredited evidence, and a cultural shift within the judiciary toward treating forensic evidence with appropriate scientific skepticism.



