When Normal Science fails: a clinical anomaly that opens the problem
When Normal Science fails: a clinical anomaly that opens the problem
Abstract:
This chapter closes the Normal Science phase by examining a central diagnostic vulnerability in Orofacial Pain (OP) and Temporomandibular Disorders (TMD): the risk of mistaking a “TMD-like” clinical presentation for systemic or neurological conditions capable of mimicking it, especially in early stages. Starting from a 5-year prospective clinical follow-up, we introduce the conceptual foundation of the Ψ Index as a diagnostic paradigm oriented toward managing uncertainty, in which the outcome depends not only on “which test” is performed, but also on when and in what order information is acquired.
In a first phase, Bayesian updating is applied to a diagnostic pathway based on RDC/TMD, yielding a high conditional probability of TMD when the RDC output is positive within the symptomatic cohort. However, long-term follow-up reveals that many symptomatic patients initially classified as TMD were in fact carriers of severe non-TMD conditions. This produces an “interference-like” distortion of diagnostic certainty: a quantitative drop from 81% to 9.56% in a contextual model, used here to formalize the epistemological transition from deterministic commutativity to clinical non-commutativity (order effects).
The chapter concludes by arguing that deterministic diagnostic formalism is insufficient in multifactorial conditions, and that future clinical research should explore contextual and quantum-like probabilistic frameworks as tools to model order effects, diagnostic interference, and clinically high-impact uncertainty.
Introduction
We reach the end of the Normal Science section (in Kuhnian terms: a mature phase of “puzzle-solving” under a dominant paradigm), in which the current status quo in OP and TMDs has been presented together with a set of unresolved diagnostic vulnerabilities. These vulnerabilities are not yet “anomalies” in the strict Kuhnian sense, but they are critical points that systematically threaten diagnostic safety and timeliness.
Chronic OP conditions are difficult to diagnose and treat because the mechanisms of etiology and pathogenesis remain only partially understood.[1][2] A frequent feature of OP is its multifactorial nature: pain related to TMDs often overlaps with symptoms and signs that may also be early manifestations of neurological or systemic diseases, making differential diagnosis complex.[3][4][5][6]
In patients with functional disorders of the stomatognathic system, peripheral risk factors (including occlusal variables) have been reported,[7][8][9][10][11][12] as well as central biopsychosocial risk factors involving CNS dysfunction.[13][14] However, these frameworks have neither resolved causal mechanisms nor generated a decisive improvement in clinical management; consequently, a wide range of predominantly conservative treatments persists.[15][16][17][18][19]
A relevant obstacle in OP/TMD differential diagnosis remains the instability of standardized criteria to define TMD subtypes, historically addressed through the development of the Research Diagnostic Criteria (RDC). For an overview of RDC, see the dedicated chapter.
The prospective study presented here constitutes the bridge toward the Ψ Index paradigm. The cohort was evaluated longitudinally in three clinically distinct phases:
- — preliminary diagnostic phase
- — advanced diagnostic phase (after follow-up and expert re-evaluation)
- — definitive diagnostic closure (protocol modeling: Index , read “ket Psi Index”)
Since the first step is anchored to Bayesian updating, we recall the minimal formalism.
Bayes' Theorem (minimal recall)
Bayes' theorem updates the probability of a hypothesis in light of evidence:
where:
- — hypothesis (e.g., “TMD present”)
- — evidence (e.g., “RDC output positive”)
and:
Important clinical note: in diagnostic practice, probabilities are often computed on a selected cohort (e.g., symptomatic patients entering a pathway), not on the general population. For this reason, the prevalence must be defined with respect to the cohort under analysis.
Ψ Index at time (Bayesian certainty in the symptomatic cohort)
At time , the RDC model is applied to a sample of 40 subjects (30 asymptomatic, 10 symptomatic). Within the symptomatic cohort, 9 subjects are classified as TMD-positive according to RDC, whereas 1 symptomatic subject is classified as non-TMD according to RDC (Table 1).
To avoid a misleading “population prevalence”, we state explicitly:
- Cohort = symptomatic patients entering the diagnostic pathway (n=10)
- = “TMD (as classified by the RDC model at time )”
- = “positive RDC test output”
From the cohort:
- Sensitivity (observational/assumed for the RDC pathway in this cohort):
- Specificity (observed/assumed in this dataset): , hence
Then:
and:
Therefore, in the symptomatic cohort and under these conditions, a positive RDC output produces an apparently “certain” TMD classification.
This is where the epistemological risk emerges: a probabilistic certainty can be produced within a model even when the clinical ground truth (etiology and systemic risk) is not aligned with the model’s hypothetical space.
For example, clinically complex scenarios may mimic OP/TMD patterns and confound classification.[20]
For this reason, symptomatic subjects were re-evaluated by a multidisciplinary team and followed longitudinally. The advanced phase at time changes the diagnostic landscape.
Ψ Index at time (order effects and diagnostic non-commutativity)
Follow-up and expert re-evaluation reveal that many symptomatic subjects were carriers of severe non-TMD pathologies (e.g., meningiomas, pineal cavernomas, scleroderma, hemimasticatory spasm). These conditions were already discussed in the Normal Science chapters, but their occurrence within an “RDC-TMD positive” cohort constitutes genuine epistemological pressure on the paradigm.
Classical diagnostic thinking often assumes commutativity: variables and tests can be permuted without changing meaning. Clinically, however, time creates an order: lesion evolution, symptom emergence, and test sensitivity are time-dependent. Consequently, the order in which information is acquired can modify the diagnostic outcome.
Paradigmatic example (prudent): temporal and sequential dependence
A minimal clinical example makes the point intuitive without forcing conclusions: given the same reported symptomatology (bruxism/orofacial pain), the clinical reading can change substantially when time and the order of acquisition of informational blocks change.
- In 2000: a standard dental pathway (RDC-type criteria and tests oriented to the TMD context) produces a locally coherent classification: “TMD/bruxism”.
- In 2014: a neurophysiological work-up, performed at a more advanced phase and with greater discriminative power for some trigeminal conditions, produces data compatible with a different etiological scenario and forces a differential re-evaluation.
Here, the example is not used to claim that a single test “diagnoses” a specific pathology, but to introduce a clinically crucial fact: the diagnostic outcome may depend not only on which information is available, but also on when and in which sequence it is acquired.
Operationally, we can describe the pathway as a sequential application of two blocks:
- = standard dental block (local TMD/RDC reading)
- = neurophysiological/systemic block (higher-level, differential reading)
and explicitly admit that between and there is a time :
When the system is complex and has overlapping phases, it is not guaranteed at all that inverting the order is neutral:
The full formalization of this point (diagnostic non-commutativity as outcome dependence on order and time) is developed in the dedicated chapter: Non-commutative variables in clinical practice.
Why an index of state is needed
The results presented indicate that the diagnostic outcome cannot be adequately represented as a single scalar numerical value. What matters, in complex clinical systems, is not only the intensity of an index, but the overall configuration of the system’s state: modulus, phase, informational orientation, and dependence on the observational path.
A purely scalar model allows data to be ordered along a single dimension (for example “more” or “less” pathological), but it cannot distinguish clinically different states that share the same numerical value. In such cases, equality of value does not imply equivalence of state.
It is in this context that the need emerges for a mesoscopic state index of a vector nature, capable of representing the clinical system as a configuration rather than as a simple binary classification healthy/sick. Such an index describes not only “how much” a phenomenon is present, but “how” it is organized within the observed clinical system. (Figures 1,2)
That said, before introducing a vector representation of the clinical state, it is appropriate to examine the operational limits that appear within a classical statistical model consolidated in scientific practice: Bayes' Theorem.
Quantum-like contextual formalization (toy model) — reading guide
Methodological note: the following matrices are a toy model introduced to make the order effect explicit. They are not direct empirical estimates: the coefficients are didactic parameters chosen to make dependence on informational sequence visible.
1) States and state vector
For simplicity we consider two clinical states:
- = “healthy / absence of evidence of trigeminal hyperexcitability”
- = “pathological / evidence of trigeminal hyperexcitability”
We represent the patient’s “state” as a vector:
where and are weights (here treated as probabilities/credence) and satisfy .
At baseline we assume:
(i.e., initially the system is considered “healthy”).
2) How to read a test-matrix
Each test is represented by a matrix that updates the state vector:
Reading convention:
- the columns represent the “input” state (before the test),
- the rows represent the “output” state (after the test).
Thus the coefficient indicates how much the test shifts weight from to .
3) EMG operator (low early discrimination)
We assume an EMG with poor discriminative power in early phases:
Applied to baseline:
Reading: after EMG the system remains “predominantly healthy” (), with a small share on ().
4) rcIMR operator (high discrimination)
We assume an rcIMR test (recovery cycle of the Masseter Inhibitory Reflex) that is highly discriminative:
Applied to the post-EMG state:
Reading: after rcIMR the weight shifts toward ().
This makes the order effect visible: the EMG→rcIMR sequence produces a different outcome than the case in which rcIMR is performed as the first test.
5) Scenario 2: rcIMR as the first test
If rcIMR is performed immediately:
Clinical interpretation: early use of a more discriminative test can make the pathological state emerge sooner, reducing diagnostic delay.
Classical probability vs quantum-like probability (contextual interference)
Classical probability (Kolmogorov, 1933)[21] describes events as sets and assigns to each event a numerical “weight” between 0 and 1.
1) Additivity: what “disjoint events” means
Two events are disjoint when they cannot occur together (they do not overlap). Simple example: in a single draw, “3 occurs” and “5 occurs” are disjoint.
In this case the additive rule holds:
- means “ occurs or occurs”.
- Therefore: if and cannot coexist, the probability of “either one or the other” is the sum of the two.
2) The total probability formula: decomposing a complex problem into simple cases
Often we want to compute the probability of an event (for example: “test positive”). But may occur for different reasons: it depends on whether the patient belongs to different classes .
The classical total probability formula says:
Step-by-step reading:
- is a variable that separates the possible cases (e.g., clinical classes).
- indicates a specific case (a specific class).
- is how often that case occurs (its frequency/probability).
- is the probability that we observe if we are in the case .
- The sum means: “I put together all possible cases”.
In other words: → we compute as a weighted average of the various scenarios .
Classical clinical example: how incidence changes the Bayesian result
Consider a deliberately simple example, drawn from daily clinical practice in TMDs.
Suppose:
- a general population in which the prevalence of TMDs is about 30%
(a figure compatible with many epidemiological estimates).
- a diagnostic test (e.g., RDC-type criteria) with:
* sensitivity
* specificity
Where:
- = “the subject has a TMD”
- = “the test is positive”
Case 1: general population (prevalence 30%)
Let:
Compute the probability of a positive test:
Now the probability that a subject with a positive test truly has a TMD is:
Case 2: low-prevalence population (prevalence 9%)
Suppose we apply the same test in a population where TMDs are relatively rare (for example, an unselected general population), with prevalence 9%.
Let:
Compute the probability of a positive test:
Now compute the probability that a subject with a positive test truly has a TMD:
Immediate clinical reading
The test is identical to the previous case. Sensitivity and specificity do not change.
Yet:
- with prevalence 30% →
- with prevalence 9% →
A “good” test can therefore yield a **clinically weak result** when applied in a low-incidence context.
Simplified clinical reading
The test is **the same**. Sensitivity and specificity do **not** change.
Yet:
- in the general population (example reported in the table on 40 subjects) a positive test means “79% probability of TMD”;
- in the selected population (same example but on 10 symptomatic subjects) it means “93% probability of TMD”.
This shows a key point: diagnostic probability is not a property of the test, but of the **context in which the test is applied**. This means that: for the same test, a positive result does not have an absolute diagnostic meaning, but depends on the population in which the test is applied.
In a low-prevalence TMD population, a positive test provides limited certainty;
in an already selected population, the same result produces much greater certainty.
Already in the classical Bayesian model, therefore, the numerical result depends on the population, selection, and observational history.
Beyond Bayes: when classical probability is not enough
At this point a structural limit of classical Bayesian reasoning emerges, and it is important to make it explicit.
Suppose we are faced with a patient who presents a painful symptomatology fully compatible with a TMD and who, within the standard dental pathway, yields a positive output on the available diagnostic tests. In a classical Bayesian framework, this information tends to progressively increase the posterior probability of the “TMD” hypothesis, especially when the patient belongs to an already selected population.
However, there is a clinically crucial possibility: that the patient does present an odontofacial symptomatology compatible with a TMD, but the true etiology of the pain is different and more severe, for example a neurological pathology such as a pineal cavernoma.
In standard Bayesian formalism, this eventuality can only be handled by expanding the hypothesis space and recomputing probabilities over alternative classes. But in doing so, one implicitly assumes that the different etiological hypotheses contribute independently and additively to the final probability.
The problem is that, in these clinical scenarios, hypotheses are not simply alternatives: they **interact**, compete to explain the same clinical signs, and can produce symptomatic overlaps such that a false sense of diagnostic certainty is generated.
In other words, the presence of a severe non-TMD pathology does not merely add a new hypothesis, but can **reduce** the reliability of the TMD interpretation, even in the presence of positive tests. This effect cannot be represented as a simple subtraction or redistribution of probabilities in the classical model.
This is where the idea of a corrective term is introduced: a contribution that does not sum linearly, but can **attenuate** (or, in other contexts, amplify) the resulting classical probability.
This term, which in the formal language is called “interference”, does not imply any physical transposition of quantum mechanics into medicine. It represents instead a mathematically controlled way to describe the effect of context, etiological competition, and informational order on the diagnostic result.
On these bases it becomes possible to introduce a quantum-like probabilistic formalism, conceived not to replace Bayes, but to extend it in cases where classical probability, even when correctly applied, fails to represent the true instability of the clinical state.
This phenomenon is not a mathematical artifact, but a compact description of a real clinical experience.
3) Why introduce a “quantum-like” term
In complex systems (as in clinical medicine), the outcome may depend on context and on the order in which information is acquired. When this happens, the simple “sum of cases” may not describe well what we observe in practice.
To represent this contextual dependence, a quantum-like formalism introduces a corrective term, called “interference-like”. Attention: here “interference” does not mean there is quantum physics in the patient, but that informational contributions from different cases can combine in a non-purely additive way.
4) The quantum-like formula: same base + a correction
The formula is written as:
Naive explanation of the additional part
- means: “I do not sum one case at a time, but I consider pairs of cases”.
(i.e., how two possible classes interact when the context is ambiguous).
- The square root is used to build a term that is “symmetric” between the two contributions.
It is a standard way to combine two probabilistic weights without privileging either one.
- is a parameter that modulates the correction:
'if it is positive' the correction increases the final value;
'if it is negative' the correction reduces it;
'if it is zero' the correction vanishes and we return to the classical model.
- The factor is part of the canonical form of the interferential term: it does not change the conceptual meaning, it only serves to maintain the symmetry of the construction.
In summary:
- Part 1 is the “standard” total probability.
- Part 2 adds (or subtracts) a contribution that represents the contextual effect: overlap, ambiguity, informational order, competition between classes.
This is why the model is called “quantum-like”: not because it applies quantum physics to tissues, but because it uses a mathematical structure capable of representing non-additive effects typical of complex systems.
Data from follow-up at time (contextual reinterpretation)
From the follow-up point of view, the symptomatic cohort (n=10) contains:
- subjects initially classified as TMD by RDC at time : 9
- subjects subsequently recognized as carriers of severe non-TMD pathologies in the same symptomatic cohort: 7
This indicates that “RDC positive” does not necessarily map onto a “real etiological class” in medically complex OP patients. In other words: the target space of the paradigm is incomplete, and this incompleteness generates a systematic distortion.
We define (for the contextual calculation):
- probability that a subject in the symptomatic cohort belongs to the “RDC-TMD” class at time .
- probability of obtaining a positive RDC output given that the subject belongs to the RDC-TMD class (sensitivity of the RDC pathway in the cohort considered).
- probability that, following follow-up and expert re-evaluation, a subject in the same symptomatic cohort belongs to a severe non-TMD class (neurological or systemic pathologies with TMD mimicry).
- illustrative parameter encoding the diagnostic discrepancy between the two competing etiological readings ( and ).
In this context does not represent a physical angle measured on the patient, but a formal indicator of the degree of conflict between two clinical explanations that produce a similar symptom pattern.
- The value corresponds to the didactic choice of , used to represent a “strong but not maximal” negative interference case, in which the presence of a severe non-TMD etiology substantially reduces the diagnostic certainty obtained in the classical model.
True-positive term (classical)
Corrective “interference-like” term (contextual)
Contextual effectiveness
Interpretation: the RDC pathway can produce a high internal certainty for “TMD classification” while not protecting the patient with respect to severe non-TMD etiologies. The collapse from 81% to 9.56% formalizes the paradigm gap: when the diagnostic space is incomplete, certainty becomes fragile and dependent on the order of information.
A counterintuitive result (but clinically crucial)
The result obtained may appear paradoxical and even unsettling: from an internal Bayesian probability of 81% for TMD classification (within the symptomatic cohort), the contextual model yields an effective probability of about 9.56%.
It is essential to clarify that this “collapse” does not indicate a mathematical error, nor a failure of the RDC test per se. On the contrary, it reveals an epistemological fragility: a high probability can be perfectly coherent **within an incomplete model** and yet be clinically misleading.
In the classical Bayesian model, certainty grows because the “TMD” hypothesis is updated within an etiological space that does not adequately account for alternative pathologies capable of mimicking the symptom profile. The posterior probability correctly answers the question posed by the model, but not necessarily the clinically relevant question.
The introduction of the interference term does not “penalize” TMD: it makes explicit the fact that the same evidence can be shared by competing etiological explanations and that this competition reduces decision safety. In this sense, contextual probability no longer measures the internal coherence of a classification, but its reliability as a clinical guide in the presence of severe alternatives.
The value 9.56% should therefore not be read as a new prevalence estimate, but as an indicator of diagnostic instability: a quantitative signal warning that the decision system is operating in a high-risk overlap region, where numerical certainty does not coincide with clinical safety.
So, what are we really saying?
At this point the reader might legitimately ask: we have shown calculations, probabilities, interferences, matrices, and impressive numerical collapses — but what is the real clinical point of all this?
The point is not to prove that the classical formalism “gets it wrong”, nor to introduce a more sophisticated language for the sake of complexity. The point is simpler and more uncomfortable:The absolute value of an index, even when statistically grounded, is not sufficient to describe the system’s state. The order in which information is acquired matters; the time separating tests matters; the presence of competing etiologies matters. In these scenarios, diagnosis is not a snapshot, but a path-dependent process.there are clinical contexts in which high numerical certainty does not coincide with a certain diagnosis
This raises a set of questions that can no longer be avoided:
- does it make sense to rely on a scalar numerical value when the clinical state evolves and reorganizes over time?
- what does “diagnostic certainty” mean if the outcome depends on the order of tests?
- how can we distinguish a correct but late diagnosis from a truly early diagnosis?
- is it possible to represent the clinical state as something richer than a number: a vector, a phase, a configuration?
The Index arises from these questions, not as a definitive answer, but as an operational attempt to give quantitative form to diagnostic instability. Not to replace existing criteria, but to signal when the clinical system is operating in a dangerous overlap region, where the absolute value ceases to be informative and a concept of state becomes necessary.
If the reader has reached this point with a sense of discomfort — the perception that “something does not add up” despite reassuring numbers — then the problem has been posed correctly. The subsequent sections of the Ψ Index will only explore this discomfort systematically
These chapters are not theoretical ornamentation, but the necessary path to formalize the construction of the Index as a clinical tool: collaborative, dynamic, and deliberately not reducible to a single scalar metric.
- ↑ NASEM Temporomandibular Disorders: Priorities for Research and Care The National Academies Press, Washington, DC (2020), 10.17226/25652
- ↑ B.J. Sessle. Chronic orofacial pain: models, mechanisms, and genetic and related environmental influences. Int J Mol Sci, 22 (2021), 7112, 10.3390/ijms22137112
- ↑ Sollecito T.P., Richardson R.M., Quinn P.D., Cohen G.S.: Intracranial schwannoma as atypical facial pain. Oral Surg Oral Med Oral Pathol. 1993;76:153-6.
- ↑ Shankland W.E.: Trigeminal neuralgia: typical or atypical? Cranio. 1993;11:108-12.
- ↑ Graff-Radford S.B., Solberg W.K.: Is atypical odontalgia a psychological problem? Oral Surg Oral Med Oral Pathol. 1993;75:579-82.
- ↑ Ruelle A., Datti R., Andrioli G.: Cerebellopontine angle osteoma causing trigeminal neuralgia: case report. Neurosurgery. 1994;35:1135-7.
- ↑ B.C. Cooper. Temporomandibular disorders: a position paper of the ICCMO. Cranio, 29 (2011), 237-244, 10.1179/crn.2011.034
- ↑ M.S. Nguyen et al. Occlusal support and temporomandibular disorders among elderly Vietnamese. Int J Prosthodont, 30 (2017), 465-470, 10.11607/ijp.5216
- ↑ M.S. Nguyen et al. The impact of occlusal support on temporomandibular disorders: a literature review. Proc Singap Healthc, 31 (2021), 1-12, 10.1177/2010105821102
- ↑ T.R. Walton, D.M. Layton. Mediotrusive occlusal contacts: best evidence consensus statement. J Prosthodont, 30(S1) (2021), 43-51, 10.1111/jopr.13328
- ↑ A. Kucukguven et al. Temporomandibular joint innervation: anatomical study and clinical implications. Ann Anat, 240 (2022), 151882, 10.1016/j.aanat.2021.151882
- ↑ E. Tervahauta et al. Asymmetries and midline shift in relation to TMD in a Finnish adult population. Acta Odontol Scand, 80 (2022), 1-11, 10.1080/00016357.2022.2036364
- ↑ R.B. Fillingim et al. Potential psychosocial risk factors for chronic TMD: OPPERA domains. J Pain, 12 (2011), T46-T60, 10.1016/j.jpain.2011.08.007
- ↑ G.D. Slade et al. Painful temporomandibular disorder: decade of discovery from OPPERA studies. J Dent Res, 95 (2016), 1084-1092, 10.1177/0022034516653743
- ↑ P. Svensson, F. Exposto. Commentary: overlaps among chronic pain conditions, but no news about causal relationships. J Oral Facial Pain Headache, 34(Suppl) (2020), s6-s8, 10.11607/ofph.2020.suppl.c2
- ↑ C.S. Stohler, G.A. Zarb. On the management of TMD: a low-tech, high-prudence approach. J Orofac Pain, 13 (1999), 255-261.
- ↑ J. Feng et al. Treatment modalities of masticatory muscle pain: network meta-analysis. Medicine (Baltimore), 98 (2019), e17934, 10.1097/MD.0000000000017934
- ↑ Z. Al-Ani. Occlusion and TMD: a long-standing controversy. Prim Dent J, 9 (2020), 43-48, 10.1177/2050168420911029
- ↑ C. Penlington et al. Psychological therapies for TMDs. Cochrane Database Syst Rev, Issue 8 (2022), CD013515, 10.1002/14651858.CD013515.pub2
- ↑ Martina K. Shephard & Gary Heir. Orofacial Pain in the Medically Complex Patient. Contemporary Oral Medicine, 26 January 2019.
- ↑ Kolmogorov A.N. Grundbegriffe der Wahrscheinlichkeitsrechnung. Springer-Verlag, Berlin (1933).

