Project Melampo
A neurobiological, multimodal and quantum-theoretical architecture for assisted clinical intuition.
Layers in dialogue: perception, metacognition, quantum cognition.
Project Melampo
From Attention to Intuition
April 11, 2026
Scientific consolidation treatise for a neurobiological, multimodal, and theoretical-quantum architecture of assisted clinical intuition
Abstract
Project Melampo is consolidated here as a unified scientific program for clinical Computer-Aided Intuition. The core premise is that next-generation assisted diagnosis cannot be reduced either to pattern classification or to text generation alone. What is required is an architecture in which 3D perception, episodic and semantic memory, differential reasoning, metacognition, uncertainty quantification, clinical interoperability, and regulatory validation operate as an integrated system [1–9, 13–20, 25–38].
The treatise adopts four structural principles.
- Neurobiological principle. Thought emerges from distributed networks: working memory, associative networks, the default mode network, predictive processing, replay, and neuromodulation must be translated into observable computational modules [1–6].
- Pedagogical-metacognitive principle. Clinical expertise does not coincide with a single inferential act; it is built through practice, reflection, error control, hypothesis revision, and progressive learning [7–12].
- Engineering principle. The most promising state of the art, updated to April 2026, points toward federated stacks: 3D volumetric encoders, medical vision-language foundation models, knowledge graphs and retrieval, model orchestrators, generative replay, risk calibrators, interaction protocols, and Total Product Life Cycle governance [13–38].
- Theoretical-quantum principle. The theoretical-quantum hypothesis is not removed from the project. It is integrated as a formal and falsifiable research line articulated across three coupled levels: quantum-like cognition, open Schrödinger/Lindblad-type dynamics, and a biophysical frontier involving microtubules, radical pairs, and other non-classical mechanisms in biological systems [39–48].
The central thesis is that Melampo should operate as a multi-layer clinical architecture: an immediately implementable perceptual-grounded layer; a cognitive-metacognitive layer governing differential diagnosis, escalation, and abstention; and a theoretical-quantum layer modeling contextuality, evidence order, hypothesis interference, and, at the frontier, possible non-classical biophysical substrates. None of these levels is self-sufficient. The robustness of the project depends on their experimentally testable interaction.
1. Methodological Premise and Scientific Positioning
Melampo begins from a strong intuition: the value of an advanced diagnostic system does not coincide with average accuracy alone, but with the ability to compose heterogeneous evidence, manage ambiguity, know when not to issue a judgment, and restructure its hypotheses when new information arrives. A system of this kind is neither a simple classifier nor a simple medical chatbot. It is an architecture for assisted clinical reasoning.
This document therefore adopts a precise stance. Melampo is not presented as a project fragmented into unrelated parts, but as a theoretical-operational continuum in which different levels are distinguished only for methodological clarity.
Two scientific obligations follow. The first is to translate every major claim into verifiable constructs. The second is not to amputate frontier hypotheses prematurely, but to submit them to a program of differential testing, comparison with rival models, and explicit criteria for promotion, suspension, or rejection.
2. Neurobiology of Thought and Cognitive Processing
2.1 Distributed networks, working memory, and executive control
Contemporary evidence shows that working memory is not a static container but a distributed circuit centered on prefrontal cortex and supported by recurrent excitatory-inhibitory dynamics, inter-areal synchronization, and dopaminergic modulation [2]. In design terms, this implies that Melampo should not contain a single cognitive “center,” but rather a diagnostic workspace in which images, reports, history, ontological knowledge, and the current differential state remain simultaneously accessible across several inferential cycles.
A minimal classical formalization of the neuroelectrical substrate, useful as an observable macroscopic layer, is given by the Hodgkin–Huxley equation:
Where:
- is the membrane capacitance;
- is the membrane potential;
- is the external current;
- are maximal or leakage conductances;
- are gating variables;
- are ionic equilibrium potentials.
In this treatise the equation serves an important function: it establishes that Melampo’s observable neural level remains compatible with a classical electrophysiological description. The theoretical-quantum hypothesis does not replace that level; it subsumes it as a plane of macrobiological observables upon which effective parameters and experimental comparisons can be defined.
2.2 Default mode network, spontaneous thought, simulation, and abstraction
The default mode network has been redefined as a network active in memory, abstraction, scenario simulation, and the integration of internal and external information [1]. More recent evidence further suggests that spontaneous thought emerges from dynamics distributed beyond the DMN alone, involving network configurations that change with the content of mental flow [4].
For Melampo this is decisive: the intuitive component should not be treated as an obscure event, but as a rapid compression of prior experience, abstract schemas, and contextual signals. The system should therefore include an offline or quasi-offline mode devoted to trace reactivation, counterfactual construction, and the combination of distant patterns.
2.3 Predictive processing, global workspace, and surprise
Predictive processing interprets the brain as a generative hierarchy that formulates predictions and corrects its internal models via prediction errors [3, 49]. The corresponding computational translation for Melampo is straightforward: every new examination, symptom, or radiological finding changes the ranking of the differential diagnosis because it changes free energy, surprise, and the precision assigned to information channels.
A compact formulation of the inferential objective is:
Where:
- is the free energy at time ;
- is the approximate distribution over latent clinical states;
- are the observations available at step ;
- is the structured memory of the system;
- is the joint generative model of observations and states.
In Melampo, minimizing is not a philosophical abstraction. It defines the criterion by which the diagnostic workspace integrates new evidence and decides whether to investigate further, abstain, or consolidate a hypothesis.
2.4 Replay, sleep, dreaming, and constructive consolidation
Hippocampal replay during sleep and the microstructure of sleep are now strongly associated with memory consolidation [5]. A generative model of memory construction and consolidation has also shown that replay can be interpreted as the training of generative systems capable of reconstructing and recombining experience [6]. This passage is crucial for the project’s Section 7.2: Melampo’s statistical dream is not ornamental metaphor, but a computational strategy inspired by replay and constructive consolidation.
Accordingly, the generative module must:
- reactivate real cases of high salience;
- synthesize plausible cases from latent regions conditioned by clinical context;
- produce hard negatives and borderline cases;
- discard incoherent trajectories;
- consolidate only those samples that improve discrimination, calibration, or rare-case coverage.
2.5 Neuromodulation as computational control
The role of dopamine and noradrenaline should not be reduced to mere metaphor. Recent literature continues to show that dopaminergic and adrenergic systems modulate learning, motivation, signal precision, temporal regulation of synaptic output, and decision under uncertainty [50–52]. In Melampo they are translated into computational control variables: precision, volatility, update rate, novelty threshold, and abstention threshold.
A minimal biophysical relation between calcium and synaptic release probability can be written as:
Where:
- is the synaptic release probability;
- is a scale constant;
- is intracellular calcium concentration;
- is the cooperativity exponent.
Within the project, observable neurochemical quantities function as a bridge toward a more general statistical dynamics: they do not coincide with the wavefunction, but can modulate effective parameters governing stability, decoherence, precision, and decisional commitment.
3. Evolution of Thought in Pedagogy, Learning, and Clinical Expertise
The construction of intuition is not only neurobiological. It is also pedagogical. Piaget describes the emergence of cognitive structures through progressive organization; Vygotsky shows the decisive role of social mediation and cultural tools; Bruner emphasizes the function of representations and the narrative construction of meaning [10–12]. In clinical domains, expertise emerges from the intertwining of:
- real-case experience;
- dialogue with teachers and peers;
- retrospective correction of error;
- internalization of diagnostic schemas and counterexamples;
- metacognitive reflection on one’s own reliability.
In the neuro-educational literature and recent medical pedagogy, metacognition is defined as the capacity to monitor and regulate one’s own cognitive processes [7–8]. For Melampo this implies an architecture with two obligations:
- an obligation of self-monitoring, that is, explicit estimation of safety, coverage, and limits;
- an obligation of self-regulation, that is, the ability to request additional data, activate retrieval, expand the differential diagnosis, or abstain.
The debate on dual-process models of clinical reasoning also suggests caution: diagnostic quality depends primarily on the nature and availability of retrieved knowledge, not on a rigid opposition between an intuitive system and an analytical one [9]. The design solution is therefore a dynamic continuum between rapid response and slower revision, mediated by uncertainty thresholds and risk cost.
4. State of the Art of Multimodal Medical AI as of April 11, 2026
4.1 General trend line
The most promising trajectory in medical AI does not converge toward a single monolithic model, but toward coordinated ecosystems of specialist and generalist models. Medical foundation models have become more relevant because they combine large-scale pretraining, adaptability across multiple clinical tasks, and integration across text, images, longitudinal data, and knowledge sources [13]. In parallel, 2025–2026 accelerated three frontier lines:
- vision-language models for 3D imaging [15, 19];
- multimodal foundation models for pathology on whole-slide images [17–18];
- orchestration of multiple models through standardized interaction protocols [20–24].
4.2 Volumetric encoders and 3D self-supervised learning
For Melampo’s perceptual core, the most robust families remain transformer-based volumetric encoders or hybrid transformer-CNN stacks. 3D Swin Transformer, Swin-UNETR, and their descendants opened the path, while more recent work on 3D self-supervised learning indicates that generalization increasingly depends on pretraining quality and multimodal anatomical coverage [15–16]. Architecturally, this leads to a preference for:
- 3D encoders pretrained on large CT/MRI collections;
- multi-resolution volumetric tokenization;
- support for multi-view and whole-volume context;
- lightweight adaptation (adapters, LoRA, prompt tuning) for site-specific deployment.
4.3 Medical vision-language foundation models and the most promising models
By 2025, the review literature on vision-language foundation models for 3D imaging had documented rapid growth in systems combining volumetric backbones with LLMs or language decoders for report generation, VQA, and multimodal reasoning [15]. In 2026 this axis was further enriched by models such as Decipher-MR, explicitly designed for 3D MRI representations, and the MedGemma 1.5 update presented by Google as an open medical model capable of handling text, 2D images, high-dimensional volumes, and whole-slide pathology [19, 53].
For Melampo this implies a precise architectural choice: do not entrust everything to a single VLM, but construct a federation of modules.
4.4 Multifactoring, real multimodality, and interaction among models
The term multifactoring is used here in a technical sense: a serious clinical decision requires the fusion of heterogeneous factors—morphology, symptoms, laboratory data, chronology, provenance, therapeutic history, rare patterns, literature, guidelines, and local resources. The optimal model is not the one that merely sees more, but the one that can align heterogeneous factors within a verifiable update structure.
A useful formalism for cross-modal fusion is:
Where:
- is the active clinical query at step ;
- and are keys and values produced by the various encoders (visual, textual, knowledge-based);
- is the normalization dimensionality;
- is the fused multimodal representation.
When modules are multiple, Melampo should use an orchestrator. A simple mixture-of-experts formulation is:
Where:
- is the contribution of expert ;
- is the router score for expert ;
- is the weight assigned to that expert;
- is the gating temperature;
- is the integrated output.
This formulation is especially suitable for Melampo when combining a 3D perceptual specialist, a semantic specialist for EHR and guidelines, a retriever/knowledge-graph component, and a generator/verifier.
4.5 Languages for model interaction and semantic standards
If Melampo is to become a platform rather than an isolated prototype, it needs two families of interaction languages.
The first concerns the clinical domain:
- FHIR for clinical resources and interoperable workflow [23];
- DICOM / DICOM-SR for images, metadata, and structured reports [24];
- SNOMED CT, LOINC, and ICD for shared semantics.
The second concerns cooperation among models and tools:
- the Model Context Protocol (MCP) as an open standard for connecting models, tools, resources, and context via JSON-RPC [21];
- Agent2Agent (A2A) as a protocol for collaboration among agents/models from different vendors and frameworks [22];
- JSON schemas and clinical ontologies for structured, validable, and signable outputs.
The emerging picture is clear: Melampo’s future is not an isolated LLM, but a multi-model clinical system reasoning over established healthcare standards and open orchestration protocols.
4.6 KG-RAG, structured memory, and grounding
Recent reviews of RAG in healthcare converge on one point: grounding, transparency, and updateability improve when the model can retrieve external evidence, but the benefits critically depend on document quality, retrieval quality, and post-retrieval reasoning [25]. In Melampo, knowledge graphs and RAG must therefore be structural parts of semantic memory, not optional accessories.
A composite retrieval score can be expressed as:
Where:
- is the retrieved document or case;
- is the clinical query;
- is the dense semantic score;
- is the lexical/sparse score;
- measures coherence with the knowledge graph;
- weights local or institutional relevance;
- are calibrable weights.
4.7 Generative replay, case synthesis, and continual training
Synthetic generation of clinical and imaging data has made a qualitative leap. Reviews from 2025–2026 describe a landscape dominated by diffusion models, VAEs, GANs, LLMs, and multimodal systems, with applications ranging from image synthesis to tabular/EHR synthesis, longitudinal data generation, and data balancing [31–33]. At the same time, the literature on continual learning in medicine has clarified that, without replay or other constraints, the risk of catastrophic forgetting is high [30].
Melampo should therefore use the generative module in three modes:
- controlled augmentation of rare classes;
- simulation of counterfactual cases and temporal transitions;
- statistical dreaming for reorganizing memory and differential diagnosis.
A conditional VAE loss suitable for replay is:
Where:
- is the observed case (image, text, or multimodal input);
- is the conditioning clinical context;
- is the latent variable;
- is the variational encoder;
- is the decoder/generator;
- is the context-conditioned prior;
- controls the weight of KL regularization.
4.8 Uncertainty, abstention, and clinically serious systems
Recent literature on uncertainty quantification in medical imaging distinguishes at least three families of uncertainty: aleatoric, epistemic, and operational. For Melampo, a fourth family must be added: retrieval-grounding uncertainty [26–27].
A useful operational decomposition is:
Where:
- is intrinsic data uncertainty;
- is model/parametric uncertainty;
- is uncertainty due to incomplete retrieval or grounding;
- measures dataset shift or out-of-distribution conditions.
A minimal emission policy is:
Where:
- is the probability of diagnosis given observations ;
- is the minimum confidence threshold;
- is the maximum tolerable risk threshold;
- is the binary emission indicator.
5. Unified Scientific Architecture of Melampo
The proposed architecture is composed of seven tightly coupled functional layers.
5.1 Data layer and clinical normalization
Inputs enter Melampo through an ingestion layer handling 2D/3D DICOM, whole-slide pathology, EHR, reports, laboratory data, and therapeutic timelines. Semantic normalization produces FHIR resources, DICOM-SR objects, and terminology mappings onto SNOMED/LOINC/ICD.
5.2 Perceptual layer and cross-modal fusion
This layer hosts 3D encoders, pathology encoders, text encoders, and context adapters. Fusion must be multi-resolution and cross-modal, with both local and global evidence handling.
5.3 Memory and differential diagnosis
Melampo’s memory is tripartite:
- episodic, containing analogous cases and known outcomes;
- semantic, containing guidelines, literature, ontologies, and protocols;
- prototypical, containing diagnostic clusters and rare variants.
A basic prototypical classifier may be written as:
Where:
- is the set of examples for class or prototype ;
- is the embedding of case ;
- is the prototypical center of class ;
- is a latent-space distance.
5.4 Meta-controller, critique loop, and safety governor
The metacognitive layer evaluates whether the available data are sufficient, whether the differential should be expanded, whether retrieval should be activated, or whether clarification requests should be generated. The critique loop acts as an internal judge of clinical coherence, image-text correspondence, guideline adherence, and pathophysiological plausibility.
5.5 Generative replay and reorganization layer
The generator is not meant to “novelize” the case, but to produce clinically constrained replay and counterfactuals. This layer feeds both memory and differential reasoning. Its essential constraint is that every synthetic case be accepted only if it passes tests of clinical coherence and training utility.
5.6 Optional theoretical-quantum layer
The theoretical-quantum layer comes into play when evidence order, contextual ambiguity, or hypothesis interference render additive/classical models insufficient. It does not replace classical models; it accompanies them and challenges them experimentally.
6. Technical-Mathematical Formalization of the Implementable Core
6.1 Differential update
In the classical regime, Bayesian updating of the differential remains the most transparent starting point:
Where:
- is diagnostic hypothesis ;
- is the observation sequence up to time ;
- is the prior updated at the previous step;
- is the contextual likelihood.
In Melampo, this equation is the classical reference against which the benefits of quantum-like formalisms must be measured.
6.2 Continual learning and parameter consolidation
To avoid catastrophic forgetting during incremental updates, an Elastic Weight Consolidation term may be written as:
Where:
- is the loss of the new task;
- is the regularization coefficient;
- is the Fisher importance of parameter ;
- is the value consolidated from the previous task.
6.3 Prediction quality and calibration
A standard global calibration metric is the Expected Calibration Error:
Where:
- is confidence bin ;
- is the number of examples in that bin;
- is the total number of examples;
- is empirical accuracy in the bin;
- is mean predicted confidence in the bin.
Melampo’s objective is not only to reduce error, but to reduce the gap between real accuracy and declared confidence.
7. Integrated Theoretical-Quantum Hypothesis
7.1 Logical foundation: contextuality, evidence order, and interference
Recent literature on quantum cognition has reinforced two points. First, quantum formalisms are useful already at the cognitive level for describing order effects, response replicability, contextuality, and interference between alternatives [39–41]. Second, realistic cognitive measurement tools are not always reducible to projective measurements; in some cases they require more general state-update maps [40].
For Melampo this means that the diagnostic differential can be treated as a competitive belief state in which the order of incoming evidence changes the shape of the update. This does not abolish Bayes; it defines a measurable theoretical competitor.
7.2 Before measurement, during measurement, after measurement
The project’s crucial requirement can be formalized in three temporal stages.
7.2.1 Before measurement: coherent evolution of hypotheses
Before decisive new evidence arrives, the latent state of the hypotheses is represented by an effective wavefunction evolving according to the Schrödinger equation:
Where:
- is the superposition state of the diagnostic hypotheses at time ;
- is an effective scale constant of the formalism;
- is the effective Hamiltonian incorporating context, memory, and dynamic priors.
This equation expresses the following principle: before decisive clinical measurement, hypotheses compete continuously and deterministically in their state space. In other words, Melampo maintains a controlled superposition of the diagnostic differential.
7.2.2 During measurement: contextual selection
When a new high-information piece of evidence arrives—an exam, an imaging finding, a selective answer to a question, a laboratory marker—the state is subjected to measurement. The probability of diagnostic outcome is given by Born’s rule:
Where:
- is the measurement operator associated with diagnosis in context ;
- is the density operator equivalent of state ;
- is the operator trace.
In Melampo, collapse is therefore interpreted as the deterministic solution of a high-dimensional probabilistic problem: overlapping symptoms, medical statistics, partial reports, and conflicting patterns converge toward a selected outcome. In this operational sense, collapse may be called clinical intuition, provided that it remains traceable, parameterized, and verifiable.
7.2.3 After measurement: consolidation or reopening of the differential
After measurement, the system must choose between two routes:
- consolidate the dominant hypothesis in memory if evidence exceeds coherence and risk thresholds;
- reopen the superposition by asking new questions or launching new retrieval operations if risk remains high.
7.3 Open quantum systems: the Lindblad equation
Because the cognitive system is not isolated but continuously interacts with data, memory, noise, and clinical context, the most appropriate formulation is that of an open system:
Where:
- is the density operator of the diagnostic state;
- is the commutator;
- are Lindblad operators modeling measurement, noise, retrieval, clinical constraints, or commitment;
- is the anticommutator.
This equation makes it possible to incorporate decoherence, contextual updating, interaction with the clinical environment, and safety thresholds without abandoning the project’s formal structure.
7.4 Dirac-like formulation for coupled cognitive channels
The appeal to Dirac’s equation is intended here as an effective formalism for describing two coupled cognitive channels—for example a fast channel and a deliberative channel, or a perceptual channel and a semantic channel:
Where:
- is the two-component cognitive spinor;
- and represent two coupled channels;
- is the momentum operator in latent space;
- and are matrices of the Dirac formalism;
- is an effective velocity of information propagation;
- is a parameter of cognitive inertia;
- is the context potential.
In this interpretation, the Dirac-like form imposes first-order dynamics and a coupling structure that may be useful for modeling fast-to-analytic transitions without falling back into rigid dualism.
7.5 From neurochemistry to non-classical statistical dynamics
The bridge required by the project between chemistry, electrophysiology, and quantum theory can be formulated by introducing an effective Hamiltonian dependent on clinical content and on a neuromodulatory control vector :
Where:
- is the system’s base term;
- are diagnostic observables dependent on latent state and memory ;
- are coupling coefficients;
- is the vector of neuromodulatory surrogates (novelty, volatility, reward prediction, alertness, conflict);
- are modulation operators;
- is an effective decoherence/commit rate.
Within this framework neuromodulators are not abolished by quantum theory: they become regulators of precision, stability, and transition from superposition to selection.
7.6 Training, new-case generation, and the statistical dream
The requirement to include generative training as an integral part of the model of thought is made explicit here.
7.6.1 Latent statistical cloud
Given a clinical context composed of symptoms, examinations, physical traits, provenance, therapeutic history, and known patterns, Melampo constructs a conditioned latent cloud:
Where:
- is the composite clinical context;
- is the latent configuration of the statistical dream;
- is a generated or reorganized synthetic case.
7.6.2 Coherence filter and consolidation
Every synthetic case must be passed through a coherence filter:
Where:
- is an anatomo-clinical coherence score;
- is a risk of incoherence or epistemic harm;
- is the minimum coherence threshold;
- is the maximum acceptable risk threshold;
- determines whether the synthetic case is accepted.
7.6.3 Clinical meaning of the statistical dream
The statistical dream is not an indiscriminate generator. It is a process of memory reorganization in which the collapse of a diagnostic wavefunction may represent:
- discovery of a new coherent link between distant patterns;
- creation of a useful prototype for rare cases;
- emergence of an incoherent case that is discarded but sharpens the decision boundary;
- a counterfactual useful to the differential diagnosis.
In this sense, generative replay becomes part of the theory of intuition: intuition is not only online decision; it is also the offline reorganization of the space of hypotheses.
7.7 Microtubules, Orch-OR, and the biophysical frontier
The microtubular line and Orch-OR-type models remain part of the project as a theoretical-quantum frontier under examination [46–48]. The scientific attitude adopted here is threefold:
- not to assume as established what has not yet reached empirical consensus;
- not to expel from the project hypotheses with high theoretical content;
- to define criteria for differential testing, replication, and possible falsification.
The same logic applies to the quantum biology literature on radical pairs, the quantum Zeno effect in cryptochrome, NMDA receptor activity, and proton tunneling [42–45]. These results do not prove a strong quantum brain, but they do show that quantum phenomena in biology are not conceptually inadmissible.
8. Platform for Verification, Validation, and Falsifiability
Melampo’s robustness depends on a multi-level validation platform. The goal is not to defend the project, but to construct a system in which each block can be confirmed, resized, or rejected.
8.1 Verification of the perceptual and multimodal core
First level: technical benchmarks for segmentation, detection, report drafting, retrieval, and image-text coherence. Minimum endpoints must include site-wise performance, device-wise performance, subgroup performance, and performance stratified by data quality.
8.2 Verification of reasoning and metacognition
Second level: multi-step clinical reasoning benchmarks of the type developed in 2025–2026, with evaluation not only of final correctness but also of inferential trajectory quality [28–29]. Melampo should be measured on:
- quality of the differential diagnosis;
- correctness of the proposed sequence of questions and tests;
- ability to revise the diagnosis in light of new evidence;
- quality of the rationale and fidelity of the explanation;
- accuracy of abstention.
8.3 Calibration, selective prediction, and out-of-distribution control
Third level: calibration and selective-prediction measures. Here enter Brier score, ECE, NLL, AUROC for OOD detection, risk-coverage curves, false reassurance rate, and escalation latency. Without this level, any claim to clinical intuition remains engineeringly immature.
8.4 Falsifiability program for the theoretical-quantum line
The theoretical-quantum line must be tested on three distinct planes.
- Cognitive-formal plane. Quantum-like models should be benchmarked against Bayesian, energy-based, and other classical state-space baselines on contextuality, order effects, diagnostic reversals, and ambiguity handling.
- Dynamical plane. Schrödinger/Lindblad-type formulations should be tested against classical sequential-inference models on stability, calibration, and the handling of partially conflicting evidence.
- Biophysical frontier plane. Microtubule-, radical-pair-, or other non-classical hypotheses should be treated as separate research tracks with explicit empirical signatures, replication requirements, and a clear rule that no clinical deployment claim may rest on them alone.
8.5 Reader studies, workflow, and human factors
The clinically relevant proof remains: physician alone versus physician + Melampo. Recommended endpoints are:
- diagnostic accuracy and differential quality;
- sensitivity to rare cases and weak signals;
- reduction in decision time without increasing risk;
- reduction in false reassurance;
- calibrated clinician trust in the system.
8.6 Governance, regulation, and Total Product Life Cycle
WHO guidance on large multimodal models for health requires attention to transparency, oversight, social risk, responsible use, and continuous governance [34]. In 2025 the FDA consolidated recommendations on Predetermined Change Control Plans for AI-enabled devices, while the IMDRF finalized Good Machine Learning Practice principles [35–37]. In Europe, the AI Act is now part of the compliance landscape for high-risk systems in healthcare [38].
Melampo must therefore be born with:
- dataset cards, model cards, change logs, and audit trails;
- change-control plans and rollback criteria;
- post-deployment drift and bias monitoring;
- re-validation procedures after every substantial model modification;
- segregation between research and clinical environments.
9. Scientific and Technical Roadmap
9.1 Phase I — Verifiable clinical core
- 3D and whole-slide encoders;
- EHR and report parsers;
- KG-RAG memory;
- planner–retriever–generator–verifier orchestrator;
- uncertainty layer with abstention.
9.2 Phase II — Memory and continual learning
- episodic and prototypical memory;
- continual learning with EWC and replay;
- synthetic datasets for rare classes.
9.3 Phase III — Reasoning and metacognition
- adaptive question planning;
- critique loop;
- multi-step reasoning benchmarks;
- self-monitoring and escalation controls.
9.4 Phase IV — Theoretical-quantum track
- contextuality and order-effect benchmarks;
- optional quantum-like layer;
- comparison of Schrödinger/Lindblad formulations with classical baselines;
- design of frontier biophysical protocols.
9.5 Phase V — Clinical validation and responsible industrialization
- reader studies;
- controlled pilot deployment;
- post-market monitoring and change control;
- integration with the regulatory stack and TPLC documentation.
10. Conclusions
Project Melampo can be consolidated without losing its original ambition. The strongest path is to treat it as an integrated scientific architecture in which:
- neurobiology provides functional constraints;
- pedagogy of expertise provides the growth model of intuition;
- multimodal AI provides the operational engine;
- metacognition governs reliability, escalation, and abstention;
- the theoretical-quantum hypothesis remains a living part of the project, but tied to programs of verification and falsification.
The notion of intuition emerging from this treatise is not mystical. It is the contextual, rapid yet traceable selection of a hypothesis within a space of statistical and semantic possibilities organized by experience, memory, and measurement. In the fully developed version of the project, the statistical dream, generative replay, quantum-like differential reasoning, and metacognitive control become aspects of the same clinical machine: a machine that learns, corrects itself, abstains when necessary, and accounts for its steps.
Updated Bibliography
- Smallwood J, Bernhardt BC, Leech R, Bzdok D, Jefferies E, Margulies DS. The default mode network in cognition: a topographical perspective. Nature Reviews Neuroscience. 2021;22:503–513. doi:10.1038/s41583-021-00474-4.
- Jaffe RJ, Constantinidis C. Working Memory: From Neural Activity to the Sentient Mind. Comprehensive Physiology. 2021;11(4):2547–2587. doi:10.1002/cphy.c210005.
- Whyte CJ, Smith R. The predictive global neuronal workspace: A formal active inference model of visual consciousness. Progress in Neurobiology. 2021;199:101918. doi:10.1016/j.pneurobio.2020.101918.
- Cremona S, Gillig A, Mellet E, et al. A dynamic framework of brain functional patterns shaped by spontaneous thoughts beyond the default mode network. Scientific Reports. 2025;15:10432. doi:10.1038/s41598-025-10432-0.
- Chang H, Tang W, Wulf AM, et al. Sleep microstructure organizes memory replay. Nature. 2025;637:1161–1169. doi:10.1038/s41586-024-08340-w.
- Spens E, Burgess N. A generative model of memory construction and consolidation. Nature Human Behaviour. 2024;8:526–543. doi:10.1038/s41562-023-01799-z.
- de Fleur DS, Bredeweg B, van den Bos W. Metacognition: ideas and insights from neuro- and educational sciences. npj Science of Learning. 2021;6:13. doi:10.1038/s41539-021-00089-5.
- Merkebu J, Veen M, Hosseini S, Varpio L. The case for metacognitive reflection: a theory integrative review with implications for medical education. Advances in Health Sciences Education. 2024;29:1481–1500. doi:10.1007/s10459-023-10310-2.
- Norman G, Pelaccia T, Wyer P, Sherbino J. Dual process models of clinical reasoning: The central role of knowledge in diagnostic expertise. Journal of Evaluation in Clinical Practice. 2024;30(5):788–796. doi:10.1111/jep.13998.
- Piaget J, Inhelder B. The Psychology of the Child. New York: Basic Books; 1969.
- Vygotsky LS. Mind in Society: The Development of Higher Psychological Processes. Cambridge, MA: Harvard University Press; 1978.
- Bruner JS. Toward a Theory of Instruction. Cambridge, MA: Harvard University Press; 1966.
- Moor M, Banerjee O, Abad ZSH, et al. Foundation models for generalist medical artificial intelligence. Nature. 2023;616:259–265. doi:10.1038/s41586-023-05881-4.
- Liu F, Li Z, Yin Q, et al. A multimodal multidomain multilingual medical foundation model for zero shot clinical diagnosis. npj Digital Medicine. 2025;8:86. doi:10.1038/s41746-024-01339-7.
- Wu J, Wang Y, Zhong Z, et al. Vision-language foundation model for 3D medical imaging. npj Artificial Intelligence. 2025;1:17. doi:10.1038/s44387-025-00015-9.
- Xu T, Hosseini S, Anderson C, et al. A generalizable 3D framework and model for self-supervised learning in medical imaging. npj Digital Medicine. 2025;8:639. doi:10.1038/s41746-025-02035-w.
- Ding T, Wagner SJ, Song AH, et al. A multimodal whole-slide foundation model for pathology. Nature Medicine. 2025;31:3749–3761. doi:10.1038/s41591-025-03982-3.
- Xu Y, Wang Y, Zhou F, et al. A multimodal knowledge-enhanced whole-slide pathology foundation model. Nature Communications. 2025;16:1462. doi:10.1038/s41467-025-66220-x.
- Yang Z, DSouza N, Megyeri I, et al. Decipher-MR: a vision-language foundation model for 3D MRI representations. npj Digital Medicine. 2026. doi:10.1038/s41746-026-02596-4.
- Golden D, Mahvar F. Next generation medical image interpretation with MedGemma 1.5 and medical speech to text with MedASR. Google Research Blog. January 13, 2026.
- Model Context Protocol Working Group. Model Context Protocol Specification (revision 2025-06-18). 2025. Available at: modelcontextprotocol.io.
- Surapaneni R, Jha M, Vakoc M, Segal T. Announcing the Agent2Agent Protocol (A2A). Google Developers Blog. April 9, 2025.
- HL7 International. FHIR Overview, Release 5. 2025. Available at: hl7.org/fhir.
- NEMA. DICOM PS3.1 2026a – Introduction and Overview. 2026. Available at: dicomstandard.org.
- Amugongo LM, Mascheroni P, Brooks S, Doering S, Seidel J. Retrieval augmented generation for large language models in healthcare: A systematic review. PLOS Digital Health. 2025;4(5):e0000873. doi:10.1371/journal.pdig.0000873.
- Huang L, Ruan S, Xing Y, Feng M. A review of uncertainty quantification in medical image analysis: Probabilistic and non-probabilistic methods. Medical Image Analysis. 2024;97:103268. doi:10.1016/j.media.2024.103268.
- Lambert B, Forbes F, Doyle S, Dehaene H, Dojat M. Trustworthy clinical AI solutions: A unified review of uncertainty quantification in Deep Learning models for medical image analysis. Artificial Intelligence in Medicine. 2024;158:103007. doi:10.1016/j.artmed.2024.103007.
- Zhou S, Xie W, Li J, et al. Automating expert-level medical reasoning evaluation of large language models. npj Digital Medicine. 2026;9:34. doi:10.1038/s41746-025-02208-7.
- Agrawal M, Chen IY, Gulamali F, et al. The evaluation illusion of large language models in medicine. npj Digital Medicine. 2025;8:600. doi:10.1038/s41746-025-01963-x.
- Bruno P, Quarta A, Calimeri F. Continual Learning in Medicine: A Systematic Literature Review. Neural Processing Letters. 2025;57:2. doi:10.1007/s11063-024-11709-7.
- Ibrahim M, Al Khalil Y, Amirrajab S, et al. Generative AI for synthetic data across multiple medical modalities: A systematic review of recent developments and challenges. Computer Methods and Programs in Biomedicine. 2026;261:108730. doi:10.1016/j.cmpb.2025.108730.
- Loni M, Poursalim F, Asadi M, Gharehbaghi A. A review on generative AI models for synthetic medical text, time series, and longitudinal data. npj Digital Medicine. 2025;8:281. doi:10.1038/s41746-025-01725-9.
- Wang S, Zhou X, Li C, et al. Generative Artificial Intelligence in Medical Imaging: Foundations, Progress, and Clinical Translation. Research. 2025:1029. doi:10.34133/research.1029.
- World Health Organization. Ethics and governance of artificial intelligence for health: Guidance on large multi-modal models. Geneva: WHO; 2025.
- U.S. Food and Drug Administration. Marketing Submission Recommendations for a Predetermined Change Control Plan for Artificial Intelligence-Enabled Device Software Functions. Final guidance. 2025.
- U.S. Food and Drug Administration, Health Canada, MHRA. Predetermined Change Control Plans for Machine Learning-Enabled Medical Devices: Guiding Principles. 2023.
- International Medical Device Regulators Forum. Good Machine Learning Practice for Medical Device Development: Guiding Principles. 2025.
- European Commission. Artificial Intelligence in healthcare; AI Act implementation timeline. 2026.
- Huang J, Epping G, Trueblood JS, Yearsley JM, Busemeyer JR, Pothos EM. An overview of the quantum cognition research program. Psychonomic Bulletin & Review. 2025;32(6):2507–2556.
- Fuyama M, Khrennikov A, Ozawa M. Quantum-like cognition and decision-making in the light of quantum measurement theory. Philosophical Transactions of the Royal Society A. 2025;383(2300):20240156.
- Busemeyer JR, Bruza PD. Quantum Models of Cognition and Decision. Cambridge: Cambridge University Press; 2012.
- Alvarez PH, Gerhards L, Solov’yov IA, de Oliveira MC. Quantum phenomena in biological systems. Frontiers in Quantum Science and Technology. 2024;3:1466906. doi:10.3389/frqst.2024.1466906.
- Denton MCJ, Smith LD, Xu W, et al. Magnetosensitivity of tightly bound radical pairs in cryptochrome is enabled by the quantum Zeno effect. Nature Communications. 2024;15:10823. doi:10.1038/s41467-024-55168-4.
- Nair PS, Zadeh-Haghighi H, Simon C. Radical pair model for magnetic field effects on NMDA receptor activity. Scientific Reports. 2024;14:3628. doi:10.1038/s41598-024-54343-y.
- Slocombe L, Sacchi M, Al-Khalili J. An open quantum systems approach to proton tunnelling in DNA. Communications Physics. 2022;5:109. doi:10.1038/s42005-022-00881-8.
- Hameroff S, Penrose R. Consciousness in the universe: a review of the Orch OR theory. Physics of Life Reviews. 2014;11(1):39–78. doi:10.1016/j.plrev.2013.08.002.
- Wiest MC. A quantum microtubule substrate of consciousness is experimentally supported and solves the binding and epiphenomenalism problems. Neuroscience of Consciousness. 2025;2025(1):niab028.
- Wiest MC, Puniani AS. Conscious active inference II: Quantum orchestrated objective reduction among intraneuronal microtubules naturally accounts for discrete perceptual cycles. Cognitive Systems Research. 2026;77:101115.
- Sterzer P, Keller GB. Predictive processing: Layer-specific prediction error signals in human cortex. Current Biology. 2024;34(10):R496–R498. doi:10.1016/j.cub.2024.04.040.
- Timashkov A, Anderson S, Zinchenko O. Neural correlates of uncertainty processing: meta-analysis of fMRI studies. Frontiers in Neuroscience. 2025;19:1662272. doi:10.3389/fnins.2025.1662272.
- Fujimoto A, Elorette C, Fujimoto SH, et al. Pharmacological Modulation of Dopamine Receptors Reveals Distinct Brain-Wide Networks Associated with Learning and Motivation in Nonhuman Primates. Journal of Neuroscience. 2025;45(7):e1234-24.
- Furukawa K, Kawaguchi S. Coordinated regulation of timing and strength of synaptic outputs by adrenergic receptors through control of action potentials in Purkinje cells. Frontiers in Cellular Neuroscience. 2024;18:1403475. doi:10.3389/fncel.2024.1403475.
- Choi JY, Yoo TK. Transforming clinical medicine with multimodal artificial intelligence, agentic systems, and the model-context protocol: a perspective on future directions. Discover Artificial Intelligence. 2026;6:12.