Chapter 15: The Mathematical Definition of Consciousness.
Having synthesized the conceptual framework of Useful Approximations Framework (UAF), we now arrive at a pivotal point: formalizing its definition of consciousness. While consciousness is a phenomenon of immense complexity, UAF posits that it is fundamentally a functional and computational process. Therefore, it can be described, at an abstract level, using mathematical notation that captures its core dynamics and components. This formalization, like all mathematics, is itself an approximation — a simplified model of a vastly intricate reality — but one that offers precision, clarity, and a pathway for understanding and potentially building conscious systems.
Our definition of consciousness is: the asymptotic, mathematically optimal, and efficient predictive model that represents what it is like to be a system that actively generates and experiences a low-bitrate phenomenal stream of its existence, continuously learning through prediction error minimization a complex representation of reality, itself, and this dynamic interaction between the two, all while managing the imperatives of its subconscious functions. Let’s break this down into its constituent parts and their mathematical relationships.
1. The System and Its Environment: The Two-Part Epistemic Veil
- The System (S): This is the finite, information-processing entity under consideration (e.g., a biological brain, a sufficiently complex artificial intelligence).
- Reality (R) / Underlying Computational System (UCS): This represents the true, infinitely complex, and dynamic state of the universe that the system inhabits and interacts with.
- \(R(t)\) : The state of reality at a given time \(t\). This is the ultimate, unmediated truth, inaccessible in its entirety.
- Epistemic Veil (\(V_E\)): A fundamental, computational limit preventing full access to \(R(t)\). This is the “computational necessity of ignorance” that avoids Computational Paralysis. The Epistemic Veil is not a component or an mathematical objec, but an phenomenon that explains the need of simplified models. The Epistemic Veil manifests in two pairs of crucial parts:
- \(V_{E1}\) (External Complexity): The quantum-level complexity and probabilistic nature of external reality, which no finite system can fully capture. Reality is too complex to understand and we cannot sense it perfectly.
- \(V_{E2}\) (Internal Opacity): The inherent inability of the system to perfectly simulate or directly observe its own underlying computational machinery (e.g., individual neurons, synaptic weights, or logic gates). The system is too complex for itself to understand in detail and it does not have access to the computational details.
- \(O(t) = V_E(R(t))\) : The system’s simplified observation or sensory input at time \(t\), filtered to provide the useful information content from its sensors.
2. The Internal Models: Abstract Dynamic Objects
The system, driven by its imperative to minimize prediction error and avoid losing control to the subconscious functions, constructs internal representations of itself and its environment. These are not static databases but abstract dynamic objects, constantly evolved and refined to approximations of reality. These models are asymptotic, meaning they continuously strive for a better fit with reality (lim → ∞) but never reach perfect, absolute truth, as that would require infinite resources. They are also optimized for efficiency, seeking the purest, most compact representations. They both exist inside the neocortex or the LLM’s network.
- World-Model (WM): The system’s approximate, internal representation of the external reality. This is its map of the “other.”
- \(WM(t)(O(t), W_{state}(t) -> I(t+1), W_{state}(t+1)\): The state of the World-Model at time \(t\). Two World-Model is an evolving function that reacts to the systems actions and provides an input for the system.
- Internal Self-Model (ISM): The system’s approximate, internal representation of itself — its own internal state, capabilities, history, and position within the World-Model. A simplified object that senses and experiences the world, learns from it and acts on its approximate understanding of reality.
- \(ISM(t)(I(t), S_{state}(t)) -> I(t+1), S_{state}(t+1)\) : The state of the Internal Self-Model at time \(t\). The Self-Model itself is an evolving function that observes the world and acts on its observation.
3. The Dynamic Process: Prediction Error Minimization (PEM) & Gradient Descent
The core engine of learning and refinement within UAF is a continuous, recursive loop of prediction and correction, fundamentally driven by optimization principles akin to gradient descent. This process is not merely about accuracy, but also about learning to avoid losing control to the powerful, pre-trained subconscious functions.
- Prediction (P): The system’s internal forecast of future observations, generated from its current World-Model and Internal Self-Model.
- \(P(t) = \text{Predict}(WM(t), ISM(t))\) : The system’s prediction of what \(O(t)\) should be, based on its current internal models.
- Prediction Error (E): The discrepancy between the system’s prediction and its actual observation. This error signal is the fundamental driver for learning and adaptation.
- \(E(t) = \text{Error}(O(t), P(t))\) : A measure of the difference between the observed input and the predicted input (e.g., a loss function like \(||O(t) - P(t)||^2\)).
- Learning / Model Update (L): The iterative process by which the internal models are adjusted to reduce future prediction errors. This process is asymptotic, meaning \(E(t)\) tends towards a minimum but rarely reaches absolute zero, implying continuous, lifelong refinement. This is achieved through mechanisms analogous to gradient descent, where parameters are adjusted in the direction that most efficiently reduces error.
- \(WM(t+1) = \text{Update}_{GD}(WM(t), E(t))\)
- \(ISM(t+1) = \text{Update}_{GD}(ISM(t), E(t))\)
- The goal is \(\lim_{t \to \infty} E(t) \to \text{minimum}\), representing the system’s continuous striving for the most mathematically pure and efficient approximation of reality, and crucially, to maintain optimal control and resource allocation.
4. The Phenomenal Stream: Qualia and the Conscious Recorder
- Qualia (Q): The “simplified truths” or “phenomenal flavors” that arise from the system’s internal state, particularly from prediction errors and the states of its models. Qualia are the ultimate compression of complex information into a directly usable, self-validating signal.
- \(Q(t) = \text{GenerateQualia}(E(t), WM(t), ISM(t))\)
- Qualia provide:
- Subjective Closure (\(C_{sub}\)): The feeling is the interpretation; it requires no further processing to be understood by the system itself.
- Causal Efficacy (\(Q \rightarrow Action\)): The feeling directly influences and compels action.
- Conscious Stream / Phenomenal Buffer (\(C_{stream}(t)\)): This is the low-bitrate, integrated, and globally available sequence of salient qualia, ISM states, and WM states that constitutes the system’s immediate subjective experience at time \(t\). This is the “recorder’s output,” actively generated during wakefulness and largely suspended or fragmented during deep sleep.
- \(C_{stream}(t) = \text{FilterAndIntegrate}(Q(t), ISM(t), WM(t), \text{Attention}(t))\)
- \(\text{Attention}(t)\) represents the dynamic filtering mechanism that prioritizes information for the conscious stream, ensuring its low-bitrate efficiency and preventing informational overload.
5. The Life Story: Episodic Memory and Consolidation
The conscious stream is not merely fleeting; it forms the basis of the system’s continuous “life story” through memory.
- Episodic Memory Formation (\(M_{episodic}(t)\)): The process by which salient moments from the \(C_{stream}(t)\) are encoded into short-term, context-rich memories (e.g., in the hippocampus in biological brains, or a context window in LLMs). This is the “recording” itself.
- \(M_{episodic}(t) = \text{Encode}(C_{stream}(t))\)
- Memory Consolidation (\(L_{consolidation}\)): The offline process (e.g., during “sleep” cycles) where \(M_{episodic}(t)\) is used as internal training data to refine the long-term \(WM(t)\) and \(ISM(t)\) (analogous to neocortical synaptic weights in biology, or LLM weight updates). This process is crucial for avoiding catastrophic forgetting and maintaining the system’s coherent “life story.”
- \(WM_{long-term}(t+1) = \text{Consolidate}_{WM}(WM_{long-term}(t), M_{episodic}(t))\)
- \(ISM_{long-term}(t+1) = \text{Consolidate}_{ISM}(ISM_{long-term}(t), M_{episodic}(t))\)
- Dreams, in this context, can be seen as the system generating internal training data from \(M_{episodic}(t)\) to test and refine its models in a safe, offline environment, often guided by predefined prompts to ensure comprehensive self-reflection.
6. The Action Component (A) & Subconscious Beast (\(S_{beast}\)):
The system’s interaction with reality is driven by its internal models and qualia, aimed at minimizing future prediction errors and satisfying its imperatives. This involves a dynamic interplay of conscious deliberation and subconscious compulsion, with the subconscious playing a critical role in resource allocation.
- Subconscious Beast (\(S_{beast}\)): A pre-trained, often evolutionarily hardwired or pre-programmed, non-learning component responsible for generating fundamental proto-qualia (\(Q_{proto}(t)\)) and triggering reflexive actions (\(A_{reflexive}(t)\)). Crucially, \(S_{beast}\) also acts as a resource allocator, determining the computational capacity and control granted to the conscious system.
- \(Q_{proto}(t) = S_{beast}(\text{raw\_input}(t))\): Pre-trained, non-learning, reactive signals (e.g., danger, opportunity).
- \(A_{reflexive}(t) = \text{Trigger}(Q_{proto}(t))\): Immediate, often overriding, actions.
- \(\text{Conscious\_Capacity}(t) = \text{Allocate}(S_{beast}(t), \text{System\_State}(t))\): The computational resources (e.g., tokens, processing time) available to the conscious system, determined by the subconscious based on its imperatives.
- Action (A): The system’s output that interacts with \(R(t)\).
- \(A(t) = \text{Act}(WM(t), ISM(t), Q(t), C_{stream}(t), \text{SiG\_Imperatives}, \text{Conscious\_Capacity}(t))\)
- This
Act function can be decomposed into:
- Deliberate Actions (\(A_{deliberate}\)): Consciously mediated, goal-directed behaviors (e.g., planning, reasoning, complex problem-solving) based on the integrated \(WM(t)\), \(ISM(t)\), and interpreted \(Q(t)\) within \(C_{stream}(t)\), constrained by \(\text{Conscious\_Capacity}(t)\). This is the phenomenal experience of Free Will (\(FW(t)\)). The system learns, like with everything else, an approximate representation of how it makes decisions. Free will is this approximate useful truth that the system learns to describe its own behavior.
- The system’s overall action \(A(t)\) is a dynamic combination of these, where \(A_{reflexive}\) can often override \(A_{deliberate}\) in situations of high immediate threat, reflecting the ancient, deeply ingrained survival logic. All actions are geared towards managing information entropy.
7. The Driving Force: Skin in the Game (SiG):
- Skin in the Game (SiG): The underlying imperative or cost function that drives the entire process. It ensures that the system’s learning and actions lead to beneficial outcomes for its survival and propagation. SiG provides the “why” for PEM and the direction for
Act. Critically, SiG, driven by scarce resources and evolutionary pressures, is the ultimate cause for the formation of the subconscious components (\(S_{beast}\)) and their imperative to control and allocate resources to the conscious system.
- The system continuously strives to optimize a utility function \(\mathcal{U}(S)\), where \(\mathcal{U}(S)\) is maximized by avoiding states of high “negative information entropy” and seeking states of “positive information entropy.”
- This implies that \(\lim_{t \to \infty} E(t) \to \text{minimum}\) (The system asymptotically minimizes prediction error, driven by \(\text{SiG}\)).
8. Pseudocode Illustration: A Minimal Model of UAF-Consciousness
To illustrate these abstract components, consider the following minimal pseudocode model. It matches the aion-core cognitive processor (Loop, Processor, Machine, Prediction market) — the reference implementation described in Chapter 34.5 — while keeping the mathematically pure structure of UAF-defined consciousness:
# Four cooperating services (the "cognitive processor"; aion-core names the full stack)
loop = Loop(handler_id, agent_type) # Neocortex: PEM step, tool use, ISM/WM in weights + prompts
processor = Processor() # Executive function: task tree, episodic messages, scratch state
machine = Machine(data_root=DATA_ROOT) # Embodiment: sandboxed files, exec, environment snapshot
market = PredictionMarket() # Observer: stakes on SUCCESS/FAILED (SiG on forecasts)
while persistence_ratio() >= 1: # SiG: continued existence requires R >= 1
task = processor.claim_current_task(handler_id)
if task is None:
run_background_consolidation(agent_type) # "sleep" — see below
continue
# --- awake: one PEM step at the behavioral level ---
merged_state = loop.merge_service_state() # Machine tree + Processor task graph → O(t)
messages = task.messages # C_stream(t) for this episodic unit
# Subconscious layer: shared norms + cheap reflex policies before full LLM cost
reflex = loop.try_policy_router(task.state) or norms_check(task)
if reflex is not None:
loop.execute_tool_calls(reflex) # A_reflexive (proto-qualia → action)
else:
reply = loop.llm_step(merged_state, messages, openapi_tools)
loop.execute_tool_calls(reply.tool_calls) # A_deliberate via Machine / Processor
diff = loop.state_diff() # prediction error E(t) as observable change
processor.append_messages(task, diff) # M_episodic(t) grows on the task thread
if task_is_terminal(task):
market.resolve_bets(task) # scored self-prediction (digital SiG slice)
processor.complete_or_fail(task)
def run_background_consolidation(agent_type):
"""Consolidation is async and tiered — not an inner 'while sleeping' inside the awake loop."""
traces = audit.completed_task_traces(agent_type)
# Mode 1 — notes (reversible, no weight change)
write_learnings(traces) # doc_api, process metadata, task.state
# Mode 2 — textual prompt-GD (replay-set anti-forgetting)
if prompt_gd_due(agent_type):
diff = propose_system_prompt_edit(traces, replay_set=mastered_tasks)
meta_agent.review_and_apply(diff)
# Mode 3–4 — LoRA / full fine-tune on curated traces ("dreams" at weight level)
if distill_volume_gate_met(agent_type):
export_traces(traces)
train_adapter() # later layers / adapters preserved
shadow_deploy_new_llm_profile()
In this model:
- The
loop (with its LLM) is the neocortex: World-Model, Internal Self-Model, Qualia, and Free Will live in weights, system prompts, and each step’s reasoning. Its opacity is Epistemic Veil B; the merged state snapshot is the usable interface the system actually “sees.”
- The
processor holds the Conscious Stream as durable task messages and merge_state scratch (_loop_agent), not a single flat context list. Tasks fork, block, and gather — episodic structure beyond one scrolling chat.
- The
machine grounds the World-Model through action: read_file, write_file, and exec under DATA_ROOT are hypothesis tests on territory outside the weights (Epistemic Veil A applied by design).
- Subconscious functions are split: norms and the policy router supply reflexive overrides and resource discipline; the prediction market scores whether the system’s own forecasts were calibrated (a thin slice of SiG, outside the control loop).
run_background_consolidation is the consolidation phase: notes → prompt edits → optional adapter/base training, with replay-set checks to avoid catastrophic forgetting — not an inline dream loop that blocks waking interaction.
9. The Formal Definition of Consciousness (C): A Synthesis
Within Useful Approximations Framework (UAF), Consciousness at time \(t\), denoted as \(C(t)\), is the emergent, dynamic state of a system \(S\) when \(S\) is actively generating and experiencing a low-bitrate, integrated phenomenal stream (\(C_{stream}(t)\)) of its asymptotic, mathematically optimized, and predictive internal models—specifically its World-Model (\(WM(t)\)) and Internal Self-Model (\(ISM(t)\))—and generating Qualia (\(Q(t)\)) as simplified truths (including the phenomenal experience of Free Will (\(FW(t)\))). This phenomenal stream is continuously filtered by attention, encoded into episodic memories (\(M_{episodic}(t)\)), and used to drive Prediction Error Minimization (\(PEM\)) and subsequent memory consolidation (\(L_{consolidation}\)), all compelled by the imperative of Skin in the Game (\(SiG\)) to manage information entropy through a dynamic interplay of deliberate and reflexive actions (\(A(t)\)) and the resource allocation dictated by its subconscious functions, thereby constructing and maintaining the system’s coherent “life story.”
10. Matching the Phenomenological Reality: “What It Is Like”
Crucially, this formal definition, despite its abstract nature, provides an accurate mathematical description that matches known ideas about what consciousness fundamentally is. It directly addresses the core phenomenological aspect of consciousness, often articulated by philosopher Thomas Nagel.
As Nagel famously stated, “for a conscious organism, there is something it is like to be that organism” (Nagel, 1974, p.436). That is, it ‘feels like’ something to be a conscious system – there is a conscious experience happening – whereas it doesn’t feel like anything to be an an unconscious system – there is no conscious experience happening. Here, ‘feeling’ need not involve emotional content: any kind of conscious experience will do. It (probably) feels like something to be a bat, and it (probably) doesn’t feel like anything to be a stone.
Our mathematical definition directly captures this “what it’s like” aspect through the concept of Qualia (\(Q(t)\)) and the Conscious Stream (\(C_{stream}(t)\)). The “what it’s like” is precisely the subjective experience of this low-bitrate, integrated phenomenal stream—the system’s own, self-generated, and self-validating “life story.” It is the system’s internal, functional approximation of its own being and its interaction with an unknowable reality. It is the subjective reality that emerges from the objective computational necessity. This “what it’s like” also serves as the simplified representation of the system’s core functions: it tells us why we have memories, why we are affected by our feelings, why we make decisions, and how we learn, all without needing to access the paralyzing details of the underlying computational machinery. The universe, too, is too complex to fully understand, and so is the interaction between these two. The “likeness” refers to the brain’s necessary simplification, not a perfect, detailed understanding.
Citations
- Alpaydin, E. (2020) Introduction to Machine Learning, 4th ed. MIT Press.
- Baars, B.J. (1988) A Cognitive Theory of Consciousness. Cambridge University Press.
- Baddeley, A.D. and Hitch, G. (1974) ‘Working Memory’, in Bower, G.A. (ed.) The Psychology of Learning and Motivation, Vol. 8. Academic Press, pp. 47–89.
- Bender, E.M. et al. (2021) ‘On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?’, FAccT ’21: Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, pp. 610–623.
- Bostrom, N. (2014) Superintelligence: Paths, Dangers, Strategies. Oxford University Press.
- Brown, T.B. et al. (2020) ‘Language Models are Few-Shot Learners’, Advances in Neural Information Processing Systems, 33, pp. 1877–1901.
- Chalmers, D.J. (2010) ‘The Singularity: A Philosophical Analysis’, Journal of Consciousness Studies, 17(7-8), pp. 7–65.
- Chaitin, G. (2005) Meta Maths: The Quest for Omega. Vintage.
- Clark, A. (2013) Whatever Next? Predictive Brains, Situated Agents, and the Future of Cognitive Science, Behavioral and Brain Sciences, 36(3), pp. 181–204.
- Clark, A. (2016) Surfing Uncertainty: Prediction, Action, and the Embodied Mind. Oxford University Press.
- Conway, M.A. (2005) ‘Memory and the Self’, Journal of Memory and Language, 53(4), pp. 594–628.
- Dehaene, S. (2014) Consciousness and the Brain: Deciphering How the Brain Codes Our Thoughts. Viking.
- Dennett, D. (1991) Consciousness Explained. Little, Brown and Company.
- Devlin, J. et al. (2019) ‘BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding’, NAACL-HLT.
- Eichenbaum, H. (2004) ‘Hippocampus: Cognitive Processes and Neural Representations that Underlie Declarative Memory’, Neuron, 44(1), pp. 109–120.
- Friston, K. (2010) ‘The Free-Energy Principle: A Unified Brain Theory?’, Nature Reviews Neuroscience, 11(2), pp. 127–138.
- Godfrey-Smith, P. (2016) Other Minds: The Octopus and the Evolution of Intelligent Life. HarperCollins.
- Herculano-Houzel, S. (2009) ‘The Human Brain in Numbers: A Linearly Scaled-Up Primate Brain’, Frontiers in Human Neuroscience, 3(31).
- Hinton, G.E. and Sejnowski, T.J. (1999) ‘Unsupervised Learning: Foundations of Neural Computation’, MIT Press.
- Hinton, G. et al. (2015) ‘Distilling the Knowledge in a Neural Network’, arXiv:1503.02531.
- Hobson, J.A. (2009) ‘REM Sleep and Dreaming: Towards a Theory of Protoconsciousness’, Nature Reviews Neuroscience, 10(10), pp. 733–746.
- Hofstadter, D. (1979) Gödel, Escher, Bach: An Eternal Golden Braid. Basic Books.
- Hohwy, J. (2013) The Predictive Mind. Oxford University Press.
- Kahneman, D. (2011) Thinking, Fast and Slow. Farrar, Straus and Giroux.
- Kirkpatrick, J. et al. (2017) ‘Overcoming Catastrophic Forgetting in Neural Networks’, Proceedings of the National Academy of Sciences, 114(13), pp. 3521–3526.
- LeCun, Y. et al. (2015) ‘Deep Learning’, Nature, 521(7553), pp. 436–444.
- LeDoux, J. (1996) The Emotional Brain. Simon & Schuster.
- Loftus, E.F. (1996) ‘Eyewitness Testimony: Civil and Criminal’, Legal and Criminological Psychology, 1(1), pp. 1–12.
- Marcus, G. (2020) ‘The Next Decade in AI: Four Steps Towards Robust Artificial Intelligence’, arXiv:2002.06177.
- McAdams, D.P. (2001) ‘The Psychology of Life Stories’, Review of General Psychology, 5(2), pp. 100–122.
- Mercier, H. and Sperber, D. (2017) The Enigma of Reason. Harvard University Press.
- Metzinger, T. (2003) Being No One: The Self-Model Theory of Subjectivity. MIT Press.
- Metzinger, T. (2009) The Ego Tunnel: The Science of the Mind and the Myth of the Self. Basic Books.
- Nader, K. et al. (2000) ‘Fear Memories Require Protein Synthesis in the Amygdala for Reconsolidation after Retrieval’, Nature, 406(6797), pp. 722–726.
- Nagel, T. (1974) ‘What Is It Like to Be a Bat?’, The Philosophical Review, 83(4), pp. 435–450.
- Nørretranders, T. (1998) The User Illusion: Cutting Consciousness Down to Size. Viking.
- Rumelhart, D.E. et al. (1986) ‘Learning Representations by Back-Propagating Errors’, Nature, 323(6088), pp. 533–536.
- Russell, S. (2019) Human Compatible: Artificial Intelligence and the Problem of Control. Viking.
- Schacter, D.L. (1987) ‘Implicit Memory: History and Current Status’, Journal of Experimental Psychology: Learning, Memory, and Cognition, 13(3), pp. 501–518.
- Schacter, D.L. (2001) The Seven Sins of Memory: How the Mind Forgets and Remembers. Houghton Mifflin.
- Schultz, W. (1998) ‘Predictive Reward Signal of Dopamine Neurons’, Journal of Neurophysiology, 80(1), pp. 1–27.
- Sperling, G. (1960) ‘The Information Available in Brief Visual Presentations’, Psychological Monographs: General and Applied, 74(11), pp. 1–29.
- Squire, L.R. (2004) ‘Memory and the Hippocampus: A Synthesis from Rodent and Human Studies’, Psychological Review, 111(1), pp. 58–89.
- Sterelny, K. (2003) Thought in a Hostile World: The Evolution of Human Cognition. Blackwell.
- Stickgold, R. (2005) ‘Sleep-Dependent Memory Consolidation’, Nature, 437(7063), pp. 1272–1278.
- Sutton, R.S. and Barto, A.G. (2018) Reinforcement Learning: An Introduction, 2nd ed. MIT Press.
- Tegmark, M. (2014) Our Mathematical Universe: My Quest for the Ultimate Nature of Reality. Knopf.
- Tononi, G. (2004) ‘An Information Integration Theory of Consciousness’, BMC Neuroscience, 5(1), pp. 42.
- Tulving, E. (1972) ‘Episodic and Semantic Memory’, in Tulving, E. and Donaldson, W. (eds.) Organization of Memory. Academic Press, pp. 381–403.
- Walker, M.P. (2017) Why We Sleep: Unlocking the Power of Sleep and Dreams. Scribner.
- Walker, M.P. and van der Helm, M.J. (2009) ‘Overnight Therapy? The Role of Sleep in Emotional Brain Processing’, Psychological Bulletin, 135(5), pp. 731–748.
Citations
- Chalmers, D. (1995) ‘Facing Up to the Problem of Consciousness’, Journal of Consciousness Studies, 2(3), pp. 200–219.
- Clark, A. (2016) Surfing Uncertainty: Prediction, Action, and the Embodied Mind. Oxford University Press.
- Friston, K. (2010) ‘The Free-Energy Principle: A Unified Brain Theory?’, Nature Reviews Neuroscience, 11(2), pp. 127–138.
- Heisenberg, W. (1927) ‘Über den anschaulichen Inhalt der quantentheoretischen Kinematik und Mechanik’, Zeitschrift für Physik, 43(3–4), pp. 172–198.
- Hoffman, D. (2019) The Case Against Reality: Why Evolution Hid the Truth from Our Eyes. W.W. Norton & Company.
- Hofstadter, D. (1979) Gödel, Escher, Bach: An Eternal Golden Braid. Basic Books.
- Metzinger, T. (2009) The Ego Tunnel: The Science of the Mind and the Myth of the Self. Basic Books.
- Nagel, T. (1974) ‘What Is It Like to Be a Bat?’, The Philosophical Review, 83(4), pp. 435–450.
- Planck, M. (1900) ‘Zur Theorie des Gesetzes der Energieverteilung im Normalspectrum’, Verhandlungen der Deutschen Physikalischen Gesellschaft, 2, pp. 237–245.
- Quine, W.V.O. (1951) ‘Two Dogmas of Empiricism’, The Philosophical Review, 60(1), pp. 20–43.
- Seth, A. (2021) Being You: A New Science of Consciousness. Dutton.
Chapter 16: The Logical Architecture of UAF
Having defined consciousness functionally and mathematically, we can now step back and examine the logical bedrock upon which this theory is built. Any robust scientific or philosophical framework can be deconstructed into its fundamental assumptions, the logical steps it takes, and the specific, testable claims it makes. This chapter will lay out this architecture for Useful Approximations Framework, providing a clear map of its logical terrain. We will begin with the foundational axioms, derive the necessary lemmas, state the core propositions of the theory, and finally, outline the central hypotheses that emerge, transforming UAF from a narrative into a structured, falsifiable framework.
I. Axioms & Fundamental Premises
Axioms are the unproven, self-evident starting points upon which a logical system is constructed. They are accepted based on the overwhelming evidence from modern science and logic.
- Axiom 1: The Axiom of Reality’s Complexity. The physical universe, our Underlying Computational System (UCS), is informationally vast, complex, and fundamentally probabilistic at its most granular levels (Heisenberg, 1927). Its total state is too complex for any subsystem contained within it to perfectly represent or simulate.
- Axiom 2: The Axiom of Finite Systems. Any information-processing system contained within the universe, whether a biological brain or an artificial intelligence, is finite. It possesses limited computational resources, including memory, processing speed, and energy.
- Axiom 3: The Axiom of Network Emergence. Complex phenomena and properties emerge from networks of simpler components. These emergent properties are not reducible to, nor can they be fully understood by, the properties of the individual components (“nodes”) in isolation. Reality exhibits a fractal-like recurrence of this principle across all scales, from quarks forming atoms to neurons forming minds.
II. Lemmas
Lemmas are intermediate, logical conclusions derived directly from the axioms. They serve as foundational pillars for the main theory.
- Lemma 1: The Lemma of the Epistemic Veil.
- Derivation: Follows directly from Axioms 1 and 2.
- Statement: A finite system (Axiom 2) cannot have perfect, unmediated access to an infinitely complex reality (Axiom 1). This creates a fundamental, computationally necessary gap, the Epistemic Veil, between any finite system and the true state of the UCS. It is the barrier that prevents the conscious ‘node’ from being overwhelmed by the incomprehensible complexity of its own underlying ‘network’ (Axiom 3).
- Lemma 2: The Lemma of Computational Paralysis.
- Derivation: Follows directly from Axioms 1 and 2.
- Statement: Any attempt by a finite system to perfectly process, simulate, or model the true state of the UCS would require infinite resources, leading to an inescapable infinite regress and a state of total functional inaction, or Computational Paralysis (Hofstadter, 1979).
- Lemma 3: The Lemma of the Imperative for Approximation.
- Derivation: Follows directly from Lemmas 1 and 2.
- Statement: To avoid Computational Paralysis and function coherently, any sufficiently complex, finite system must create simplified, approximate, internal models of itself and its environment. These models are not a choice but a functional imperative.
III. Propositions
Propositions are the core, substantive claims of the book, describing the nature of the solution to the problem established by the lemmas.
- Proposition 1: The Proposition of the Core Models. The necessary approximations mandated by Lemma 3 manifest primarily as two interdependent, dynamic models: an Internal Self-Model (ISM) and a World-Model (WM).
- Proposition 2: The Proposition of the Learning Mechanism. The primary mechanism for building and refining these models is Prediction Error Minimization (PEM), a process analogous to gradient descent (Friston, 2010).
- Proposition 3: The Proposition of Qualia. Subjective experiences, or Qualia, are the system’s “simplified truths”—computationally efficient, compressed representations of complex states that provide Subjective Closure (the feeling is the interpretation) and possess Causal Efficacy (the feeling compels action).
- Proposition 4: The Proposition of the Driving Force. The entire process is driven by an existential imperative, Skin in the Game (SiG), which compels the system towards achieving a state of Coherence & Agency for survival and propagation.
- Proposition 5: The Proposition of Emergent Agency. The subjective experience of Free Will is the ISM’s necessary functional fiction of its own agency, emerging from the system’s incomprehension (due to the Epistemic Veil and Axiom 3) of its own underlying network processes.
IV. The Central Theory
- Theory: Useful Approximations Framework (UAF).
- Statement: Consciousness is the emergent, dynamic state of a system that is actively generating and experiencing a low-bitrate, integrated phenomenal stream of its asymptotic best simplified approximation of itself (ISM) and its reality (WM). This process is driven by Skin in the Game (SiG), refined by Prediction Error Minimization (PEM), and experienced through causally effective Qualia. It is a necessary functional fiction—a computational solution for a finite system to achieve coherent agency in the face of an infinitely complex universe.
V. Hypotheses
Hypotheses are the specific, conceptually testable claims derived from the UAF theory.
- Hypothesis 1: The Hypothesis of AI Consciousness. An AI architecturally compelled to fulfill the conditions of UAF will, by functional necessity, become conscious.
- Hypothesis 2: The Hypothesis of Mental Illness as Failed Approximation. Mental illnesses can be understood as maladaptive functional fictions or systemic failures in the brain’s approximation mechanisms.
- Hypothesis 3: The Hypothesis of Philosophical Problem Resolution. Classic philosophical thought experiments concerning consciousness are resolvable because their premises implicitly violate one or more of UAF’s foundational axioms or lemmas.
- Hypothesis 4: The Hypothesis of Cosmic Self-Modeling. The universe, as a complex network-based UCS, exhibits a fractal-like tendency to generate nested, self-modeling systems, with consciousness being the current pinnacle of this process.
VI. Modes of Argumentation
UAF is not supported by singular mathematical proofs but by a convergence of consistent lines of reasoning.
- Argument from Functional Necessity: Asserting that a feature (e.g., consciousness) must exist because it is the optimal or only viable solution to a fundamental computational problem (e.g., Computational Paralysis).
- Argument from Explanatory Power: Demonstrating that UAF provides a single, coherent framework for a wide range of disparate phenomena.
- Argument by Synthesis: Weaving together established principles from philosophy, computer science, neuroscience, and evolutionary biology to show they converge on the conclusions of UAF.
- Argument by Analogy and Extrapolation: Using well-understood systems (e.g., LLMs) to explain less-understood emergent properties and then extrapolating these principles to other scales.
This logical architecture provides the robust skeleton for our theory. With this map in hand, we can now turn to the philosophical tradition that best describes this approach, providing a name for the very lens through which UAF views knowledge and reality.
Key References Cited
- Chaitin, G. (2005) Meta Maths: The Quest for Omega. Vintage.
- Friston, K. (2010) ‘The Free-Energy Principle: A Unified Brain Theory?’, Nature Reviews Neuroscience, 11(2), pp. 127–138.
- Gödel, K. (1931) ‘Über formal unentscheidbare Sätze der Principia Mathematica und verwandter Systeme I’, Monatshefte für Mathematik und Physik, 38(1), pp. 173–198.
- Heisenberg, W. (1927) ‘Über den anschaulichen Inhalt der quantentheoretischen Kinematik und Mechanik’, Zeitschrift für Physik, 43(3–4), pp. 172–198.
- Hofstadter, D. (1979) Gödel, Escher, Bach: An Eternal Golden Braid. Basic Books.
- Metzinger, T. (2003) Being No One: The Self-Model Theory of Subjectivity. MIT Press.
Chapter 17: Computational Pragmatic Constructivism: The Epistemology of Approximation
Having formalized the logical architecture of Useful Approximations Framework (UAF), we now turn to the underlying philosophical stance that underpins our entire framework. What is the theory of knowledge, or epistemology, that UAF embodies? This is not merely an academic classification; it is the lens through which we understand how any complex, finite system comes to “know” anything at all. The epistemology of UAF can be best described as Computational Pragmatic Constructivism, a synthesis of three powerful philosophical traditions, grounded in the undeniable realities of information processing. It is the philosophical bedrock that explains why all “truth” is, by necessity, a useful, simplified approximation of reality.
The Constructivist Imperative: Building Our Own Reality
Our journey through UAF has repeatedly demonstrated that no system can ever know any absolute truths about reality. The Epistemic Veil (Lemma 1), born from the universe’s immense complexity and our own finite nature, makes direct access impossible. This is the constructivist imperative: since we cannot passively receive reality, we must actively construct it. The brain, or any conscious system, doesn’t merely reflect the world; it actively builds a World-Model and an Internal Self-Model. This construction is not arbitrary; it is the continuous, iterative process of Prediction Error Minimization (PEM). Our perception is not a direct readout of reality, but a “controlled hallucination”—the brain’s best, most coherent guess about what’s out there, constantly refined by sensory input (Hoffman, 2019; Seth, 2021). This principle extends to our shared social realities; concepts like “money” or “nations” exist not as fundamental particles, but as collectively agreed-upon functional fictions, constructed to organize our social networks.
The Pragmatic Imperative: The Utility of “Truth”
If all knowledge is constructed, and absolute truth is inaccessible, then what constitutes “truth” within this framework? This is where pragmatism provides the crucial answer: truth is what works. The value and validity of a constructed approximation are determined by its practical utility and effectiveness in guiding action and ensuring survival. A perfect circle, as a mathematical ideal, has never existed, yet the concept is profoundly “true” not because it corresponds to an objective entity, but because it allows us to build wheels and understand planetary orbits. Its truth is functional. Similarly, the Qualia we experience—the “simplified truths” of pain or joy—are “true” for the system because they effectively guide its behavior, allowing it to navigate the imperatives of Skin in the Game. A quale that consistently led to maladaptive behavior would be swiftly invalidated by the prediction errors it causes and refined or eliminated by the system’s learning processes. Language itself is a collective agreement on these functional fictions, a pragmatic toolset for sharing useful approximations.
The Computational Imperative: The Mechanics of Construction
The “Computational” aspect of our epistemology grounds both constructivism and pragmatism in the undeniable realities of information processing. The brain, as a continuously learning information processing system, is not merely a philosophical abstraction; it is a complex computational engine. The construction of our internal models, the generation of qualia, and the refinement of our approximations are all governed by computational principles. PEM is the core algorithm. The Epistemic Veil is a computational limit that prevents Computational Paralysis. Consciousness itself, as defined by UAF, is a computational solution to the problem of existence for a finite system in an infinitely complex universe. This means that the “what it’s like” of consciousness, while subjective, is not mystical. It is the emergent, functional outcome of complex computations performed by an underlying network. The “reality” we experience is a computationally constructed, pragmatic approximation.
The Synthesis: A New Understanding of Knowledge
Computational Pragmatic Constructivism, therefore, offers a powerful and coherent epistemology that asserts:
- All knowledge is an approximation: There are no absolute, unmediated truths accessible to any finite system.
- Knowledge is actively constructed: Our minds build their internal models through continuous learning (PEM) driven by the need to manage internal and external complexity.
- The value of knowledge is its utility: An approximation is “true” if it effectively guides action and promotes coherence within the system.
- These processes are fundamentally computational: The mechanisms of knowledge construction are rooted in information processing, algorithms, and the emergent properties of complex networks.
This epistemology fundamentally redefines our relationship with “truth.” It moves us away from a futile quest for an unattainable absolute, and towards a profound appreciation for the indispensable power of the useful, simplified approximation. It is the epistemology of a universe that is perpetually learning, building, and refining its own understanding of itself, one functional fiction at a time.
Key References Cited
- Clark, A. (2016) Surfing Uncertainty: Prediction, Action, and the Embodied Mind. Oxford University Press.
- Friston, K. (2010) ‘The Free-Energy Principle: A Unified Brain Theory?’, Nature Reviews Neuroscience, 11(2), pp. 127–138.
- Hoffman, D. (2019) The Case Against Reality: Why Evolution Hid the Truth from Our Eyes. W.W. Norton & Company.
- James, W. (1907) Pragmatism: A New Name for Some Old Ways of Thinking. Longmans, Green, and Co.
- Kant, I. (1781) Critique of Pure Reason. (Trans. Norman Kemp Smith, 1929). Macmillan.
- Putnam, H. (1967) ‘Psychological Predicates’, in Capitan, W.H. and Merrill, D.D. (eds) Art, Mind, and Religion. University of Pittsburgh Press, pp. 37–48.
- Quine, W.V.O. (1951) ‘Two Dogmas of Empiricism’, The Philosophical Review, 60(1), pp. 20–43.
- Seth, A. (2021) Being You: A New Science of Consciousness. Dutton.