Chapter 5: Geometric Characterization of Free Will—Information-Theoretic Foundation of Empowerment and Causal Control
Introduction: Millennia-Long Confusion of Free Will
“Do I really have free will?”
This question has troubled philosophers for millennia. Determinists argue: Universe is causal chain, each “choice” is inevitable result of past states—free will is illusion. Non-determinists counter: Quantum uncertainty, chaotic dynamics prove future undetermined—free will has space.
But this debate deadlocked, because lacks operational definition: What is “freedom”? How to measure? How to test?
This chapter will start from information geometry and causal theory, give operational geometric characterization of free will:
where defined as:
That is “maximum mutual information of causal influence on future state through action ”.
This definition transforms abstract “freedom” concept into measurable, optimizable, physically constrained geometric quantity.
graph TB
subgraph "Hierarchical Structure of Free Will"
A["Physical Layer<br/>Non-Equilibrium Supply Δ F≠0"]
B["Causal Layer<br/>Intervention Channel P(S'|do(A))≠P(S')"]
C["Information Layer<br/>Empowerment E<sub>T</sub>>0"]
D["Geometric Layer<br/>Causal Control Manifold (M,G)"]
E["Experience Layer<br/>Subjective Sense of Freedom"]
end
A --> B
B --> C
C --> D
D --> E
style C fill:#e1f5ff
style D fill:#fff4e1
style E fill:#ffe1f5
Core Insight: Triple Constraints of Free Will
Physical Foundation Theorem: Observer has operational freedom, if and only if simultaneously satisfies:
- Controllability: Exists non-degenerate intervention channel
- Non-Equilibrium Supply: Exists steady-state free energy flow
- Barrier Separation: Exists Markov blanket , making influence of internal states on outcomes distinguishable
Meaning: Free will not “uncaused cause”, but causal intervention ability under physical constraints—it needs energy, needs channel, needs information, but within these constraints, it is real, measurable.
Part One: Empowerment—Information Measure of Causal Control
1.1 From Mutual Information to Directed Information
Classic Mutual Information:
Measures “correlation” of two random variables, but doesn’t distinguish causal direction: .
Directed Information (Massey, 1990):
For sequences and , define:
where , .
Meaning: measures “cumulative causal influence of on ”—it considers temporal order, distinguishes causal direction.
Properties:
- (asymmetric)
- if and only if holds for all (no causal influence)
graph LR
subgraph "No Causal Influence"
A1["M<sub>1</sub>"] --> A2["M<sub>2</sub>"] --> A3["M<sub>3</sub>"]
B1["Y<sub>1</sub>"] --> B2["Y<sub>2</sub>"] --> B3["Y<sub>3</sub>"]
end
subgraph "With Causal Influence"
C1["M<sub>1</sub>"] --> C2["M<sub>2</sub>"] --> C3["M<sub>3</sub>"]
D1["Y<sub>1</sub>"] --> D2["Y<sub>2</sub>"] --> D3["Y<sub>3</sub>"]
C1 -.->|causal| D2
C2 -.->|causal| D3
end
E["I(M<sup>n</sup>→Y<sup>n</sup>)=0"] -.corresponds to.- A3
F["I(M<sup>n</sup>→Y<sup>n</sup>)>0"] -.corresponds to.- C3
style A3 fill:#fff4e1
style C3 fill:#ffe1f5
1.2 Definition and Physical Meaning of Empowerment
Definition 1.1 (-Step Empowerment) (Klyubin, Polani, Nehaniv, 2005)
At time , observer’s -step Empowerment defined as:
where:
- is action sequence
- are initial and terminal perceptual states
- means taking supremum over all possible strategies
Physical Meaning:
- measures “maximum ability of observer to exert distinguishable influence on future state through -step actions”
- Equivalent to supremum of “action–sensor channel capacity”
- When , observer’s actions have no detectable influence on future state—completely no freedom
Analogy: Imagine person locked in room. If pushing wall, shouting, knocking door cannot change any observable result (wall doesn’t move, no one hears), then —he “has no freedom”. Conversely, if pushing door can open, shouting can attract rescue, then —he “has freedom”.
1.3 Empowerment and Causal Intervention
In Pearl causal framework, intervention operator means “externally force to take certain value”, different from conditioning .
Proposition 1.1 (Empowerment and Intervention Distinguishability)
If exists such that:
then . Conversely, if for all above equality holds, then .
Proof Sketch: Intervention distinguishable actions carry causal information .
Meaning: Positive Empowerment is necessary and sufficient condition for causal intervention detectable. This bridges “free will” with Pearl causal theory.
1.4 Geometric Characterization of Empowerment
On control manifold , Empowerment can be represented as curvature quantity on information geometry:
Proposition 1.2 (Fisher Metric Representation of Empowerment)
Under local linearization approximation, -step Empowerment can be written as:
where:
- is Jacobian matrix (causal sensitivity)
- is covariance of action space
- is identity matrix
Meaning: Empowerment proportional to log determinant of “causal sensitivity matrix”—geometrically, it measures “local volume amplification rate from action space to state space”.
Analogy: Imagine joystick controlling robot. If slight movement of joystick causes large change in robot position, then singular values of Jacobian large, Empowerment high—operator “has power”. Conversely, if pushing joystick hard, robot doesn’t move, then , —operator “powerless”.
Part Two: Physical Foundation Theorem—Thermodynamic Constraints of Free Will
2.1 Feedback Thermodynamics and Information–Work Duality
Classic Second Law of Thermodynamics:
That is average work at least equals free energy change.
Generalized Second Law with Measurement–Feedback (Sagawa–Ueda, 2008):
where is mutual information acquired by measurement.
Meaning: Information can be converted to work—Maxwell’s demon doesn’t violate second law, but needs pay entropy cost of “erasing information” (Landauer principle).
Feedback Jarzynski Equality:
This is probability-1 identity, stronger than inequality.
graph LR
A["Measurement M<br/>Acquire Mutual Information I"] --> B["Feedback Control<br/>Choose Strategy π"]
B --> C["Apply Work W<br/>Change System"]
C --> D["Free Energy Change Δ F"]
E["Thermodynamic Constraint<br/>⟨W⟩≥Δ F-k<sub>B</sub>T⟨I⟩"]
A -.information.-> E
D -.energy.-> E
style A fill:#e1f5ff
style C fill:#ffe1f5
style E fill:#fff4e1
2.2 Markov Blanket and Barrier Separation
Definition 2.1 (Markov Blanket) (Pearl, 2000; Friston, 2010)
For random variable set , say is Markov blanket of relative to , if:
That is internal states and external states conditionally independent on blanket.
Physical Meaning:
- : Observer internal states (like neural activity, memory)
- : External environment states
- : Actions (output)
- : Observations (input)
Markov blanket is “system boundary”—internal–external interaction must pass through this boundary.
Proposition 2.1 (Barrier Separation Condition)
If exists Markov blanket, and:
then action carries distinguishable causal information of internal states on external states.
Meaning: Markov blanket cannot “completely block” causal flow—otherwise changes in internal states have no influence on external states, Empowerment zero.
2.3 Physical Foundation Theorem
Theorem 2.1 (Physical Foundation of Free Will)
Observer has operational freedom on time domain , if and only if simultaneously satisfies:
(i) Controllability: Exists non-degenerate intervention channel , i.e.:
(ii) Non-Equilibrium Supply: Exists sustained free energy flow or entropy flow, satisfying:
That is system not in thermal equilibrium.
(iii) Barrier Separation: Exists Markov blanket , making causal influence of internal states on external states detectable through action :
Then exists strategy such that:
And constrained by thermodynamic inequality .
Proof Idea:
-
Controllability Empowerment Positive: Non-degenerate channel guarantees (channel capacity non-zero)
-
Barrier Separation Directed Information Positive: Markov blanket allows , therefore (Massey directed information)
-
Non-Equilibrium Supply Thermodynamically Feasible: Sagawa–Ueda feedback Jarzynski equality guarantees work–information duality consistency
Meaning: This theorem transforms “free will” from philosophical concept into conjunction of three operational physical conditions. Each condition can be experimentally tested.
2.4 Thermodynamic Cost of Free Will
Corollary 2.2 (Energy Lower Bound of Freedom)
If observer maintains Empowerment within time , must consume minimum average work:
Meaning: Maintaining free will requires sustained energy supply. When energy exhausted (like fatigue, sleep, coma), Empowerment tends to zero, free will lost.
Biological Evidence:
- Brain accounts for 2% of body weight, but consumes 20% of basal metabolism—maintaining neural plasticity and action control
- Glucose supply interruption (hypoglycemia) causes confusion, decision ability decline—
- Sleep deprivation reduces prefrontal activity, impulse control weakened—causal control impaired
Part Three: Geometric Structure of Free Will—Control Manifold and Variational Principle
3.1 Definition of Control Manifold
Recall Chapter 0, observer moves on joint manifold . Now, we view Empowerment as scalar field on control manifold :
Definition 3.1 (Causal Control Manifold)
Control manifold is triple:
- : Parameter space (like strategy parameters, control parameters)
- : Complexity metric (like Fisher metric)
- : Empowerment scalar field
On , define Empowerment gradient flow:
where is gradient operator with respect to metric , is learning rate.
Physical Meaning: Observer through gradient ascent, maximizes causal control —this is instinctive drive: Biological systems tend to enhance control ability over environment.
graph TB
subgraph "Empowerment Landscape"
A["Low Empowerment<br/>E<sub>k</sub>≈0<br/>No Control"]
B["Medium Empowerment<br/>E<sub>k</sub>>0<br/>Partial Control"]
C["High Empowerment<br/>E<sub>k</sub>≫0<br/>Strong Control"]
end
A -->|gradient flow| B
B -->|gradient flow| C
D["Constraint<br/>Thermodynamic Cost ⟨W⟩↑"] -.limits.- C
style A fill:#fff4e1
style C fill:#ffe1f5
style D fill:#e1f5ff
3.2 Maximum Empowerment Principle
Hypothesis 3.1 (Maximum Empowerment Hypothesis)
Biological system’s strategy tends to maximize long-term average Empowerment:
Under constraints:
- Resource constraint:
- Thermodynamic constraint:
Theoretical Basis:
- Empowerment maximization can be derived from “maximum entropy reinforcement learning” (Salge et al., 2014)
- Evolutionary pressure favors high-Empowerment strategies (stronger survival ability)
- Neuroscience evidence: Dopamine system encodes “expected control” (Sharot & Sunstein, 2020)
Engineering Applications:
- Robot navigation: Maximize entropy of future reachable positions—Empowerment guides exploration
- Game AI: Learn strategies “controlling key resources”—enhance strategic choice space
- Neural prosthetics: Design interfaces “giving patients maximum control freedom”
3.3 Duality of Variational Free Energy and Empowerment
In free energy principle (Friston, 2010), observer minimizes variational free energy:
where is internal generative model, is true distribution.
Proposition 3.1 (Duality of Free Energy and Empowerment)
Under Markov blanket setting, minimizing variational free energy equivalent to maximizing expected Empowerment:
Under appropriate regularization and time discounting.
Meaning: Free energy principle and Empowerment maximization are different formulations of same optimization objective—former from “prediction error minimization” perspective, latter from “causal control maximization” perspective.
Part Four: Hierarchical Structure of Free Will—From Determinism to Creativity
4.1 Level Zero: No Freedom ()
Characteristics:
- Observer’s actions have no detectable influence on future state
- Examples: Completely passive observer, patients in coma or deep anesthesia
Extreme Cases:
- Physical death: ,
- Completely deterministic system: Although has “actions”, results completely determined by initial conditions, no real choice
4.2 Level One: Passive Choice (, Single Peak)
Characteristics:
- Empowerment positive but small, choice space limited
- Empowerment landscape has only one global optimum—“only correct choice”
Examples:
- Emergency escape: Only safe exit in fire—has “freedom” (can choose not to run), but rational choice unique
- Chess endgame: Master facing “forced win”—has “freedom” to choose other moves, but only one optimal solution
Geometric Picture: Empowerment landscape as below, only one sharp peak.
4.3 Level Two: Active Choice (, Multiple Peaks)
Characteristics:
- Empowerment positive and large, choice space rich
- Empowerment landscape has multiple local optima—“multiple paths all viable”
Examples:
- Career choice: Multiple career paths can achieve “controlling life” goal
- Artistic creation: Multiple styles can express theme—creative choice
Geometric Picture: Empowerment landscape has multiple peaks, observer can “explore” between different peaks.
graph TB
subgraph "Level One: Single Peak Landscape"
A1["Unique Optimum"]
A2["Suboptimum"] --> A1
A3["Suboptimum"] --> A1
end
subgraph "Level Two: Multiple Peak Landscape"
B1["Local Optimum 1"]
B2["Local Optimum 2"]
B3["Local Optimum 3"]
B4["Valley"] --> B1
B4 --> B2
B4 --> B3
end
style A1 fill:#fff4e1
style B1 fill:#ffe1f5
style B2 fill:#ffe1f5
style B3 fill:#ffe1f5
4.4 Level Three: Meta-Freedom ()
Characteristics:
- Not only choose action , but also choose “how to modify own control manifold ”
- Observer changes own strategy space, value function, goals—“reshaping self”
Examples:
- Learning new skills: Expand action space , thus enhance Empowerment
- Value reassessment: Change objective function , thus change Empowerment landscape
- Self-transformation: Through training, education, therapy change neural circuits—modify parameter space of
Philosophical Meaning: Meta-freedom is “freedom of freedom”—choosing what kind of chooser to become. This is highest form of human free will.
Part Five: Experimental Testing of Free Will—Operational Protocols
5.1 Behavioral Estimation of Empowerment
Protocol E1 (Two-Alternative Forced Choice Task)
- Present two options
- Observer chooses one
- Record subsequent state
- Repeat trials, estimate conditional distribution
- Calculate Empowerment:
Expected Results:
- If : Actions have no influence on results—no freedom
- If : Actions have distinguishable influence on results—has freedom
5.2 Detection of Causal Intervention
Protocol E2 (do-Calculus Experiment)
- Randomly assign action (external intervention, not subject choice)
- Observe result
- Estimate intervention distribution
- Compare with observational distribution
Criterion:
- If : Exists confounders, non-causal effect
- If : Observational distribution already captures causal effect
Experimental Evidence:
- Drug clinical trials: Randomized controlled trials (RCT) vs observational studies
- Neural modulation: Transcranial magnetic stimulation (TMS) vs natural movement
5.3 Measurement of Thermodynamic Cost
Protocol E3 (Work–Information Martingale Test)
Under continuous monitoring, construct martingale:
where:
- : Cumulative work (can estimate through energy monitoring or metabolic rate)
- : Free energy change
- : Cumulative mutual information
Criterion: Feedback Jarzynski equality requires .
Verification:
- Test if is martingale (conditional expectation )
- Estimate Jensen lower bound whether holds
Biological Applications:
- Measure ATP consumption of neural activity (like fMRI BOLD signal)
- Estimate energy cost of decision process
- Verify hypothesis “high-Empowerment strategies need higher energy”
5.4 Identification of Markov Blanket
Protocol E4 (Conditional Independence Test)
- Define candidate blanket (like: sensory organ input + motor organ output)
- Test conditional independence:
- Use kernel independence test (like HSIC) or Bayesian network inference
Expectation:
- If independence holds: is valid Markov blanket
- If independence doesn’t hold: Need expand blanket (add more mediator variables)
Neuroscience Applications:
- Identify neural representation of “self–world boundary”
- Verify Markov blanket hypothesis of free energy principle
Part Six: Philosophical Postscript of Free Will—Reconciliation of Determinism and Freedom
6.1 Geometric Reconstruction of Compatibilism
Classic Compatibilism (Hume, Ayer): Free will and determinism can be compatible, because “freedom” means “acting according to own will”, not “uncaused cause”.
Extension of This Theory: Free will Empowerment , this completely compatible with determinism, because:
- Causal Chain Unbroken: Empowerment still in causal network—action determined by internal state
- Freedom is Relative: measures “choice space relative to constraints”, not “absolute unconstrained”
- Operational Testable: Doesn’t depend on metaphysical “freedom” concept, only depends on measurable information quantity
Geometric Picture: Deterministic system is geodesic flow on , but “multi-peak structure” of Empowerment landscape allows “branching paths”—within deterministic framework, still has “freedom of choice”.
6.2 From Free Will to Moral Responsibility
Traditional Argument: If no free will, then no moral responsibility—because “cannot choose” means “no need to be responsible”.
Response of This Theory: Moral responsibility doesn’t need “absolute freedom”, only needs:
- Causal Intervention Ability: , actor has distinguishable influence on results
- Predictability: Actor’s internal model can predict consequences of actions
- Adjustability: Actor can modify through learning (meta-freedom)
Corollary:
- Infants or severe mental disorder patients: or missing—exempt from responsibility
- Normal adults: and mature—bear responsibility
- Borderline cases (like mild cognitive impairment): proportional to degree of responsibility—partial responsibility
6.3 Emergence of Free Will
Question: Empowerment defined at macroscopic level (actions , states are coarse-grained variables). Does microscopic particle level have “freedom”?
Answer: Free will is emergent phenomenon:
- Macroscopic Causal Effectiveness: Coarse-grained variables can have higher causal effective information (Tononi, Hoel, 2013)
- Scale-Dependent: Empowerment maximum at appropriate coarse-graining scale—this is natural scale of “free will”
- Downward Causation: Macroscopic intentions constrain microscopic dynamics through neural implementation—although microscopic follows physical laws, macroscopic still has “control”
Philosophical Meaning: Free will not at fundamental particle level, but emerges at appropriate organizational level—like “liquid” not at single water molecule level, but emerges in macroscopic collective behavior.
6.4 Cost and Finitude of Freedom
Existentialist Insight (Sartre): “Man is condemned to freedom”—freedom is burden, because choice means responsibility.
Quantification of This Theory: Cost of freedom is thermodynamic cost and cognitive load:
Corollary:
- When energy exhausted (fatigue) or cognitive overload (decision fatigue), observer tends to “autopilot”—
- High freedom (multiple options, high uncertainty) causes anxiety, paralysis—“paradox of choice”
- Moderate constraints (like habits, norms) can reduce cognitive cost, while preserving core freedom
Conclusion: Geometric Truth of Free Will
This chapter starts from information geometry and causal theory, gives operational definition of free will:
Core Theorems Review:
-
Physical Foundation Theorem (Theorem 2.1): Free will needs triple conditions—controllability, non-equilibrium supply, barrier separation
-
Thermodynamic Cost (Corollary 2.2): —freedom needs energy
-
Compatibilism: Empowerment compatible with determinism—freedom is “choice space within constraints”
-
Hierarchical Structure: No freedom () passive choice active choice meta-freedom
Experimental Path:
- Behavioral estimation: Two-alternative task
- Causal detection: Random intervention
- Thermodynamic verification: Work–information martingale Jarzynski equality
- Neural identification: Markov blanket conditional independence
Philosophical Significance:
- Free will not “uncaused cause”, but causal control under physical constraints
- It is measurable, optimizable, resource-limited—but within these constraints, it is real
- It emerges at appropriate organizational level, not at fundamental particle level
Next chapter (Chapter 6) will explore multi-observer consensus geometry, revealing how individual free will couples through social networks, forming collective decisions and consensus emergence.
References
Empowerment Theory
- Klyubin, A. S., Polani, D., & Nehaniv, C. L. (2005). Empowerment: A universal agent-centric measure of control. IEEE Congress on Evolutionary Computation.
- Salge, C., Glackin, C., & Polani, D. (2014). Empowerment–an introduction. In Guided Self-Organization: Inception (pp. 67-114).
Causal Theory
- Pearl, J. (2000). Causality: Models, Reasoning, and Inference. Cambridge University Press.
- Massey, J. L. (1990). Causality, feedback and directed information. Proc. Int. Symp. Inf. Theory Applic.(ISITA-90), 303-305.
Thermodynamics and Information
- Sagawa, T., & Ueda, M. (2008). Second law of thermodynamics with discrete quantum feedback control. Physical Review Letters, 100(8), 080403.
- Jarzynski, C. (1997). Nonequilibrium equality for free energy differences. Physical Review Letters, 78(14), 2690.
Free Energy Principle
- Friston, K. (2010). The free-energy principle: a unified brain theory? Nature Reviews Neuroscience, 11(2), 127-138.
- Parr, T., Pezzulo, G., & Friston, K. J. (2022). Active Inference: The Free Energy Principle in Mind, Brain, and Behavior. MIT Press.
Philosophy
- Hume, D. (1748). An Enquiry Concerning Human Understanding.
- Sartre, J.-P. (1943). L’Être et le néant (Being and Nothingness).
- Dennett, D. C. (1984). Elbow Room: The Varieties of Free Will Worth Wanting. MIT Press.
This Collection
- This collection: Observer–World Section Structure (Chapter 1)
- This collection: Structural Definition of Consciousness (Chapter 2)
- This collection: Attention–Time–Knowledge Graph (Chapter 4)
- This collection: Value–Meaning Unification: Optimal Geometry of Ethical Values and Physical Foundation of Free Will (Source theory document)