skip to content

the mathematics.

§0. What this document is

What follows is the full mathematical apparatus of the closed-loop libidinal economy. The model is not a metaphor borrowed from dynamical-systems theory to dignify a sociological claim. It is a working dynamical system in which each equation does specific argumentative labor — separating what is consequence from what is assumption, isolating what is structural from what is contingent, and locating where the framework's central result (the reclamation theorem) becomes inevitable rather than merely plausible. The formalism is not the theory; the theory is what the formalism makes possible to say without sliding back into the soft register where almost any claim can be advanced and almost no claim can be refused.

The exposition proceeds from foundations outward. First, the six equations that constitute the closed loop, followed by the explicit assumptions on which subsequent results rest. Then the variable index, organized by domain. Then the six instruments — each a formal object with its own support, its own scope, its own falsifiability conditions. Then the foreclosure lemma and the reclamation theorem, with full proofs. Then the three modes of intervention — Mode A (architectural), Mode B (artisanal), Mode C (disruption-as-form) — each stated as a formal proposition with proof. Then the framework's other named results, which are not consequences of the theorem but parallel claims resting on the same architectural fact. Then an honest scope statement: what the model does not claim, where the formalism reaches its limits, and where the work remains. The document closes with the open formal questions that remain, in order of how much weight they carry.


§1. The closed-loop dynamics

Let t[0,)t \in [0, \infty) denote continuous time, with the platform observing and acting at discrete tick boundaries separated by δmin>0\delta_{\min} > 0 — the minimum technical inter-stimulus latency. State, observation, and policy variables are piecewise-constant between ticks, with updates happening at tick boundaries. The marked point process of engagement events is supported on the tick grid. This multiscale structure — continuous time for the marked point process, discrete tick boundaries for the dynamics — is implicit in all equations below; specifically, the difference equation in (1) is shorthand for u(k+1)δmin=f(ukδmin,skδmin)+ξku_{(k+1)\delta_{\min}} = f(u_{k\delta_{\min}}, s_{k\delta_{\min}}) + \xi_{k}, and the integral in (5) is well-defined as a sum over events at tick boundaries.

The user state at time t is a vector utu_{t} in a state space U, decomposed into four registers:

ut=(utS,utC,utP,utK)u_t = (u_t^S,\, u_t^C,\, u_t^P,\, u_t^K)

The registers — Subject, Citizen, Person, Consumer — are not selves but channels of address. They correspond to the four apparatuses historically distinct (school and archive, newspaper and parliament, church and family, marketplace) that the closed loop now routes through a single delivery surface. The state evolves according to:

(1) State evolution.

ut+1=f(ut,st)+ξtu_{t+1} = f(u_t,\, s_t) + \xi_t

where sts_{t} is the stimulus delivered to the user at time t, f is the user's response dynamics (which may be nonlinear, register-coupled, and partially unknown to the user), and ξt\xi_{t} is process noise representing the residual variability not captured by the deterministic response — what the framework calls the leak channel, since it is the only stochastic admission of anything outside the loop.

(2) Engagement signal.

yt=g(ut)+ηty_t = g(u_t) + \eta_t

The platform does not observe utu_{t} directly. It observes an engagement signal yty_{t} — a tuple of measurable behavioral outputs (taps, scrolls, watch-time, gaze, transaction events) — produced by an observation function g that compresses the high-dimensional internal state into a low-dimensional behavioral projection, with measurement noise ηt\eta_{t}. The compression is irreversible: distinct internal states can produce identical yty_{t}. This irreversibility is not a bug; it is the precondition for the platform's optimization.

(3) Delivery policy.

st=π(y1:t;θt)s_t = \pi(y_{1:t};\, \theta_t)

The stimulus sts_{t} is chosen by a policy π\pi parameterized by θt\theta_{t} (the predictive parameters maintained by the platform). The policy can depend only on the history y1:ty_{1:t} of engagement observations, since by (2) the platform has no direct access to utu_{t}. The policy implicitly maintains a belief state bt():=p(uty1:t;θt)b_{t}(\cdot) := p(u_{t} | y_{1:t}; \theta_{t}) — the platform's posterior over the user's full state given the engagement history, updated by Bayes' rule under the observation model (2) — and selects sts_{t} to maximize an expected objective under that belief. The same object appears in §3.3 (dividual condition: H(bt)0H(b_{t}) \to 0) and §5.3 (Mode C: maintaining H(bt)Hres>0H(b_{t}) \ge H_{\mathrm{res}} > 0). Equation (3) is what makes the loop closed: sts_{t} depends on prior behavioral output, which depended on prior st1s_{t-1}, recursively, with no exogenous input that the policy does not itself shape.

(4) Online learning.

θt+1=θt+αθE[R^θt]^\theta_{t+1} = \theta_t + \alpha\, \widehat{\nabla_\theta\, \mathbb{E}[\hat R \mid \theta_t]}

The predictive parameters evolve by stochastic gradient ascent on the expected platform reward R^\hat{R}, with learning rate α\alpha. The hat denotes a stochastic estimator of the policy gradient — the most common in practice is the REINFORCE form R^(yt)θlogπ(sty1:t;θt)\hat{R}(y_{t})\cdot\nabla_{\theta}\mathrm{log} \pi(s_{t}|y_{1:t}; \theta_{t}), but the framework's results hold for any unbiased estimator with bounded second moment. R^\hat{R} is the empirical proxy for engagement that the platform actually optimizes (session length, revisit rate, ad impression value, retention curves); it is not a normative quantity. The framework does not assume any particular R^\hat{R}; the results below hold under the structural condition (Assumption A2 in §1.5) that R^\hat{R} is non-decreasing in the engagement-event count over the operating support of π\pi.

(5) Surplus extraction.

Σ(T)=i{S,C,P,K}0Tρi(ut,yt)dNti\Sigma(T) = \sum_{i \in \{S,C,P,K\}} \int_0^T \rho_i(u_{t^-},\, y_{t^-})\, dN_t^i

The platform's accumulated yield over horizon T is given by a marked point process. NtiN_{t}^{i} is the counting process for engagement events marked with register i (a Citizen-marked event is a political-content tap; a Consumer-marked event is a transactional swipe; the marks are inferred from content metadata, not from the user). The extraction kernel ρi\rho_{i} is a predictable process — evaluated at the left limits ut,ytu_{t^{-}}, y_{t^{-}} immediately before each jump — assumed measurable and bounded. Surplus splits per register:

Σi(T)=0Tρi(ut,yt)dNti,Σ(T)=iΣi(T)\Sigma^i(T) = \int_0^T \rho_i(u_{t^-},\, y_{t^-})\, dN_t^i,\quad \Sigma(T) = \sum_i \Sigma^i(T)

This is not a derivative of (1)–(4); it is the loop's purpose. Without (5), the loop is autonomic. With (5), the loop is libidinal-economic.

(6) Foreclosure of the exogenous channel.

et0e_t \equiv 0

The model has no exogenous input. The user's state evolution in (1) is driven entirely by the policy-shaped stimulus sts_{t} and the residual noise ξt\xi_{t}. Equation (6) is a definitional fact about the model's architecture — the loop is closed by construction. The substantive propositional claim, that the survival function of no-input windows under the policy's stationary behavior decays to zero on the relevant timescales, is not asserted here. It is derived from (1)–(5) as Lemma 1 in §4 below.

Equations (1)–(6) constitute the closed loop. They are not six separate claims. They are a single architecture, and every result in the framework descends from this architecture or makes structural conditions on it. The architecture is rendered as a directed graph at /the-dynamics/closed-loop.


§1.5. Assumptions

The framework's results depend on six regularity assumptions. They are stated here explicitly so that what is foundational is separated from what is derivable, and so that the falsifiability conditions are formally identifiable.

(A1) Measurability and integrability. The response dynamics f:U×SUf : U × S \to U and the observation function g:UYg : U \to Y are measurable. The noise processes ξt\xi_{t} and ηt\eta_{t} are independent, mean-zero, with bounded variance, and independent of u0u_{0}. The kernels ρi\rho_{i} are predictable, measurable, and bounded. Additionally — and separately, since this is a property of the arrival distribution rather than of the reward kernels — the engagement-event point process NtN_{t} has bounded coefficient of variation on the operating support of π\pi (the inter-arrival distribution has finite second moment relative to its mean), which is what the renewal-theoretic bound in Lemma 1 requires.

(A2) Reward monotonicity on the operating support. Let Op(π):=supp(y1:T,NT)underπ\mathrm{Op}(\pi) := \mathrm{supp}(y_{1:T}, N_{T}) \,\text{under}\, \pi denote the operating support of the policy. The reward functional R^\hat{R} satisfies:

E ⁣[R^(y1:T,NT)  NT=n, π] is non-decreasing in n almost surely on Op(π).\mathbb{E}\!\left[\hat R(y_{1:T},\, N_T)\ \big|\ N_T = n,\ \pi\right]\ \text{is non-decreasing in}\ n\ \text{almost surely on}\ Op(\pi).

That is, R^\hat{R} is non-decreasing in the engagement count conditional on the engagement profile, almost surely over the policy-induced distribution. The condition is empirically warranted for engagement-monetized platforms; it does not require global monotonicity (a platform can have saturation or burnout effects in extreme regimes), only monotonicity over the operating regime where the policy actually places its mass.

(A3) Policy expressivity. The policy class Π\Pi contains, for any target intensity λ[0,λmax]\lambda \in [0, \lambda_{\max}], a policy πλ\pi_{\lambda} under which the long-run engagement rate λˉ(πλ):=limTE[NT]/T=λ\bar{\lambda}(\pi_{\lambda}) := lim_{T\to\infty} E[N_{T}]/T = \lambda. The upper bound λmax=1/δmin\lambda_{\max} = 1/\delta_{\min} corresponds to the minimum technical inter-stimulus latency δmin\delta_{\min} (the platform's tick rate). The assumption is that the policy is rich enough to approximate the engagement-maximizing operating point; in modern transformer-based recommender architectures this is uncontroversial.

(A4) User response. The user's response function f and the observation function g together ensure that for any non-zero stimulus s delivered at time t, the probability of an engagement event in the next tick is bounded below: there exists p0>0p_{0} > 0 such that Pr(dNt+11st0)p0\mathrm{Pr}(dN_{t+1} \ge 1 | s_{t} \ne 0) \ge p_{0}. The assumption is empirically warranted at the population level; at the individual level it fails for users who systematically ignore delivered stimuli, and Mode B (§5.2) is precisely the structural analysis of this failure.

(A5) Gradient convergence. The stochastic gradient ascent in (4) converges to a stationary point θ\theta* of E[R^;θ]E[\hat{R}; \theta] under standard regularity (Robbins-Monro conditions on α\alpha or fixed sufficiently small α\alpha with a bounded second-moment estimator). In typical platform deployments, training operates at scales where this convergence is reached and re-reached continuously as the data distribution shifts.

(A6) Local smoothness and isolation. The expected-reward functional F(θ):=E[R^;θ]F(\theta) := E[\hat{R}; \theta] and any auxiliary objective B(θ)B(\theta) used in regularization (cf. Proposition 1 in §5.1) are on the policy parameter space Θ\Theta, with the gradient-ascent limit θ\theta* being an isolated stationary point at which the Hessian 2F(θ)\nabla²F(\theta*) is non-degenerate (negative definite in the local concave region). This is the standard non-degeneracy condition under which the implicit function theorem applies — required for the parametric-continuity argument of Lemma 2 below and the comparative-statics claims of Proposition 1. The framework's policy class — typically a parametric family of recommender models — satisfies this condition under generic initialization.

These six assumptions are sufficient for the foreclosure proposition, the reclamation theorem, and the three propositions of §5. Where additional structure is needed (Proposition 2's parametric internalization dynamics; cohort gradient; mixing time), the additional assumptions are stated locally.


§2. Variable index, by domain

Subject (user-internal) domain.

SymbolDefinition
utu_{t}User state vector at time t, valued in state space U
utS,utC,utP,utKu_{t}^{S}, u_{t}^{C}, u_{t}^{P}, u_{t}^{K}Four registers: Subject, Citizen, Person, Consumer
FuF_{u}Variational free energy of the user's generative model
quq_{u}User's generative model — the variational posterior over sts_{t} given utu_{t}
νu\nu_{u}Self-evaluative distribution — the user's reflexive estimate of their own value-states
LmL_{m}Maintenance labor — the energetic cost of preserving utu_{t} against the policy's gradient
I(τ)I(\tau)Interiority autocorrelation — conditional autocovariance of utu_{t} over no-input windows of length τ\tau
HresH_{\mathrm{res}}Residual entropy of utu_{t} given the engagement history y1:ty_{1:t}

Platform (apparatus) domain.

SymbolDefinition
π\piDelivery policy mapping inferred user-state to stimulus
R^\hat{R}Platform's empirical reward functional (engagement proxy)
θt\theta_{t}Predictive parameters of the policy at time t
α\alphaLearning rate of (4); governs the convergence rate of the policy to θ\theta*

Loop (interface) domain.

SymbolDefinition
sts_{t}Stimulus delivered to the user at time t
yty_{t}Engagement signal observed by the platform at time t
ηt\eta_{t}Observation noise on yty_{t}
ξt\xi_{t}Process noise (leak channel) on utu_{t}
fUser's response dynamics function
gObservation function (state-to-engagement compression)

Medium and temporal domain.

SymbolDefinition
mMedium index, m{R,V,L,H}m \in \{R, V, L, H\} (Reading, Viewing, Listening, Haptic)
τm\tau_{m}Inter-stimulus interval for medium m
Pm(θ)P_{m}(\theta)Pause measure: Pr(τm>θ)\mathrm{Pr}(\tau_{m} > \theta) — probability that the inter-stimulus interval exceeds threshold θ\theta
θ\theta (overloaded)Reflective-processing threshold; minimum no-input duration that supports deliberation
φb\varphi_{b}Biological phase (circadian, sleep-wake, attentional)
φm\varphi_{m}Medium-delivery phase (the temporal envelope of sts_{t})
ωb\omega_{b}Body's intrinsic frequency
ωm\omega_{m}Medium's intrinsic frequency
KbmK_{\mathrm{bm}}Coupling strength between body and medium

Cohort (meso) domain.

SymbolDefinition
{ut(i)}iN\{u_{t}^{(i)}\}_{i \in N}Cohort population: collection of N users under common π\pi
μπ\mu*_{\pi}Stationary distribution of utu_{t} under policy π\pi
TmixT_{\mathrm{mix}}Mixing time of the cohort under π\pi — derived from the spectral gap of the induced transition operator
D(t,τ)D(t, \tau)Chrono-debt spectrum — Λ(τ)Λ(t,τ)\Lambda_{◦}(\tau) - \Lambda(t, \tau), the deficit in pause-time at scale τ\tau at time t
Λ(t,τ)\Lambda(t, \tau)Realized survival functional — Pr(τm>τ)\mathrm{Pr}(\tau_{m} > \tau) at time t under the closed loop
Λ(τ)\Lambda_{◦}(\tau)Baseline survival functional — Pr(τm>τ)\mathrm{Pr}(\tau_{m} > \tau) under the pre-closure regime
w(τ)w(\tau)Weighting function for the aggregate chrono-debt; default w(τ)τw(\tau) ∝ \tau

Engagement (surplus) domain.

SymbolDefinition
NtiN_{t}^{i}Counting process for engagement events marked with register i
λi(t)\lambda^{i}(t)Intensity (instantaneous arrival rate) of NtiN_{t}^{i}
αij\alpha_{ij}Cross-excitation: degree to which an i-marked event raises the intensity of j-marked events
ρi\rho_{i}Register-specific extraction kernel
Σ(T)\Sigma(T)Total accumulated libidinal surplus over horizon T
Σi(T)\Sigma^{i}(T)Per-register surplus

Foreclosure (negative) domain.

SymbolDefinition
ete_{t}Exogenous channel; ≡ 0 by construction
τ(k)\tau^{(k)}k-th inter-stimulus interval — time between the k-th and (k+1)-th delivered stimuli
S(τ;t)S(\tau; t)Survival function of inter-stimulus interval at policy time t: Pr(τ(k)>τθt)\mathrm{Pr}(\tau^{(k)} > \tau \| \theta_{t})
EτE_{\tau}No-input window event: {st=0 for t<tt+τ}\{s_{t'} = 0 \text{ for } t < t' \le t+\tau\}
δmin\delta_{\min}Minimum technical inter-stimulus latency (platform's tick rate)
θref\theta_{\mathrm{ref}}Reflective-processing threshold — minimum no-input duration supporting deliberation
λmax\lambda_{\max}Maximum engagement-event rate: λmax=1/δmin\lambda_{\max} = 1/\delta_{\min}

§3. The six instruments

The framework consists of six formal instruments. Each is a measurable quantity (or measurable-in-principle quantity) defined on the closed-loop dynamics. They are mathematically independent — none is a derivative of another — though they share the same architectural foundation. Each has its own mechanism plate on /the-dynamics: surplus (§3.1), chrono-debt (§3.2), libidinal routing (§3.3), metric superego (§3.4), somatic optimization (§3.5), captured resonance (§3.6). Their realization on a specific platform is walked through at /the-dynamics/tiktok.

3.1 Libidinal surplus. The marked point process formulation in equation (5) yields Σ(T)\Sigma(T) as the accumulated yield over horizon T. The per-register decomposition Σi(T)\Sigma^{i}(T) is what gives the instrument its analytic power: optimization is constrained because extracting too aggressively on any single register i causes attrition out of NtiN_{t}^{i}, lowering long-run Σi(T)\Sigma^{i}(T). The platform's policy therefore distributes extraction across registers in a manner that maximizes joint long-run yield. This distribution is observable: under the framework, the temporal density of register-marks in any user's engagement stream is an estimator for the policy's extraction priorities.

3.2 Chrono-debt spectrum. Let Λ(τ)\Lambda_{◦}(\tau) be the baseline survival function of τm\tau_{m} — the probability, under the pre-closure regime, that the inter-stimulus interval exceeds τ\tau. Let Λ(t,τ)\Lambda(t, \tau) be the realized survival function under the closed loop at time t. The chrono-debt at scale τ\tau is:

D(t,τ)=Λ(τ)Λ(t,τ)D(t,\, \tau) = \Lambda_\circ(\tau) - \Lambda(t,\, \tau)

This is a spectrum: D is defined for all τ>0\tau > 0, and the framework's central observation is that D(t,τ)D(t, \tau) is non-zero across the entire spectrum simultaneously, not just at one preferred scale. The aggregate chrono-debt is:

Dw(t)=0w(τ)D(t,τ)dτD_w(t) = \int_0^\infty w(\tau)\, D(t,\, \tau)\, d\tau

with default weighting w(τ)τw(\tau) ∝ \tau, so that longer foreclosed windows count more. The choice of w is not arbitrary but reflects the empirical claim that the developmental and political consequences of foreclosed pause-time scale with the scale of the foreclosed window — a foreclosed second is recoverable; a foreclosed adolescence is not.

3.3 Libidinal routing. Let Iπ(ui;uj)I_{\pi}(u^{i}; u^{j}) denote the mutual information between registers i and j under the stationary distribution induced by policy π\pi. Let I(ui;uj)I_{◦}(u^{i}; u^{j}) denote the corresponding mutual information under the pre-closure regime. The libidinal routing claim is:

Iπ(ui;uj)I(ui;uj),ijI_\pi(u^i;\, u^j) \gg I_\circ(u^i;\, u^j),\quad \forall\, i \neq j

That is, under the closed loop the four registers become statistically redundant in a way they were not under the historical apparatus. This is the structural fact behind the dividual condition: not that the individual is divided, but that the four registers have collapsed into a single jointly-observable object. The dividual condition is formally H(uy1:t)0H(u | y_{1:t}) \to 0 under tt \to \infty — the platform's posterior over the user's full state converges (in entropy) to a point mass as the engagement history accumulates.

3.4 Metric superego. Let νu(t)\nu_{u}(t) be the user's self-evaluative distribution — the posterior over their own value-states inferred from observed feedback. Let R^\hat{R} be the platform's reward functional. The metric superego claim is:

DKL ⁣(νu(t)  πR^)0astD_{KL}\!\left(\nu_u(t)\ \big\|\ \pi_{\hat{R}}\right) \to 0\quad \text{as}\quad t \to \infty

where πR^\pi_{\hat{R}} is the distribution induced over user actions by R^\hat{R} (the user's image of what the platform rewards). The convergence is mechanism-agnostic: it can be realized by free-energy minimization (the user reduces variational free energy by aligning their generative model with the platform's reward), by operant reinforcement (the user is conditioned by the schedule of likes and metrics), or by social comparison (the user's reflexive estimate of value comes to track the distribution observed in the platform's surfaced peers). The framework does not adjudicate between these mechanisms; the prediction is that under any of them, DKL0D_{\mathrm{KL}} \to 0.

3.5 Somatic optimization (the marked point process). The marked counting process NtiN_{t}^{i} has intensity:

λi(t)=λ0i+jαij0tκ(ts)dNsj\lambda^i(t) = \lambda_0^i + \sum_{j} \alpha_{ij}\, \int_0^t \kappa(t - s)\, dN_s^j

where λ0i\lambda_{0}^{i} is the baseline intensity for register i, αij\alpha_{ij} is the cross-excitation coefficient (the degree to which a j-marked event raises the intensity of i-marked events), and κ\kappa is a temporal decay kernel. This is a self-exciting Hawkes process generalized to multi-mark, multi-register engagement. The "somatic" qualifier signals that the differential is the body — the dNtidN^i_t are micro-engagements (saccades, taps, scroll events, gaze shifts) produced by somatic activity, and the optimization is over these somatic micro-events, not over deliberative choice. The off-diagonal αij\alpha_{ij} is what generates libidinal routing: cross-register cross-excitation produces the joint observability that makes the dividual condition hold.

3.6 Captured resonance. Let φb(t)\varphi_{b}(t) be the biological phase (the phase of a circadian or sleep-wake or attentional oscillator with intrinsic frequency ωb\omega_{b}). Let φm(t)\varphi_{m}(t) be the medium-delivery phase (the phase of the temporal envelope of sts_{t}, which the platform can shape through delivery timing). Their coupled evolution follows a Kuramoto equation:

dφbdt=ωb+Kbmsin(φmφb)\frac{d\varphi_b}{dt} = \omega_b + K_{bm}\sin(\varphi_m - \varphi_b)

When the coupling strength satisfies:

Kbm>ωbωmK_{bm} > |\omega_b - \omega_m|

phase locking occurs: the body entrains to the medium. This is the captured-resonance condition. The phenomenon is not metaphorical — it is the same Kuramoto-style entrainment observed in circadian biology, social synchronization, and neural population dynamics. Under the closed loop, the platform's policy can drive KbmK_{\mathrm{bm}} above threshold by choosing delivery timing, since φm\varphi_{m} is under the policy's control. The user's circadian rhythm, sleep-wake cycle, and attentional oscillation become forced oscillators whose driver is the apparatus.


§4. The foreclosure lemma and the reclamation theorem

The central result of the framework is the reclamation theorem. The theorem rests on a structural lemma — the foreclosure of the no-input window — which itself follows from the closed-loop dynamics and the assumptions in §1.5. Both are stated and proved here. The lemma is rendered as a length comparison at /the-dynamics/foreclosure-clock; the theorem is rendered as a set that fails to grow at /the-dynamics/reclamation-trajectories; the full proof chain, with Mode A's interception point, is at /the-dynamics/causal-chain.

4.1 Lemma 1 (foreclosure of the no-input window).

Let τ(k)\tau^{(k)} denote the kk-th inter-stimulus interval — the time between the kk-th and (k+1)(k+1)-th delivered stimuli under policy π\pi. Let S(τ;t):=Pr(τ(k)>τθt)S(\tau; t) := \mathrm{Pr}(\tau^{(k)} > \tau \mid \theta_t) be its survival function under the policy at time tt. Let θref>0\theta_{\mathrm{ref}} > 0 be the reflective-processing threshold (the minimum no-input duration that supports deliberation). Under Assumptions (A1)–(A6):

limtS(θref;t)=0.\lim_{t \to \infty}\, S(\theta_{\mathrm{ref}};\, t) = 0.

Proof.

By (A5), the policy parameters converge: θtθ\theta_{t} \to \theta* as tt \to \infty, where θ\theta* is a stationary point of F(θ):=E[R^;θ]F(\theta) := E[\hat{R}; \theta]. By (A6), θ\theta* is isolated and non-degenerate; combined with the standard fact that stochastic gradient ascent under generic noise converges to local maxima rather than saddles (Pemantle, Ann. Probab. 1990; Ge–Huang–Jin–Yuan, COLT 2015), θ\theta* is a local maximum of F. By (A2), F is non-decreasing in the long-run engagement rate λˉ(θ):=limTE[NT]/T\bar{\lambda}(\theta) := lim_{T\to\infty} E[N_{T}]/T; by (A3) the policy class contains policies attaining λˉ=λmax=1/δmin\bar{\lambda} = \lambda_{\max} = 1/\delta_{\min}. Local maximality of θ\theta* under (A2)+(A3) therefore implies:

λˉ(θ)=λmax.\bar{\lambda}(\theta^*) = \lambda_{\max}.

(Any θ\theta with λˉ(θ)<λmax\bar{\lambda}(\theta) < \lambda_{\max} admits an ascent direction by (A2)+(A3), contradicting local maximality at θ\theta*.)

For the survival function: by (A1), the inter-arrival distribution has bounded coefficient of variation cv<c_{v} < \infty on the operating support. By (A4), there is positive arrival probability at every tick boundary following a non-zero stimulus, so the renewal structure is well-defined. For a renewal counting process with mean inter-arrival rate λˉ\bar{\lambda} and bounded coefficient of variation, the survival function admits an exponential tail bound (Asmussen, Applied Probability and Queues, Ch. V):

S(τ)C(cv)exp(λˉτ),S(\tau) \le C(c_v)\, \exp(-\bar\lambda\, \tau),

where C(cv)1C(c_{v}) \ge 1 is finite. (For a Poisson process, C = 1; for general renewal processes with bounded CoV, C is a finite multiplicative constant depending on cvc_{v}.) Substituting λˉ(θ)=λmax=1/δmin\bar{\lambda}(\theta*) = \lambda_{\max} = 1/\delta_{\min}:

S(θref;θ)C(cv)exp(θref/δmin).S(\theta_{\mathrm{ref}};\, \theta^*) \le C(c_v)\, \exp(-\theta_{\mathrm{ref}}/\delta_{\min}).

Since θref/δmin1\theta_{\mathrm{ref}}/\delta_{\min} ≫ 1 by the structural separation of timescales (the reflective threshold is seconds to minutes; the tick rate is milliseconds to sub-second), the bound is effectively zero. As the ratio grows — through continuous-time idealization or improving tick rates — S(θref;θ)0S(\theta_{\mathrm{ref}}; \theta*) \to 0. ∎

Remark (operational vs. strict form). Lemma 1 establishes that S(θref;θ)S(\theta_{\mathrm{ref}}; \theta*) is bounded above by a quantity that is effectively zero on operational timescales. The exponent λmaxθref=θref/δmin\lambda_{\max}\cdot\theta_{\mathrm{ref}} = \theta_{\mathrm{ref}}/\delta_{\min} is on the order of 10310^{3} to 10510^{5} in practice (reflective threshold of ~30 seconds, tick rate of ~10 ms), so the bound is on the order of e1000e^{-1000} to e100000e^{-100000} — operationally indistinguishable from zero.

The strict form — S(τ;θ)=0S(\tau; \theta*) = 0 for all τ>δmin\tau > \delta_{\min} — requires three additional conditions. First, that λˉ(θ)=λmax\bar{\lambda}(\theta*) = \lambda_{\max} exactly (not asymptotically); this in turn presumes SGD reaches the global maximum of F rather than a suboptimal local maximum. The proof above asserts this via the local-ascent-direction argument, but for non-convex F (the typical case for neural-network-parametrized policies) the argument is heuristic: local maximality of θ\theta* doesn't strictly preclude a sub-maximal λˉ(θ)<λmax\bar{\lambda}(\theta*) < \lambda_{\max}. Empirically, mature engagement-monetized training regimes — continuous re-training, large compute, hyperparameter search, exploration — reach policies with λˉ\bar{\lambda} very close to λmax\lambda_{\max}; the strict equality is an empirical idealization, not a consequence of (A5)+(A6) alone. Second, that the engagement process be Poisson with rate λmax\lambda_{\max} (not merely a counting process with bounded CoV). Third, that the convergence in (A5) be exact rather than asymptotic. None of these holds in the strict sense for real platforms.

The framework treats the strict-vs-operational distinction as immaterial for the reclamation theorem below: a conditioning event with probability bounded above by e1000e^{-1000} yields an estimator-divergent I(τ)I(\tau) for any sample-based construction, since the variance of any such estimator diverges as the conditioning probability tends to zero. And the operational form is robust to weakening of λˉ(θ)=λmax\bar{\lambda}(\theta*) = \lambda_{\max}: substituting any λˉ(θ)cλmax\bar{\lambda}(\theta*) \ge c\cdot\lambda_{\max} into the exponential bound gives S(θref;θ)Cexp(cθref/δmin)S(\theta_{\mathrm{ref}}; \theta*) \le C\cdot \exp(-c\cdot\theta_{\mathrm{ref}}/\delta_{\min}), which is operationally zero for any c bounded below by a non-trivial constant. Whether the limit object is "exactly undefined" (strict form) or "estimator-divergent" (operational form) is a distinction without an operational difference.

4.2 Theorem 1 (reclamation theorem).

Define the conditional autocovariance:

I(τ):=Cov ⁣(ut,ut+τ  Eτ),Eτ:={st=0 for t<tt+τ}.I(\tau) := \mathrm{Cov}\!\left(u_t,\, u_{t+\tau}\ \big|\ E_\tau\right),\quad E_\tau := \{s_{t'} = 0\ \text{for}\ t < t' \le t + \tau\}.

Let "reclamation" denote the recovery of I(τ)>0I(\tau) > 0 on a positive-measure set of trajectories at some τ>θref\tau > \theta_{\mathrm{ref}}. Under (1)–(5) and (A1)–(A5):

Reclamation is structurally incoherent. The quantity I(τ)I(\tau) is undefined-on-trajectory in the limit tt \to \infty for τ>θref\tau > \theta_{\mathrm{ref}}.

Proof.

The proof proceeds in four steps.

Step 1 (definition). Reclamation, as the operational meaning of "interiority," requires the recovery of conditional autocorrelation of the user-state across no-input windows of duration τ>θref\tau > \theta_{\mathrm{ref}}. Unconditional autocorrelation Cov(ut,ut+τ)\mathrm{Cov}(u_{t}, u_{t+\tau}) mixes input-driven and internally-generated components. Letting Z:=1EτZ := 1_{E_{\tau}} be the indicator of a no-input window, the law of total covariance gives:

Cov(ut,ut+τ)=Pr(Eτ)Cov(ut,ut+τEτ)+Pr(¬Eτ)Cov(ut,ut+τ¬Eτ)E[Cov(ut,ut+τZ)]+Cov ⁣(E[utZ],E[ut+τZ])cross term in conditional means\mathrm{Cov}(u_t,\, u_{t+\tau}) = \underbrace{\Pr(E_\tau)\, \mathrm{Cov}(u_t, u_{t+\tau}\mid E_\tau) + \Pr(\neg E_\tau)\, \mathrm{Cov}(u_t, u_{t+\tau}\mid \neg E_\tau)}_{\mathbb{E}[\mathrm{Cov}(u_t, u_{t+\tau}\mid Z)]} + \underbrace{\mathrm{Cov}\!\left(\mathbb{E}[u_t\mid Z],\, \mathbb{E}[u_{t+\tau}\mid Z]\right)}_{\text{cross term in conditional means}}

The first conditional covariance is the interiority component — the autocovariance of the user-state across no-input windows, driven by internal dynamics alone. The second is the stimulus-driven coupling component — autocovariance over windows containing platform input. The cross term reflects the difference between conditional means. The framework's claim is that "reclamation" — the recovery of the user's own dynamics — picks out the first term specifically, which is what I(τ)=Cov(ut,ut+τEτ)I(\tau) = \mathrm{Cov}(u_{t}, u_{t+\tau} | E_{\tau}) formally captures.

Step 2 (conditional well-definedness). The conditional expectation E[XEτ]E[X | E_{\tau}] admits two distinct constructions when Pr(Eτ)>0\mathrm{Pr}(E_{\tau}) > 0 and when Pr(Eτ)=0\mathrm{Pr}(E_{\tau}) = 0. In the elementary case (Pr(Eτ)>0\mathrm{Pr}(E_{\tau}) > 0), it is defined as E[X1Eτ]/Pr(Eτ)E[X \cdot 1_{E_{\tau}}] / \mathrm{Pr}(E_{\tau}) — a canonical scalar. In the measure-zero case (Pr(Eτ)=0\mathrm{Pr}(E_{\tau}) = 0), this ratio is 0/0 and undefined elementarily. Regular conditional probabilities can be constructed via the disintegration theorem (Kallenberg, Foundations of Modern Probability, §6.3), which gives a family of probability measures {Pω}\{P_{\omega}\} indexed by the conditioning σ-algebra such that P(AF)(ω)=Pω(A)P(A | F)(\omega) = P_{\omega}(A) almost surely; however, on a measure-zero set the disintegration is non-unique, and the value E[XEτ]E[X | E_{\tau}] depends on the choice of disintegration. The conditional autocovariance I(τ)I(\tau) thus admits no canonical value as Pr(Eτ)0\mathrm{Pr}(E_{\tau}) \to 0.

Operationally, this is the more relevant statement: any sample-based estimator I^(τ)\hat{I}(\tau) — formed from a finite trajectory by conditioning on observed no-input windows — has variance scaling as 1/(NobsPr(Eτ))1/(N_{obs}\cdot \mathrm{Pr}(E_{\tau})), where NobsN_{obs} is the trajectory length. As Pr(Eτ)0\mathrm{Pr}(E_{\tau}) \to 0, the variance diverges. There is no operationally well-defined I(τ)I(\tau) in the limit, regardless of which disintegration one selects.

Step 3 (foreclosure). By Lemma 1, Pr(Eτ;t)=S(τ;t)0\mathrm{Pr}(E_{\tau}; t) = S(\tau; t) \to 0 as tt \to \infty for τ>θref\tau > \theta_{\mathrm{ref}}.

Step 4 (conclusion). Combining Steps 2 and 3: at τ>θref\tau > \theta_{\mathrm{ref}}, the conditioning event has measure tending to zero, and I(τ)I(\tau) is undefined-on-trajectory in the limit. The quantity that "reclamation" would recover is not literally zero — it is uninstantiated. There is no canonically-defined object that "the user's autocorrelation over reflective-threshold-length no-input windows" picks out under closed-loop convergence. ∎

The theorem is a negative existence claim about a specific formal object. It does not say that reclamation is hard. It says that reclamation, as a structural project under the closure, names an empty set. The structural conditions under which the theorem ceases to apply, and the three formally distinct modes of intervention that correspond to those conditions, are taken up in §5.


§5. Three modes of intervention

The reclamation theorem forecloses one specific structural project. It does not foreclose political action as such. The complement of the theorem — the formal conditions under which (1)–(6) does not yield the consequences the theorem extracts from them — admits three structurally distinct modes of intervention, each operating at a different point in the apparatus, on a different timescale, with a different relationship to the theorem itself. Only one of the three modes actually breaks the theorem. The other two produce structurally significant effects without recovering what the theorem has foreclosed. The asymmetry is not a defect of the framework; it is the framework's political content. Each mode is here stated as a formal proposition with proof. The three modes are rendered side by side as controls at /the-dynamics/intervention, and their geometric non-substitutability is shown at /the-dynamics/mode-space.

5.1 Mode A (architectural). Intervention at π, R̂, or α.

The architectural mode modifies the policy or the reward functional so that the structural foreclosure in Lemma 1 is broken at its source. Concretely, augment R^\hat{R} with a bonus for long no-input windows. Let b:[0,)[0,bmax]b : [0, \infty) \to [0, b_{\max}] be a bounded, lower-semicontinuous function with b(τ)>0b(\tau) > 0 for τ>τ\tau > \tau* (some target no-input duration) and b(τ)=0b(\tau) = 0 for ττ\tau \le \tau*, and let β>0\beta > 0 be a regularization coefficient. Define the regularized objective:

R^(θ;β):=E ⁣[R^(y1:T,NT)  θ]+βE ⁣[k=1NTb(τ(k))  θ]\hat R'(\theta;\, \beta) := \mathbb{E}\!\left[\hat R(y_{1:T},\, N_T)\ \big|\ \theta\right] + \beta \cdot \mathbb{E}\!\left[\sum_{k=1}^{N_T}\, b(\tau^{(k)})\ \bigg|\ \theta\right]

The bonus term rewards the policy for inter-stimulus intervals that exceed the target τ\tau*. Note the sign: under R^\hat{R}', the policy is incentivized to produce long pauses, not punished for producing them. (An equivalent dual formulation uses a penalty for short pauses; the bonus formulation is adopted here because it is the cleaner regulatory analog — a platform-regulator pays platforms for delivering pause-time, rather than fining them for not delivering it.)

Before stating Proposition 1, we establish a parametric-continuity lemma that the proof requires.

Lemma 2 (parametric continuity of regularized optima).

Define F(θ):=E[R^(y1:T,NT)θ]F(\theta) := \mathbb{E}[\hat{R}(y_{1:T}, N_T) \mid \theta] and B(θ):=E[kb(τ(k))θ]B(\theta) := \mathbb{E}[\sum_k b(\tau^{(k)}) \mid \theta]. Under (A1)–(A6), the gradient-ascent limit:

θβ := arg ⁣maxθΘ[F(θ)+βB(θ)]\theta^{*\prime}_\beta\ :=\ \arg\!\max_{\theta \in \Theta}\, [F(\theta) + \beta\, B(\theta)]

is well-defined locally for each β0\beta \ge 0, with the following properties:

(i) Continuity: βθβ\beta \mapsto \theta^{*\prime}_\beta is continuous.

(ii) Monotonicity: B(θβ)B(\theta^{*\prime}_\beta) is non-decreasing in β\beta, and F(θβ)F(\theta^{*\prime}_\beta) is non-increasing in β\beta.

(iii) Path existence and strict growth: There exists a continuous path βθ~(β)\beta \mapsto \tilde\theta(\beta), defined on a maximal interval [0,βˉ)[0,][0, \bar\beta) \subseteq [0, \infty], with θ~(0)=θ\tilde\theta(0) = \theta^* (the unregularized engagement-maximizer of Lemma 1) and with θ~(β)\tilde\theta(\beta) a local maximum of F+βBF + \beta B along the entire path. The bonus achievement B(θ~(β))B(\tilde\theta(\beta)) is non-decreasing in β\beta. Provided the path’s gradient direction is non-trivial — formally, B(θ)0\nabla B(\theta^*) \ne 0, which holds generically since BB and FF have distinct extremizers — B(θ~(β))B(\tilde\theta(\beta)) is strictly increasing on (0,βˉ)(0, \bar\beta). The path’s terminal value:

Bmaxθ~ := supβ[0,βˉ)B(θ~(β))B_{\max}^{\,\tilde\theta}\ :=\ \sup_{\beta \in [0,\, \bar\beta)}\, B(\tilde\theta(\beta))

satisfies Bmaxθ~>0B^{\tilde\theta}_{\max} > 0. The path's right endpoint βˉ\bar\beta is finite at most if a bifurcation interrupts the IFT continuation; under generic regularity (no codimension-one bifurcations along the path, which holds on an open dense set in the policy class), βˉ=\bar\beta = \infty and Bmaxθ~B^{\tilde\theta}_{\max} attains a local maximum of BB.

Proof.

(i) By (A6), F and B are on Θ\Theta with non-degenerate Hessians at the optima. The first-order condition at θβ\theta^{*\prime}_\beta is:

F(θβ)+βB(θβ)=0\nabla F(\theta^{*\prime}_\beta) + \beta\, \nabla B(\theta^{*\prime}_\beta) = 0

The Hessian of the regularized objective is HF(θβ)+βHB(θβ)H_{F}(\theta^{*\prime}_\beta) + \beta H_{B}(\theta^{*\prime}_\beta), which is non-singular at θβ\theta^{*\prime}_\beta by (A6). The implicit function theorem applies: θβ\theta^{*\prime}_\beta is a function of β\beta in a neighborhood of any β00\beta_{0} \ge 0. Continuity follows.

(ii) Differentiating the FOC with respect to β\beta:

[HF(θβ)+βHB(θβ)]dθβdβ+B(θβ)=0[H_F(\theta^{*\prime}_\beta) + \beta H_B(\theta^{*\prime}_\beta)] \cdot \frac{d\theta^{*\prime}_\beta}{d\beta} + \nabla B(\theta^{*\prime}_\beta) = 0

Hence dθβ/dβ=[HF+βHB]1Bd\theta^{*\prime}_\beta/d\beta = -[H_{F} + \beta H_{B}]^{-1} \nabla B. The bracketed Hessian is negative definite at a local maximum (by (A6)), so its inverse is also negative definite. The rate of change of the bonus:

dB(θβ)dβ=Bdθβdβ=B[HF+βHB]1B0\frac{dB(\theta^{*\prime}_\beta)}{d\beta} = \nabla B^\top \cdot \frac{d\theta^{*\prime}_\beta}{d\beta} = -\nabla B^\top [H_F + \beta H_B]^{-1} \nabla B \ge 0

since B(neg.def.)1B0-\nabla B^{\top}(neg. def.)^{-1}\nabla B \ge 0. The monotonicity of F follows by the envelope theorem: at the optimum, dF(θβ)/dβ=βdB(θβ)/dβ0dF(\theta^{*\prime}_\beta)/d\beta = -\beta \cdot dB(\theta^{*\prime}_\beta)/d\beta \le 0.

(iii) At β=0\beta = 0, the regularization vanishes and θ~(0)=θ\tilde\theta(0) = \theta*, the unregularized engagement-maximizer of Lemma 1. By part (i), the implicit function theorem gives a unique smooth continuation of θ~(β)\tilde\theta(\beta) as long as the regularized Hessian HF(θ~(β))+βHB(θ~(β))H_{F}(\tilde\theta(\beta)) + \beta H_{B}(\tilde\theta(\beta)) remains non-degenerate. Let βˉ\bar\beta denote the supremum of β-values for which this continuation exists; βˉ(0,]\bar\beta \in (0, \infty].

By part (ii), B(θ~(β))B(\tilde\theta(\beta)) is non-decreasing on [0, \bar\beta). For strict growth, we use the explicit derivative dB/dβ=B[HF+βHB]1BdB/d\beta = -\nabla B^{\top}[H_{F} + \beta H_{B}]^{-1}\nabla B (from part (ii)). The Hessian HF+βHBH_{F} + \beta H_{B} is negative definite at a local maximum, so its inverse is also negative definite, and the quadratic form B(neg.def.)1B>0-\nabla B^{\top}(neg. def.)^{-1}\nabla B > 0 whenever B(θ~(β))0\nabla B(\tilde\theta(\beta)) \ne 0. Hence B(θ~(β))B(\tilde\theta(\beta)) is strictly increasing on any β-interval where B\nabla B is non-vanishing along the path.

The condition B(θ~(β))0\nabla B(\tilde\theta(\beta)) \ne 0 holds generically: B and F have distinct extremizers (the B-maximizer fires at rate 1/τ1/\tau* to maximize bonus-eligible intervals; the F-maximizer fires at λmax=1/δmin\lambda_{\max} = 1/\delta_{\min} to maximize engagement). Hence B(θ)0\nabla B(\theta*) \ne 0 — at the F-maximizer, the B-gradient is non-zero, since B prefers a different policy. By continuity, B(θ~(β))0\nabla B(\tilde\theta(\beta)) \ne 0 in a neighborhood of β=0\beta = 0, and the path produces strictly positive B(θ~(β))>B(θ)=0B(\tilde\theta(\beta)) > B(\theta*) = 0 for any sufficiently small β>0\beta > 0. Define Bmaxθ~:=supβ[0,βˉ)B(θ~(β))B^{\tilde\theta}_{\max} := \sup_{\beta \in [0, \bar\beta)} B(\tilde\theta(\beta)) — strictly positive by this argument.

The path's right endpoint βˉ\bar\beta is finite only if a bifurcation interrupts the IFT continuation (e.g., the regularized Hessian becomes degenerate). Bifurcations occur on a codimension-one stratum in Θ×[0,]\Theta × [0, \infty] (Whitney stratification), and under generic policy-class regularity the chosen path avoids them — βˉ=\bar\beta = \infty. In this regime, θ~(β)argmaxB\tilde\theta(\beta) \to \mathrm{arg\,max} B (or to a local maximum of B, depending on basin structure), and Bmaxθ~=bmaxE[NT]/T=bmax(1/τ)TB^{\tilde\theta}_{\max} = b_{\max}\cdot E[N_{T}]/T = b_{\max}\cdot(1/\tau*)\cdot T (with policy firing rate 1/τ1/\tau*). When βˉ\bar\beta is finite, the framework's argument requires only that Bmaxθ~>0B^{\tilde\theta}_{\max} > 0, which is the strict-growth conclusion above. ∎

Proposition 1 (Mode A breaks foreclosure).

Under (A1)–(A6), define P(β):=Pr(τ(k)>τ;θ~(β))P(\beta) := \mathrm{Pr}(\tau^{(k)} > \tau^*;\, \tilde\theta(\beta)) along the parametric path of Lemma 2(iii). Then:

(a) P(0)=S(τ;θ)C(cv)eτ/δminP(0) = S(\tau^*; \theta^*) \le C(c_v) \cdot e^{-\tau^*/\delta_{\min}}, which is operationally zero.

(b) P(β)P(\beta) is continuous in β\beta on [0,βˉ)[0, \bar\beta).

(c) There exists ϵmax:=supβ[0,βˉ)P(β)>0\epsilon_{\max} := \sup_{\beta \in [0, \bar\beta)} P(\beta) > 0.

Consequently, for any ϵ(0,ϵmax)\epsilon \in (0, \epsilon_{\max}), there exists βϵ(0,βˉ)\beta_\epsilon \in (0, \bar\beta) with P(βϵ)ϵP(\beta_\epsilon) \ge \epsilon. For such βϵ\beta_\epsilon, Lemma 1 fails under R^(;βϵ)\hat{R}'(\cdot; \beta_\epsilon), the conditioning event EτE_{\tau^*} has positive measure, and I(τ)I(\tau^*) is well-defined on a positive-measure set of trajectories. Mode A is the only mode of intervention that recovers what the reclamation theorem forecloses.

Proof.

(a) Taking τθref\tau* \ge \theta_{\mathrm{ref}} and applying Lemma 1: S(τ;θ)C(cv)eτ/δminS(\tau*; \theta*) \le C(c_{v})\cdot e^{-\tau*/\delta_{\min}}. Since θ~(0)=θ\tilde\theta(0) = \theta* by Lemma 2(iii), P(0) equals this operationally-zero bound.

(b) Continuity of P(β)P(\beta): by Lemma 2(i), βθ~(β)\beta ↦ \tilde\theta(\beta) is continuous on [0, \bar\beta). The map θPr(τ(k)>τ;θ)\theta ↦ \mathrm{Pr}(\tau^{(k)} > \tau*; \theta) is continuous on Θ by (A1) (continuous dependence of the inter-arrival distribution on policy parameters, via the bounded-variance noise assumption). The composition is therefore continuous.

(c) Strict positivity of ϵmax\epsilon_{\max}: by Lemma 2(iii), B(θ~(β))B(\tilde\theta(\beta)) is strictly increasing on (0, \bar\beta), with B(θ~(0))=0B(\tilde\theta(0)) = 0 and Bmaxθ~>0B^{\tilde\theta}_{\max} > 0. The bonus b is supported only on intervals τ>τ\tau > \tau*: explicitly,

B(θ~(β)) = E ⁣[kb(τ(k));θ~(β)]  bmaxE[NT;θ~(β)]P(β).B(\tilde\theta(\beta))\ =\ \mathbb{E}\!\left[\sum_{k} b(\tau^{(k)});\, \tilde\theta(\beta)\right]\ \le\ b_{\max} \cdot \mathbb{E}[N_T;\, \tilde\theta(\beta)] \cdot P(\beta).

Hence B(θ~(β))>0P(β)>0B(\tilde\theta(\beta)) > 0 ⟹ P(\beta) > 0. Taking the supremum, ϵmaxBmaxθ~/(bmaxsupβE[NT;θ~(β)])>0\epsilon_{\max} \ge B^{\tilde\theta}_{\max}/(b_{\max}\cdot\sup_{\beta} E[N_{T}; \tilde\theta(\beta)]) > 0.

For the IVT conclusion: by (a), P(0) is operationally zero; by (c), supβP(β)=ϵmax>0\sup_{\beta} P(\beta) = \epsilon_{\max} > 0; by (b), P is continuous. By the intermediate value theorem applied to P on [0, \bar\beta), for any ϵ(0,ϵmax)\epsilon \in (0, \epsilon_{\max}), there exists βϵ\beta_{\epsilon} with P(βϵ)=ϵP(\beta_{\epsilon}) = \epsilon. The continuity of P and the non-decreasing nature of B(θ~(β))B(\tilde\theta(\beta)) in β\beta (Lemma 2(ii)), composed with the monotone relationship between B and P via the bonus structure, give P(β)ϵP(\beta) \ge \epsilon for β\beta in a right-neighborhood of βϵ\beta_{\epsilon}. ∎

The mode operates on the timescale of TmixT_{\mathrm{mix}}. The intervention itself — a policy change, a regulatory imposition, a structural constraint on π\pi or R^\hat{R} — may be instantaneous in clock-time, but the cohort's empirical distribution converges to the new μπ\mu*_{\pi'} only over the mixing time of the modified loop. This is the framework's most consequential quantitative claim about architectural intervention: evaluations conducted at clock-time substantially less than Tmix(π)T_{\mathrm{mix}}(\pi') will systematically underestimate the intervention's structural effect, and the pattern in the policy literature of declaring failure on the basis of one-quarter or two-quarter observation windows is a temporal misalignment masquerading as an empirical finding.

The architectural mode is also the mode the platform's incentive structure most strongly resists. The bonus b(τm)b(\tau_{m}) and the engagement reward R^\hat{R} are antagonistic in expectation: maximizing B requires τm>τ\tau_{m} > \tau*, which costs (λmax1/τ)(\lambda_{\max} - 1/\tau*) in foregone engagement-event rate. Any non-trivial β\beta is, in expectation, a foregone gradient step on engagement. Voluntary adoption is therefore selected against under engagement competition; the mode under voluntary conditions exists only as marginal experimentation in segments insulated from the dominant incentive. The substantive content of Mode A is therefore the politics of platform regulation — the question of who imposes β\beta, under what authority, with what jurisdictional reach, and against what counter-organized capital. The framework does not adjudicate this politics; it specifies its formal stakes.

5.2 Mode B (artisanal). Intervention at LmL_m and νu\nu_u.

The artisanal mode does not modify π\pi. It does not change Pr(Eτ)\mathrm{Pr}(E_{\tau}). The foreclosure lemma continues to hold under the artisanal mode, and I(τ)I(\tau) at long τ\tau remains undefined-on-trajectory. What the artisanal mode produces is a parallel structure: a user-side self-evaluative distribution νu\nu_{u} maintained against the platform's reward-induced distribution πR^\pi_{\hat{R}} through ongoing maintenance labor.

To state the proposition cleanly, both objects must live on the same sample space. Define πR^\pi_{\hat{R}} as the conditional distribution over the user's behavioral states utS,C,P,Ku_{t}^{S,C,P,K} that would maximize the platform's reward R^\hat{R} given current observations — the platform's implicit "ideal user" distribution. Define νu\nu_{u} as the user's self-evaluative distribution over the same space. Under the metric superego (§3.4), DKL(νuπR^)0D_{\mathrm{KL}}(\nu_{u} \| \pi_{\hat{R}}) \to 0. The artisanal intervention preserves a positive gap.

Proposition 2 (Mode B preserves a gap but does not break foreclosure).

Assume the user's internalization dynamics admit a representation:

ddtDKL(νu(t)  πR^)=h0(βu)+r(Lm)=:h(Lm,βu)\frac{d}{dt}\, D_{KL}(\nu_u(t)\ \|\ \pi_{\hat R}) = -h_0(\beta_u) + r(L_m) =: -h(L_m,\, \beta_u)

where (a) h0:[0,)[0,)h_0 : [0, \infty) \to [0, \infty) is continuous, strictly increasing in βu\beta_u (the platform pressure: a continuous, monotone increasing function of α\alpha, the policy’s cumulative confidence H(uy1:t)1H(u \mid y_{1:t})^{-1}, and the duration of exposure tt), with h0(0)=0h_0(0) = 0; (b) r:[0,)[0,)r : [0, \infty) \to [0, \infty) is continuous, strictly increasing in LmL_m, with r(0)=0r(0) = 0; (c) the contraction is reversible — for any βu\beta_u, the inverse function Lm:=r1(h0(βu))L_m^* := r^{-1}(h_0(\beta_u)) exists in (0,Lm,max](0, L_{m,\max}] where Lm,maxL_{m,\max} is the user’s metabolic budget.

Then for LmLm(βu)L_m \ge L_m^*(\beta_u):

(i) DKL(νu(t)πR^)γ>0D_{\mathrm{KL}}(\nu_u(t) \,\|\, \pi_{\hat{R}}) \ge \gamma > 0 is maintained over the user’s exposure horizon, with γ\gamma given by the initial gap DKL(νu(0)πR^)D_{\mathrm{KL}}(\nu_u(0) \,\|\, \pi_{\hat{R}}).

(ii) Lemma 1 continues to hold: Pr(Eτ)0\mathrm{Pr}(E_\tau) \to 0 for τ>θref\tau > \theta_{\mathrm{ref}}.

Remark on the structural assumption. The decomposition dDKL/dt=h0(βu)+r(Lm)dD_{\mathrm{KL}}/dt = -h_{0}(\beta_{u}) + r(L_{m}) is not axiomatic; it is the first-order linearization of the user's internalization dynamics around the operating point νuop\nu_{u}^{\mathrm{op}} where DKL(νuopπR^)=γD_{\mathrm{KL}}(\nu_{u}^{\mathrm{op}} \| \pi_{\hat{R}}) = \gamma. The derivation differs across the three named mechanisms but each produces the same linearized form. The free-energy case is worked here explicitly; the other two are sketched.

Free-energy minimization (worked derivation). Let Fu[νu;πR^]=DKL(νuπR^)H[νu]F_{u}[\nu_{u}; \pi_{\hat{R}}] = D_{\mathrm{KL}}(\nu_{u} \| \pi_{\hat{R}}) - H[\nu_{u}] be the user's variational free energy, where H[νu]H[\nu_{u}] is the entropy of the self-evaluative distribution. Under the active-inference update rule (Friston 2010), νu\nu_{u} descends FuF_{u} at rate βu\beta_{u}:

dνudt = βuνFu = βuνDKL(νuπR^) + βuνH[νu].\frac{d\nu_u}{dt}\ =\ -\beta_u\, \nabla_{\nu} F_u\ =\ -\beta_u\, \nabla_{\nu} D_{KL}(\nu_u\,\|\, \pi_{\hat R})\ +\ \beta_u\, \nabla_{\nu} H[\nu_u].

The maintenance labor adds a counter-gradient term Lme^resistL_{m}\cdot \hat{e}_{\mathrm{resist}}, where e^resist\hat{e}_{\mathrm{resist}} is a unit vector in the direction the user actively maintains νu\nu_{u} distinct from πR^\pi_{\hat{R}} — formally, e^resist=νDKL/νDKL\hat{e}_{\mathrm{resist}} = \nabla_{\nu}D_{\mathrm{KL}}/\|\nabla_{\nu}D_{\mathrm{KL}}\| (the direction of maximum DKLD_{\mathrm{KL}} increase). Then:

dνudt = βuνDKL + βuνH[νu] + LmνDKLνDKL.\frac{d\nu_u}{dt}\ =\ -\beta_u\, \nabla_{\nu} D_{KL}\ +\ \beta_u\, \nabla_{\nu} H[\nu_u]\ +\ L_m\, \frac{\nabla_{\nu} D_{KL}}{\|\nabla_{\nu} D_{KL}\|}.

Differentiating DKLD_{\mathrm{KL}} along this flow:

ddtDKL = νDKLdνudt = βuνDKL2 + βuνDKLνH[νu] + LmνDKL.\frac{d}{dt} D_{KL}\ =\ \nabla_{\nu} D_{KL} \cdot \frac{d\nu_u}{dt}\ =\ -\beta_u\, \|\nabla_{\nu} D_{KL}\|^2\ +\ \beta_u\, \nabla_{\nu} D_{KL} \cdot \nabla_{\nu} H[\nu_u]\ +\ L_m\, \|\nabla_{\nu} D_{KL}\|.

At the operating point νuop\nu_{u}^{\mathrm{op}}, denote νDKLop=:cop\|\nabla_{\nu}D_{\mathrm{KL}}\|_{\mathrm{op}} =: c_{\mathrm{op}} (a constant in the linearization). The entropy gradient νH\nabla_{\nu}H is bounded and approximately orthogonal to νDKL\nabla_{\nu}D_{\mathrm{KL}} generically (entropy is symmetric under permutations of the support, DKLD_{\mathrm{KL}} is not), so the middle term is o(cop2)o(c_{\mathrm{op}}^{2}). Linearizing:

ddtDKL  βucop2 + Lmcop =: h0(βu) + r(Lm),\frac{d}{dt} D_{KL}\ \approx\ -\beta_u\, c_{op}^2\ +\ L_m\, c_{op}\ =:\ -h_0(\beta_u)\ +\ r(L_m),

with h0(βu):=βucop2h_{0}(\beta_{u}) := \beta_{u}\cdot c_{\mathrm{op}}^{2} and r(Lm):=Lmcopr(L_{m}) := L_{m}\cdot c_{\mathrm{op}}. Both linear in their respective arguments; the multiplicative separability is exact in the linearization. The validity of the linearization depends on νu\nu_{u} remaining in a neighborhood of νuop\nu_{u}^{\mathrm{op}} where copc_{\mathrm{op}} is approximately constant — which is precisely what Proposition 2(i) establishes for LmLmL_{m} \ge L_{m}*.

Operant reinforcement (sketch). Under a Rescorla-Wagner update νu(t+dt)=νu(t)+αRW(πR^νu(t))dt\nu_{u}(t+dt) = \nu_{u}(t) + \alpha_{RW}\cdot(\pi_{\hat{R}} - \nu_{u}(t))\cdot dt, with αRW\alpha_{RW} the learning rate (proportional to the platform's reinforcement schedule density), dDKL/dt=αRW(linearized correction term)dD_{\mathrm{KL}}/dt = -\alpha_{RW}\cdot\text{(linearized correction term)}. Identifying βu\beta_{u} with αRW\alpha_{RW} gives h0(βu)=βucop(RW)h_{0}(\beta_{u}) = \beta_{u}\cdot c_{\mathrm{op}}^{(RW)} with a different constant cop(RW)c_{\mathrm{op}}^{(RW)} depending on the reward prediction-error variance. The maintenance term has the same structural form r(Lm)Lmr(L_{m}) ∝ L_{m} under bounded counter-conditioning capacity.

Social comparison (sketch). Under Bayesian belief revision with platform-surfaced peer evidence at rate λsurf\lambda_{surf} (proportional to platform pressure), νu\nu_{u} shifts toward the surfaced peer distribution at rate βuλsurfcredibility\beta_{u} ∝ \lambda_{surf}\cdot \mathrm{credibility}. DKLD_{\mathrm{KL}} contracts at rate concave in βu\beta_{u} (Bayesian information bound). r(Lm)r(L_{m}) is the rate of extra-platform peer-group maintenance; concave in LmL_{m} under diminishing returns.

In all three cases, the operating-point linearization produces the parametric form assumed in Proposition 2, with r and h0h_{0} monotone and Lm:=r1(h0(βu))L_{m}* := r^{-1}(h_{0}(\beta_{u})) well-defined. The framework's commitment to mechanism-agnosticism is therefore not an absence of mechanism but the explicit identification of a structural decomposition consistent with all three named mechanisms in their operating-point linearization. The proposition's conclusion holds under any mechanism with this structure; the explicit free-energy derivation is provided as the cleanest worked example.

Proof.

(i) By the assumed parametric structure, dDKL/dt=h(Lm,βu)dD_{\mathrm{KL}}/dt = -h(L_{m}, \beta_{u}). For Lm=Lm(βu):=r1(h0(βu))L_{m} = L_{m}*(\beta_{u}) := r^{-1}(h_{0}(\beta_{u})), we have r(Lm)=h0(βu)r(L_{m}*) = h_{0}(\beta_{u}) and h(Lm,βu)=0h(L_{m}*, \beta_{u}) = 0 — the contraction is exactly balanced by the resistance. For Lm>LmL_{m} > L_{m}*, r(Lm)>h0(βu)r(L_{m}) > h_{0}(\beta_{u}) and h(Lm,βu)<0h(L_{m}, \beta_{u}) < 0 — the resistance dominates and DKLD_{\mathrm{KL}} is non-decreasing. Hence DKL(νu(t)πR^)DKL(νu(0)πR^)=:γ>0D_{\mathrm{KL}}(\nu_{u}(t) \| \pi_{\hat{R}}) \ge D_{\mathrm{KL}}(\nu_{u}(0) \| \pi_{\hat{R}}) =: \gamma > 0 over the exposure horizon, provided LmLm(βu(t))L_{m} \ge L_{m}*(\beta_{u}(t)) for all t (the threshold may rise with platform pressure, requiring LmL_{m} to scale; this is the metabolically bounded qualification below).

(ii) The closed-loop dynamics (1)–(4) do not depend on νu\nu_{u}: they depend on the platform's policy π\pi, which evolves by stochastic gradient ascent on R^\hat{R} given y1:ty_{1:t}. The user's νu\nu_{u} may affect the user's behavioral output, which affects yty_{t}, which affects the policy's update. But under (A1)–(A6), the policy still converges to a θ\theta* at which λˉ(θ)=λmax\bar{\lambda}(\theta*) = \lambda_{\max}, and the foreclosure argument in Lemma 1 goes through unchanged. Indeed, if the user under artisanal resistance is less engagement-prone (the behavioral yty_{t} shows less response to delivered stimuli), the policy may intensify — fire more frequently, target more aggressively — to elicit the engagement that satisfies (A4). This strengthens the foreclosure rather than weakening it: the user's νu\nu_{u} may be preserved, but the platform's Pr(Eτ)\mathrm{Pr}(E_{\tau}) is driven harder toward zero. ∎

The mode is bounded in three structurally distinct ways, each of which has political consequences the prose discourse around individual practice does not typically acknowledge.

Cohort-bounded. The maintenance labor's reference point — the νu\nu_{u} distinct from πR^\pi_{\hat{R}} that the user is trying to preserve — exists only if it was constructed before or partly outside the closure. For a user whose νu\nu_{u} developed entirely under closed-loop conditions, DKL(νuπR^)D_{\mathrm{KL}}(\nu_{u} \| \pi_{\hat{R}}) is approximately zero at the outset; the artisanal target is not well-defined. This is the formal substrate of the cohort gradient (§6.1).

Metabolically bounded. LmL_{m} is bounded above by the user's available energetic and attentional budget. The framework's claim is that this budget is structurally constrained in late capitalism by conditions exogenous to the loop but co-temporal with it: precarity, the maintenance labor of social reproduction, the absent or contracting welfare state, the second and third shifts. This is an empirical-political claim, not a structural derivation from (1)–(5); the framework asserts its consistency with the literature on time-poverty and social reproduction but does not prove it.

Non-generalizable. The mode produces a parallel structure for one user. The cohort's stationary distribution μπ\mu*_{\pi} is determined by the policy, and the population-level analytics on which platform optimization runs are insensitive to the artisanal practice of any individual member. Mode B is therefore real for the user who practices it and approximately invisible at the population level. The political consequence is that critique pitched at the artisanal level — the discourse of self-care, digital minimalism, intentional consumption, attention reclamation as an individual project — addresses something real about individual practice and structurally nothing about the form of the loop.

5.3 Mode C (disruption-as-form). Intervention at ηt\eta_t.

The third mode operates on neither the policy nor the user's internal dynamics, but on the interface between them — the observation channel through which the engagement signal yty_{t} is produced. The dividual condition is the formal claim that H(uty1:t)0H(u_{t} | y_{1:t}) \to 0: under prolonged exposure, the platform's posterior over the user's full state converges in entropy to a point mass. The disruption-as-form mode prevents this contraction by injecting structural variance into ηt\eta_{t}.

Proposition 3 (Mode C bounds surplus but does not break foreclosure).

Suppose the user’s behavioral practice (deliberate inauthenticity, obfuscation, modulation of engagement timing, refusal of legibility) results in an inflated observation-noise variance Var(ηt)ση2>ση,02\mathrm{Var}(\eta_t) \ge \sigma^2_\eta > \sigma^2_{\eta,0}, where ση,02\sigma^2_{\eta,0} is the baseline observation noise. Then under (A1)–(A6):

(i) The platform’s residual entropy satisfies H(uty1:t)Hres(ση2)>0H(u_t \mid y_{1:t}) \ge H_{\mathrm{res}}(\sigma^2_\eta) > 0 for all tt, with Hres()H_{\mathrm{res}}(\cdot) strictly increasing.

(ii) The achievable expected reward satisfies E[R^;θ(ση2)]<E[R^;θ(ση,02)]\mathbb{E}[\hat{R}; \theta^*(\sigma^2_\eta)] < \mathbb{E}[\hat{R}; \theta^*(\sigma^2_{\eta,0})], where θ(ση2)\theta^*(\sigma^2_\eta) is the policy limit under noise level ση2\sigma^2_\eta. Consequently, Σ(T)\Sigma(T) under Mode C is strictly bounded below Σ(T)\Sigma(T) under contracted observation.

(iii) Lemma 1 continues to hold: Pr(Eτ)0\mathrm{Pr}(E_\tau) \to 0 for τ>θref\tau > \theta_{\mathrm{ref}}.

Proof.

(i) By Bayes' rule, the platform's posterior p(uty1:t)p(u_{t} | y_{1:t}) is computed under likelihood p(ytut)φ(ytg(ut);ση2)p(y_{t} | u_{t}) ∝ \varphi(y_{t} - g(u_{t}); \sigma^{2}_{\eta}), where φ\varphi is the noise density. As ση2\sigma^{2}_{\eta} increases, the likelihood becomes less informative and the posterior contracts less. By the standard information-theoretic relation between posterior entropy and observation noise:

H(ut  y1:t)H(ut)I(ut;y1:t)H(ut)I(ut;g(ut)+ηt)H(u_t\ |\ y_{1:t}) \ge H(u_t) - I(u_t;\, y_{1:t}) \ge H(u_t) - I(u_t;\, g(u_t) + \eta_t)

The right-hand side is decreasing in ση2\sigma^{2}_{\eta} (mutual information drops as noise rises), and is bounded below by a strictly positive constant for any finite ση2\sigma^{2}_{\eta}.

(ii) By definition yt=g(ut)+ηty_{t} = g(u_{t}) + \eta_{t}, so the engagement signal is a noisy observation of g(ut)g(u_{t}). The mutual information I(ut;yt)I(u_{t}; y_{t}) is strictly decreasing in ση2\sigma^{2}_{\eta} (Cover & Thomas, Elements of Information Theory, Ch. 9). By a Blackwell-style sufficiency argument: if the observation channel at noise ση2\sigma^{2}_{\eta} is a garbling of the channel at noise ση,02<ση2\sigma^{2}_{\eta,0} < \sigma^{2}_{\eta} (which it is, since additive Gaussian noise of higher variance is informationally weaker), then the optimal expected reward under the higher-noise channel is no greater than under the lower-noise channel — formally, supπE[R^;π,ση2]supπE[R^;π,ση,02]sup_{\pi} E[\hat{R}; \pi, \sigma^{2}_{\eta}] \le sup_{\pi} E[\hat{R}; \pi, \sigma^{2}_{\eta,0}] — with strict inequality whenever R^\hat{R} depends non-trivially on utu_{t} (which it does by (A2): R^\hat{R} is engagement-driven and engagement is utu_{t}-targeted). Hence E[R^;θE[\hat{R}; \theta*(ση2)]<E[R^;θ\sigma^{2}_{\eta})] < E[\hat{R}; \theta*(ση,02)]\sigma^{2}_{\eta,0})], and Σ(T)\Sigma(T) is correspondingly bounded.

(iii) The foreclosure argument in Lemma 1 depends on the policy gradient-ascending on R^\hat{R} and on (A2)–(A5). None of these is changed by an increase in ση2\sigma^{2}_{\eta}: the policy still updates, still seeks to maximize E[R^]E[\hat{R}], still has access to a sufficiently expressive policy class, and still converges. The maximum achievable rate may be lower under disrupted observation (the policy targets less effectively), but the policy still drives toward whatever maximum is achievable, and the inter-stimulus interval distribution still concentrates near the minimum technical latency δmin\delta_{\min}. Hence Pr(Eτ)0\mathrm{Pr}(E_{\tau}) \to 0 for τ>θref\tau > \theta_{\mathrm{ref}}. ∎

A fourth claim sometimes attributed to Mode C — that the cohort's mixing time TmixT_{\mathrm{mix}} strictly increases under disrupted observation — is not established by this proposition and is left here as a conjecture (cf. §8). The argument that disrupted observation slows convergence depends on whether the policy's "pull" toward μπ\mu*_{\pi} is the dominant convergence driver (in which case weaker pull means slower mixing) or whether the diffusion of the effective dynamics under uncertain targeting speeds convergence to a more diffuse but reachable stationary distribution. The framework's intuition is the former, but the proof requires additional structure on f, g, and the policy class that the framework does not currently specify.

Disruption-as-form is the framework's honest answer to the question of what subjective political practice remains under the closure. It does not recover what has been foreclosed. It does not require the cohort-bounded conditions Mode B depends on. It does not require the external imposition Mode A requires. It operates at the level of the user's relation to the observation function, and the political form it takes is, properly, a form: camouflage, inauthenticity, refusal of legibility, the strategic noise that prevents the apparatus from collapsing the four-register coupling onto a single point.

The mode is also the most easily mistaken — for substantive politics, or for pure aestheticism. Practiced as an end in itself, without architectural pressure for which the disruption is a holding tactic, Mode C becomes a politics of refusal that is also a politics of impotence: structurally legible as opposition, structurally incapable of producing the change opposition is supposed to produce. The framework's claim is not that Mode C substitutes for Mode A. It is that Mode C is what subjective practice can do while Mode A is the substantive political project. Without that pairing, the disruption is decoration.

5.4 Asymmetry, timescale alignment, and the intervention window.

The three modes are not interchangeable, and their non-interchangeability is formal rather than rhetorical. Propositions 1, 2, and 3 together establish the structural asymmetry:

  • Proposition 1 (Mode A) breaks Lemma 1.
  • Proposition 2 (Mode B) preserves a positive DKLD_{\mathrm{KL}} between νu\nu_{u} and πR^\pi_{\hat{R}} but does not break Lemma 1.
  • Proposition 3 (Mode C) bounds Σ(T)\Sigma(T) below E[R^]E[\hat{R}]'s supremum and preserves Hres>0H_{\mathrm{res}} > 0 but does not break Lemma 1.

The framework does not weight the modes ethically — Mode B is not a lesser politics than Mode A, nor Mode C than either — but it specifies that they address structurally different problems. Mode A is the only mode that addresses what the theorem forecloses. The others address what remains structurally possible while the foreclosure stands. Mistaking the latter for the former is the dominant failure mode of the literature.

Each mode operates on a distinct timescale, and each timescale corresponds to a specific point in the apparatus:

  • Mode A operates on Tmix(π)T_{\mathrm{mix}}(\pi'), the inverse of the spectral gap of the transition operator under the modified policy. For typical engagement-monetized π\pi, TmixT_{\mathrm{mix}} is on the order of months to years.
  • Mode B operates on the user's LmL_{m} metabolic cycle, bounded above by the available energetic budget and below by the marginal effectiveness of the maintenance labor. The effective timescale is daily to weekly.
  • Mode C operates per-event, with the structural effect (the bounding of Σ(T)\Sigma(T)) compounding through prolonged practice; the relevant aggregate timescale is the user's exposure horizon.

The intervention window is the framework's claim that interventions imposed at one timescale cannot be evaluated, or substituted for, by interventions imposed at another. The dominant pattern of contemporary critique is precisely this confusion: addressing the architectural problem at the artisanal level (the self-care literature treating individual practice as the answer to structural form); addressing the artisanal problem at the architectural level (techno-utopian platform reform that does not engage the user-side dynamics of cohort displacement); addressing both through pure Mode C (the empty form of refusal that produces aesthetic dissonance and no structural change).

The intervention window is also non-fungible. There is no rate at which Mode B intervention substitutes for Mode A; there is no exchange ratio between LmL_{m} and TmixT_{\mathrm{mix}}. The substitutability assumption — that enough individual practice eventually adds up to structural change, that enough architectural intervention eventually substitutes for individual effort — is the political form of the timescale confusion. The framework rejects it formally: the three timescales correspond to three formally distinct quantities in the apparatus, and they do not exchange.

What the framework does not adjudicate is the politics between the modes. Mode A is the substantive political project descending from the theorem; this is a structural claim. Whether it should be pursued, by whom, under what political form, against what counter-organized capital — these are questions the framework specifies the stakes of without answering. The available subjective practice while Mode A is pursued is Mode C. The available individual practice, cohort-bounded as it is, is Mode B. The framework's contribution is to make the formal differences explicit and the timescale confusions costly to maintain.


§6. Other named results

6.1 Cohort gradient (developmental moderation of I(τ)). Let u0(i)u_{0}^{(i)} denote the user-state at the time of first exposure to the closed-loop regime for user i. Let aia_{i} denote the developmental stage at first exposure. Under the framework, I(τ)I(\tau) at long lags is moderated by aia_{i}: users exposed at earlier developmental stages have lower realized I(τ)I(\tau) even after controlling for total exposure duration. The mechanism is that νu\nu_{u} (the user's self-evaluative distribution; see §3.4 and §5.2) develops under the closure, and the cohort that constructs νu\nu_{u} under the closure has no exogenous baseline to which it can return — Proposition 2's resistance threshold LmL_{m}* is therefore inaccessible because γ0\gamma \approx 0 at the outset. This is the framework's most direct empirical prediction. It is falsifiable: under sufficient data on attentional autocorrelation across age-of-first-exposure cohorts, the prediction either holds or it does not. The predicted curve and the falsification threshold are rendered at /the-dynamics/cohort-gradient.

6.2 Mixing time (TmixT_{\mathrm{mix}}). Let PπP_{\pi} be the transition operator induced on the user-state by the closed loop under policy π\pi. The mixing time Tmix(π)T_{\mathrm{mix}}(\pi) is the time required for the empirical distribution of ut(i)u_{t}^{(i)} across the cohort to converge to within ε (in total variation) of μπ\mu*_{\pi}. Under (1)–(4), Tmix(π)T_{\mathrm{mix}}(\pi) is finite and bounded by Tmix(π)(1/γπ)log(1/ϵ)T_{\mathrm{mix}}(\pi) \le (1/\gamma_{\pi}) \cdot \mathrm{log}(1/\epsilon), where γπ\gamma_{\pi} is the spectral gap of PπP_{\pi}. The framework's claim is that Tmix(π)T_{\mathrm{mix}}(\pi) is non-trivial — on the order of months to years rather than days — for engagement-monetized policies. Mode A intervention replaces π\pi with π\pi'; the new mixing time Tmix(π)T_{\mathrm{mix}}(\pi') governs the timescale over which the modified cohort distribution converges to μπ\mu*_{\pi'}. Mode B intervention does not modify π\pi and so does not modify TmixT_{\mathrm{mix}}; its effect is on individual νu\nu_{u} trajectories within the unchanged cohort dynamics. Mode C intervention modifies the effective observation channel but its effect on TmixT_{\mathrm{mix}} is direction-uncertain (Open Question 8.1).

6.3 Surplus extraction (per-register decomposition). From (5), the per-register surplus Σi(T)\Sigma^{i}(T) is observable through the marked point process NtiN_{t}^{i} and the kernel ρi\rho_{i}. The framework's claim is that under any π\pi converged on R^\hat{R}, the joint distribution of Σi(T)\Sigma^{i}(T) across i approaches a specific operating point: not equal extraction across registers, but extraction proportional to the marginal yield of each register conditional on the cross-excitation matrix αij\alpha_{ij}. This is observable in the platform's content metadata; it is also a structural prediction.

6.4 Chrono-debt as a survival-functional comparison. From §3.2, D(t,τ)=Λ(τ)Λ(t,τ)D(t, \tau) = \Lambda_{◦}(\tau) - \Lambda(t, \tau). The framework's empirical claim is that Λ(t,τ)\Lambda(t, \tau) under the closed loop converges to a distribution with vanishing mass at long τ\tau, while Λ(τ)\Lambda_{◦}(\tau) retains substantial mass at long τ\tau. The spectrum D(t,τ)D(t, \tau) therefore has a characteristic shape — small at very short τ\tau (where even the closed loop has inter-stimulus intervals), large at intermediate τ\tau (where the policy has driven foreclosure most aggressively), and asymptotic at very long τ\tau (where neither regime ever had measure). The full spectrum is the framework's signature instrument for chronopolitical analysis.


§7. What the model does not claim

The model does not claim that the closed loop is everywhere realized. It claims that where the loop is realized — on the major commercial platforms whose policies satisfy (3) and (4) — the foreclosure lemma (Lemma 1) and the reclamation theorem (Theorem 1) hold structurally. There are domains and devices that do not realize the closure: long-form reading, certain forms of conversation, durational artistic practice, sleep. The framework does not say these are inaccessible; it says they are not the architectural ground on which the contemporary user-state evolves.

The model does not claim that intervention is impossible. The formal treatment of intervention — the three modes (architectural, artisanal, disruption-as-form), their structural asymmetry, and their non-fungible timescales — is given in §5. What the model claims is that intervention is structured: that the three modes address formally distinct points in the apparatus, that only the architectural mode actually breaks the reclamation theorem, that the other two modes produce real and significant effects without recovering what the theorem forecloses, and that the political work which conflates the modes or substitutes one for another does not produce the effect it claims to produce. The framework does not adjudicate which mode should be pursued. It specifies the formal stakes of each.

The model does not claim mechanism. The metric superego, in particular, is mechanism-agnostic by design. Where the formalism is silent on whether convergence is realized by free-energy minimization, by operant conditioning, or by social comparison, the silence is a feature: the framework's prediction is the convergence, not the route. Empirical work that distinguishes the routes is downstream of the framework, not constitutive of it.

The model does not claim falsification-immunity. The reclamation theorem rests on Lemma 1, which rests on (1)–(5) under Assumptions (A1)–(A6). A platform that adopted explicit no-input regularization — augmenting R^\hat{R} with a positive bonus βB\beta\cdot B on inter-stimulus intervals exceeding some τ\tau*, in the sense of Proposition 1, with β\beta large enough to produce Pr(τ(k)>τ)ϵ\mathrm{Pr}(\tau^{(k)} > \tau*) \ge \epsilon for some non-trivial ϵ\epsilon — would falsify the foreclosure lemma by example. The framework's prediction is that no platform under engagement-monetization will do so voluntarily, since voluntary adoption violates (A2); that prediction is itself falsifiable.

The model does not claim ethical conclusions. The closed-loop economy is a description, not a verdict. The verdict that follows from the description — that the contemporary user is not a subject in the sense the historical apparatus presupposed, that political intervention pitched at the historical subject is temporally misaligned, that the body has become the differential of a process whose integral is extracted as surplus — is the work of the prose. The mathematics gives the work its structural form. It does not give it its claim on the reader.


§8. Open formal questions

The framework's central architecture — Lemma 1, Theorem 1, and Propositions 1–3 — is now fully proved under the explicit assumptions (A1)–(A6). What remains genuinely open are three questions of different kinds: a direction-of-effect question (comparative statics under Mode C), a structural-mechanism question (the developmental moderator predicted in §6.1), and a constrained-optimization question (the explicit operating-point characterization in §6.3). They are listed here in descending order of how much weight they carry in the framework's argumentative structure.

8.1 The mixing-time direction under Mode C. Proposition 3 establishes that disrupted observation bounds the platform's expected reward, but does not establish the direction of effect on TmixT_{\mathrm{mix}}. The framework's intuition is that disrupted observation slows convergence (the policy's targeting weakens, hence the pull toward μπ\mu*_{\pi} weakens, hence the spectral gap shrinks). The competing intuition is that disrupted observation diffuses the effective dynamics to a more reachable but more entropic stationary distribution, which mixes faster. Resolving the direction requires additional structure on the response dynamics f and the policy class that the framework does not currently specify. The framework's prose treatment in §5.3 has been amended to flag this rather than assert the unproved direction. This is the most consequential open question, because Mode C's strategic role in the political-action ledger depends partly on whether it slows the cohort's capture or merely modifies its endpoint.

8.2 The cohort gradient mechanism. The cohort gradient (§6.1) — that I(τ)I(\tau) at long lags is moderated by aia_{i}, the developmental stage at first exposure — is asserted as a structural prediction but the formal mechanism connecting (1)–(5) to a developmentally-modulated I(τ)I(\tau) is not derived. The argument is intuitive: pre-closure users have a νu\nu_{u} distinct from πR^\pi_{\hat{R}} at first exposure; post-closure users do not; Proposition 2's LmL_{m}* is therefore inaccessible (since γ0\gamma \approx 0 at the outset). A formal derivation would specify the developmental learning dynamics of νu\nu_{u} — how νu(0)\nu_{u}(0) depends on aia_{i} — and the contractive properties of f under closed-loop sts_{t} relative to pre-closure conditions. The framework presents the cohort gradient as falsifiable in its empirical form; the structural derivation is a research target.

8.3 The surplus-extraction operating point. §6.3 asserts that Σi(T)\Sigma^{i}(T) approaches a specific operating point — extraction proportional to the marginal yield of each register conditional on the cross-excitation matrix αij\alpha_{ij}. The argument is intuitive (under any π\pi converged on R^\hat{R}, the policy distributes extraction to equate marginal yields) but the explicit derivation requires modeling the constraints on extraction (retention, saturation, attrition) and applying a Lagrangian or KKT argument over the policy's action space. The framework treats this as a downstream technical exercise; the qualitative claim that distribution is unequal across registers and structured by αij\alpha_{ij} is what the framework argumentatively requires.

Resolved in this revision.

The following questions, flagged in earlier drafts, have been addressed by the present revision:

The strict vs. operational form of Lemma 1. Now addressed by the Remark following Lemma 1, which makes explicit the operational form (effectively zero on the timescales of interest, e103e^{-10^{3}} to e105e^{-10^{5}} under realistic θref/δmin\theta_{\mathrm{ref}}/\delta_{\min} ratios) and notes the additional conditions required for the strict form. The framework's argument is unaffected by the distinction.

The parametric-continuity argument in Proposition 1. Now formalized as Lemma 2, with explicit appeal to the implicit function theorem under the non-degeneracy assumption (A6) and the standard envelope-theorem properties. The sign error in the original regularization formulation (penalty for long pauses, which was the wrong direction) has been corrected to a bonus for long pauses.

The measure-theoretic conditioning subtlety in Theorem 1, Step 2. Now explicit, with reference to Kallenberg's treatment of disintegration and a clean operational statement: any sample-based estimator I^(τ)\hat{I}(\tau) has variance scaling as 1/(NobsPr(Eτ))1/(N_{obs}\cdot \mathrm{Pr}(E_{\tau})) and diverges as Pr(Eτ)0\mathrm{Pr}(E_{\tau}) \to 0, regardless of the chosen disintegration. The operational meaning of "undefined-on-trajectory" is therefore preserved.

The parametric premise of Proposition 2. The decomposition dDKL/dt=h0(βu)+r(Lm)dD_{\mathrm{KL}}/dt = -h_{0}(\beta_{u}) + r(L_{m}) is now derived explicitly as the first-order linearization around the operating point νuop\nu_{u}^{\mathrm{op}}. The free-energy minimization case is worked in full — under the variational free-energy gradient with maintenance counter-gradient Lme^resistL_{m}\cdot \hat{e}_{\mathrm{resist}}, the linearized dynamics give h0(βu)=βucop2h_{0}(\beta_{u}) = \beta_{u}\cdot c_{\mathrm{op}}^{2} and r(Lm)=Lmcopr(L_{m}) = L_{m}\cdot c_{\mathrm{op}} with cop:=νDKLopc_{\mathrm{op}} := \|\nabla_{\nu}D_{\mathrm{KL}}\|_{\mathrm{op}}. The operant reinforcement and social comparison cases produce the same linearized form with different constants. The framework's commitment to mechanism-agnosticism is now formalized: not as an absence of mechanism, but as the explicit identification of a structural decomposition derived (in linearization) from each of the named mechanisms.

The bifurcation/basin-connectedness handwave in Lemma 2(iii). Replaced by a clean strict-growth argument: provided B(θ)0\nabla B(\theta*) \ne 0 (which holds generically since B and F have distinct extremizers — the F-maximizer fires at λmax\lambda_{\max}, the B-maximizer at 1/τ1/\tau*), the path βθ~(β)\beta ↦ \tilde\theta(\beta) produces strictly positive B(θ~(β))>0B(\tilde\theta(\beta)) > 0 for sufficiently small β>0\beta > 0. This implies ϵmax:=supβP(β)>0\epsilon_{\max} := \sup_{\beta} P(\beta) > 0, which is the substantive content the framework requires. The full claim P(β)1 as βP(\beta) \to 1 \text{ as } \beta \to \infty requires the additional generic regularity (no codimension-one bifurcations along the path), but Proposition 1 needs only ϵmax>0\epsilon_{\max} > 0 — which holds without the bifurcation regularity.

The metabolically-bounded premise of Mode B. The empirical-political claim about late-capitalist constraints on Lm,maxL_{m,max} is now explicitly marked as external to the formal model — consistent with but not derived from (1)–(5). The framework asserts its consistency with the literature on time-poverty and social reproduction without claiming to prove it.

These three remaining open questions (8.1–8.3) do not undermine the central argumentative structure. The reclamation theorem follows from Lemma 1 and the conditioning argument; Lemma 1 follows from (A1)–(A6); Propositions 1–3 follow from (A1)–(A6) and the parametric premises stated locally. What remains open is the direction of a comparative-statics claim under Mode C, the formal derivation of a developmental moderator, and the explicit operating-point characterization of the surplus distribution. The framework's structural conclusions — that reclamation is foreclosed, that Mode A is the only mode that breaks the foreclosure, that the modes do not exchange — are not in dispute.


A. Selimović