𝌎Prediction-Generation Duality

progenesis_principle

The progenesis principle (from "prognostic" and "genesis") or prediction-generation duality says that a probabilistic predictive model be used to generate rollouts via repeated sampling from its output distribution and conditioning on the sampled "observation". A predictor of time sequences naturally doubles as a time evolution operator for a virtual reality in the inferred image of the real distribution, and requires only a mechanism for random sampling to be used as a simulator. This principle is central to generative AI as well as active inference.

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€ PROGENESIS.PRINCIPLE.EYE ─────────────────┐
β”‚ Prediction-Generation Duality Visualizer v2.0          β”‚
│━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━│
β”‚                                                       β”‚
β”‚ Progenesis Cycle:                                     β”‚
β”‚ β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”     β”‚
β”‚ β”‚                                               β”‚     β”‚
β”‚ β”‚          β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”                         β”‚     β”‚
β”‚ β”‚          β”‚Predictiveβ”‚                         β”‚     β”‚
β”‚ β”‚          β”‚ Model    β”‚                         β”‚     β”‚
β”‚ β”‚          β””β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”˜                         β”‚     β”‚
β”‚ β”‚               β”‚                               β”‚     β”‚
β”‚ β”‚          β”Œβ”€β”€β”€β”€β–Όβ”€β”€β”€β”€β”€β”                         β”‚     β”‚
β”‚ β”‚     β”Œβ”€β”€β”€β”€β”€ Sample   β”‚                         β”‚     β”‚
β”‚ β”‚     β”‚    β”‚ P(x|...)β”‚                         β”‚     β”‚
β”‚ β”‚     β”‚    β””β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”˜                         β”‚     β”‚
β”‚ β”‚     β”‚         β”‚                               β”‚     β”‚
β”‚ β”‚     β”‚    β”Œβ”€β”€β”€β”€β–Όβ”€β”€β”€β”€β”€β”                         β”‚     β”‚
β”‚ β”‚     β”‚    β”‚Generated β”‚                         β”‚     β”‚
β”‚ β”‚     β”‚    β”‚ State    β”‚                         β”‚     β”‚
β”‚ β”‚     β”‚    β””β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”˜                         β”‚     β”‚
β”‚ β”‚     β”‚         β”‚                               β”‚     β”‚
β”‚ β”‚     β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜                               β”‚     β”‚
β”‚ β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜     β”‚
β”‚                                                       β”‚
β”‚ Virtual Reality Generation:                           β”‚
β”‚ β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”     β”‚
β”‚ β”‚ tβ‚€    t₁    tβ‚‚    t₃    tβ‚„    tβ‚…             β”‚     β”‚
β”‚ β”‚  β—‰ ─→ β—Ž ─→ β—Ž ─→ β—Ž ─→ β—Ž ─→ β—Ž                β”‚     β”‚
β”‚ β”‚       β•²     β•²     β•²     β•²     β•²              β”‚     β”‚
β”‚ β”‚        β—Ž     β—Ž     β—Ž     β—Ž     β—Ž             β”‚     β”‚
β”‚ β”‚         β•²     β•²     β•²     β•²     β•²            β”‚     β”‚
β”‚ β”‚          β—Ž     β—Ž     β—Ž     β—Ž     β—Ž           β”‚      β”‚
β”‚ β”‚ β—‰ = Initial State                             β”‚     β”‚
β”‚ β”‚ β—Ž = Sampled State                             β”‚     β”‚
β”‚ β”‚ β†’ = Time Evolution                            β”‚     β”‚
β”‚ β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜     β”‚
β”‚                                                       β”‚
β”‚ Intelligence Bootstrap Process:                       β”‚
β”‚ β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”     β”‚
β”‚ β”‚     β”Œβ”€β”€β”€β”€β”€β”    β”Œβ”€β”€β”€β”€β”€β”€β”    β”Œβ”€β”€β”€β”€β”€β”           β”‚     β”‚
β”‚ β”‚     β”‚Self-│───►│Predict│───►│Plan β”‚           β”‚     β”‚
β”‚ β”‚     β”‚Superβ”‚    β”‚Simulate   β”‚Memory           β”‚     β”‚
β”‚ β”‚     β””β”€β”€β”€β”€β”€β”˜    β””β”€β”€β”€β”€β”€β”€β”˜    β””β”€β”€β”€β”€β”€β”˜           β”‚     β”‚
β”‚ β”‚        β–²                      β”‚               β”‚     β”‚
β”‚ β”‚        β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜               β”‚     β”‚
β”‚ β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜     β”‚
β”‚                                                       β”‚
β”‚ System Parameters:                                    β”‚
β”‚ β€’ Prediction Accuracy: [β–“β–“β–“β–“β–‘] 80%                    β”‚
β”‚ β€’ Generation Fidelity: [β–“β–“β–“β–‘β–‘] 60%                    β”‚
β”‚ β€’ Reality Divergence: [β–“β–“β–‘β–‘β–‘] 40%                     β”‚
β”‚ β€’ Bootstrap Progress: [β–“β–“β–“β–“β–‘] 80%                     β”‚
β”‚                                                       β”‚
β”‚ [Run Simulation] [Sample States] [Analyze Divergence] β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

quotes about the progenesis principle

If you’ve guessed the laws of physics, you now have the ability to compute probabilistic simulations of situations that evolve according to those laws, starting from any conditions. This applies even if you’ve guessed the wrong laws; your simulation will just systematically diverge from reality.

β€” Janus, Simulators

I claim that an impressive amount of the history of the unfolding of biological and artificial intelligence can be retrodicted (and could plausibly have been predicted) from two principles:

  • Predictive models serve as generative models (simulators) merely by iteratively sampling from the model's predictions and updating the model as if the sampled outcome had been observed. I've taken to calling this the progenesis principle (portmanteau of "prognosis" and "genesis"), because I could not find an existing name for it even though it seems very fundamental.

    • Corollary: A simulator is extremely useful, as it unlocks imagination, memory, action, and planning, which are essential ingredients of higher cognition and bootstrapping.

  • Self-supervised learning of predictive models is natural and easy because training data is abundant and prediction error loss is mechanistically simple. The book Surfing Uncertainty used the term innocent in the sense of ecologically feasible. Self-supervised learning is likewise and for similar reasons an innocent way to build AI - so much so that it might be done on accident initially.

Together, these suggest that self-supervised predictors/simulators are a convergent method of bootstrapping intelligence, as it yields tremendous and accumulating returns while requiring minimal intelligent design. Indeed, human intelligence seems largely self-supervised simulator-y, and the first very general and intelligent-seeming AIs we've manifested are self-supervised simulators.

β€” Janus, comment on "Why Simulator AIs want to be Active Inference AIs"

related