AI Alignment Research
Exploring the frontiers of interpretability through QFT-based semantic indexing, storyworlds as sparse autoencoders, and games designed to fund AI safety research.
Operating quantum computing systems commercially for semantic indexing applications with bifocal QFT architectures.
Building quantum-enhanced retrieval systems for large-scale semantic indexing with multiple codec types, augment RoPE systems.
Developing bifocal QFT architectures that leverage quantum fourier transforms via semantic codecs to analyze ultra-long context.
Related tools: QuantumBlot,
A storyworld is a formally structured interactive environment: a set of characters, variables, encounters presenting multiple options, formulas for unlocking options and determining the reaction to them, and subsequent effects on variables and encounter transitions, which constrain how narratives can unfold. Unlike linear stories or unstructured text, storyworlds explicitly encode goals, affordances, conflicts, and consequences.
When used for training and as a tool of inter-AI communication, these environments function as compressed, human-legible representations of complex systems. Agents must reason about state, intent, causality, and trade-offs within a bounded semantic space—precisely the dimensions we want to make visible inside learned models. Skills and a Verifiers env have been developed to assist in AI generation and iteration of storyworlds based on user prompts or ranges of historical/literary adaptation, as well as use in the context of diplomacy strategy games. An experiment harness called Storyforge facilitates the study of this output using MLFlow.
This research framework treats storyworlds as a form of narrative sparse autoencoder: structured inputs that encourage models to learn disentangled, interpretable internal features. By aligning model training with how humans naturally reason about agents and worlds, we aim to bridge the gap between mechanistic interpretability and human-understandable concepts.
Related tools: Storyforge, SweepWeave Storyworld Envs
A prototype demonstrating storyworlds-as-interpretability research. Navigate emergent ASI coordination through graph-based strategic gameplay.
Games designed to fund AI alignment research. Interactive narratives that generate revenue while advancing safety-focused AI development. A Netflix-like site for Storyworlds with AI Authoring Tools (In development)
Exploring governance models for artificial superintelligence through game-theoretic frameworks and interactive narrative systems. (Pending Funding)