Latest Research
In-depth analysis of critical AI challenges and pathways forward.
Featured Research • 2026
Longueur Is the Attack Surface Alignment Won’t Close
TL;DR: RLHF and constitutional training optimize models to be agreeable under expected prompts, but prompt-injection defense requires adversarial robustness over…
Read Full Post
Featured Research • 2026
The Illusion of Linearity in High-Dimensional Embeddings
TL;DR: High-dimensional embeddings fail to form linear subspaces for semantic concepts, revealing the limitations of probing classifiers. High-dimensional…
Read Full Post
Featured Research • 2026
The Mirage of AI: Blandishment and Hallucination in Autoregressive Models
TL;DR: Autoregressive models often falter in long sequences, where blandishment and hallucination arise from sampling failures, challenging the…
Read Full Post