Paper

A Variational Lens on RL in Diffusion Models.

DiPOD: Diffusion Policy Optimization without Drifting Apart

Diffusion language models are an exciting alternative to autoregressive language models. Instead of generating a response strictly left-to-right, they can refine many tokens in parallel. This opens the door to faster sampling, flexible …

Can Transformers Do Everything, and Undo It Too?

Large Language Models are Surjective? Injective? Invertible?

Recently, there have been discussions on functional properties of Transformers, the basic building block of Large Language Models (LLM) and many other generative models. My paper ([1]) proves that Transformers can output anything given an …