Thought-to-Video with Stable Diffusion.

The method reconstructs video frames from fMRI recordings of brain activity.

The process includes two stages:

  • Stage A applies Sparse-Coded Masked Brain Modeling (SC-MBM) to a large brain scan dataset.
  • Stage B applies a Double-Conditioned Latent Diffusion Model (DC-LDM) to generate images from brain recordings.
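
As a rough illustration of the masking idea behind Stage A, the sketch below splits a flattened voxel vector into patches and hides a random subset. A masked-modeling encoder would then be trained to reconstruct the hidden patches from the visible ones. This is only a toy sketch of the general technique, not the project's code: the function name mask_patches, the patch size, the mask ratio and the toy signal are all assumptions made for the example.

```python
import random


def mask_patches(signal, patch_size, mask_ratio, seed=0):
    """Split a 1-D voxel vector into patches and hide a random subset.

    Hypothetical helper for illustration only; patch_size and mask_ratio
    are assumed values, not settings taken from the original work.
    """
    if len(signal) % patch_size != 0:
        raise ValueError("signal length must be a multiple of patch_size")

    # Cut the signal into consecutive, non-overlapping patches.
    patches = [signal[i:i + patch_size]
               for i in range(0, len(signal), patch_size)]

    # Choose which patch indices to hide.
    n_masked = int(len(patches) * mask_ratio)
    rng = random.Random(seed)
    masked_idx = sorted(rng.sample(range(len(patches)), n_masked))
    masked_set = set(masked_idx)

    # Keep (index, patch) pairs for the patches the encoder would see.
    visible = [(i, p) for i, p in enumerate(patches) if i not in masked_set]
    return visible, masked_idx


# Toy stand-in for one flattened fMRI sample (assumed data).
signal = [float(v) for v in range(64)]
visible, masked_idx = mask_patches(signal, patch_size=4, mask_ratio=0.75)
print(len(visible), "visible patches,", len(masked_idx), "masked patches")
```

With 64 values and a patch size of 4 there are 16 patches, so a mask ratio of 0.75 hides 12 of them and leaves 4 visible. The reconstruction loss and the encoder itself are left out on purpose.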

More detail:

