The Latent Diffusion Model (LDM) is a diffusion model architecture developed by the CompVis (Computer Vision & Learning) group at LMU Munich. Introduced...
19 KB (2,178 words) - 00:05, 21 July 2025
diffusion models, also known as diffusion-based generative models or score-based generative models, are a class of latent variable generative models....
84 KB (14,123 words) - 17:53, 23 July 2025
organizations. Stable Diffusion is a latent diffusion model, a kind of deep generative artificial neural network. Its code and model weights have been released...
67 KB (6,228 words) - 20:51, 6 August 2025
Sora is a diffusion transformer – a denoising latent diffusion model with one Transformer as the denoiser. A video is generated in latent space by denoising...
15 KB (1,303 words) - 20:42, 2 August 2025
Text-to-image models are generally latent diffusion models, which combine a language model, which transforms the input text into a latent representation...
21 KB (1,949 words) - 15:01, 4 July 2025
Runway (company) (section Stable Diffusion)
the company co-released an improved version of their Latent Diffusion Model called Stable Diffusion together with the CompVis Group at Ludwig Maximilian...
17 KB (1,520 words) - 06:58, 1 August 2025
features, which can then be used as a module for other models, such as in a latent diffusion model. Tasks are often categorized as discriminative (recognition)...
31 KB (2,770 words) - 17:17, 16 July 2025
Analysis of the latent space geometry of diffusion models reveals a fractal structure of phase transitions in the latent space, characterized by abrupt changes...
11 KB (1,258 words) - 19:36, 23 July 2025
Retrieved 17 November 2024. "High-Resolution Image Synthesis with Latent Diffusion Models". Computer Vision & Learning Group. Archived from the original...
28 KB (2,279 words) - 20:11, 2 August 2025
Google DeepMind (redirect from Lyria (text-to-music model))
textual descriptions, images, or sketches. Built as an autoregressive latent diffusion model, Genie enables frame-by-frame interactivity without requiring labeled...
98 KB (9,516 words) - 10:17, 7 August 2025
than GANs in early 2021. Latent diffusion model was published in December 2021 and became the basis for the later Stable Diffusion (August 2022). In 2022...
121 KB (10,445 words) - 15:47, 7 August 2025
In natural language processing, latent Dirichlet allocation (LDA) is a generative statistical model that explains how a collection of text documents can...
45 KB (7,462 words) - 00:03, 24 July 2025
be added to the graphical model defining the mixture model. For example, in the common latent Dirichlet allocation topic model, the observations are sets...
58 KB (7,865 words) - 10:48, 7 August 2025
followed by the release of another paper detailing improvements to latent diffusion models for high-resolution image synthesis. Upon the peer review and acceptance...
16 KB (1,404 words) - 00:39, 25 April 2025
directly output continuous actions. This is achieved through the use of diffusion models or flow-matching networks that act as the action decoder. π0 exploited...
25 KB (2,839 words) - 03:31, 25 July 2025
latent tree analysis (HLTA) is an alternative to LDA, which models word co-occurrence using a tree of latent variables and the states of the latent variables...
23 KB (2,392 words) - 14:34, 12 July 2025
Project Latent diffusion model, in machine learning Latitude dependent mantle, a widespread layer of ice-rich material on Mars Liquid drop model of the...
1 KB (149 words) - 10:01, 27 June 2025
Two-alternative forced choice (redirect from Drift-diffusion model)
alternatives could be tracked separately. The drift-diffusion model (DDM) is a well defined model, that is proposed to implement an optimal decision policy...
24 KB (3,233 words) - 21:53, 19 August 2024
DeepSeek; text-to-image models such as Stable Diffusion, Midjourney, and DALL-E; and text-to-video models such as Veo, LTXV and Sora. Technology companies...
155 KB (13,956 words) - 04:40, 6 August 2025
Computer-generated imagery (redirect from Computer-generated anatomical models)
Text-to-image models are generally latent diffusion models, which combine a language model, which transforms the input text into a latent representation...
33 KB (4,115 words) - 19:48, 12 July 2025
Stability AI's well-known AI image generator, Stable Diffusion, originated from a project called Latent Diffusion, developed by researchers at Ludwig Maximilian...
17 KB (1,646 words) - 19:03, 26 July 2025
files. While these files were only several seconds long, the model could also use latent space between outputs to interpolate different files together...
8 KB (391 words) - 20:45, 2 August 2025
Variational autoencoder (category Graphical models)
low-dimensional latent space, it is called the encoder. The decoder is the second neural network of this model. It is a function that maps from the latent space...
27 KB (3,967 words) - 21:16, 2 August 2025
Process Latent Variable Model Locally Linear Embedding Relational Perspective Map DD-HDS homepage RankVisu homepage Short review of Diffusion Maps Nonlinear...
48 KB (6,119 words) - 04:01, 2 June 2025
(1 December 2022). "High-resolution image reconstruction with latent diffusion models from human brain activity": 2022.11.18.517004. doi:10.1101/2022...
40 KB (4,661 words) - 05:26, 2 June 2025
discriminative probabilistic latent variable models (DPLVM) are a type of CRFs for sequence tagging tasks. They are latent variable models that are trained discriminatively...
17 KB (2,065 words) - 18:45, 20 June 2025
Fingerprint (redirect from Latent fingerprint)
called live scan. A "latent print" is the chance recording of friction ridges deposited on the surface of an object or a wall. Latent prints are invisible...
112 KB (12,138 words) - 23:10, 24 July 2025
He, D; Li, YU (2025). "SWA-LDM: Toward Stealthy Watermarks for Latent Diffusion Models". arXiv:2502.10495 [cs.CR]. Passananti, J; Wu, S; Shan, S; Zheng...
30 KB (3,015 words) - 13:13, 3 August 2025
posteriori (MAP) estimates of parameters in statistical models, where the model depends on unobserved latent variables. The EM iteration alternates between performing...
50 KB (7,512 words) - 16:40, 23 June 2025
latent trait is equal to the difficulty of the item, there is by definition a 0.5 probability of a correct response in the Rasch model. A Rasch model...
41 KB (5,234 words) - 15:21, 2 August 2025