Latent_diffusion_model Search Results

Latent diffusion model

The Latent Diffusion Model (LDM) is a diffusion model architecture developed by the CompVis (Computer Vision & Learning) group at LMU Munich. Introduced...

19 KB (2,178 words) - 00:05, 21 July 2025

Diffusion model

diffusion models, also known as diffusion-based generative models or score-based generative models, are a class of latent variable generative models....

84 KB (14,123 words) - 17:53, 23 July 2025

Stable Diffusion

organizations. Stable Diffusion is a latent diffusion model, a kind of deep generative artificial neural network. Its code and model weights have been released...

67 KB (6,228 words) - 20:51, 6 August 2025

Sora (text-to-video model)

Sora is a diffusion transformer – a denoising latent diffusion model with one Transformer as the denoiser. A video is generated in latent space by denoising...

15 KB (1,303 words) - 20:42, 2 August 2025

Text-to-image model

Text-to-image models are generally latent diffusion models, which combine a language model, which transforms the input text into a latent representation...

21 KB (1,949 words) - 15:01, 4 July 2025

Runway (company) (section Stable Diffusion)

the company co-released an improved version of their Latent Diffusion Model called Stable Diffusion together with the CompVis Group at Ludwig Maximilian...

17 KB (1,520 words) - 06:58, 1 August 2025

Unsupervised learning

features, which can then be used as a module for other models, such as in a latent diffusion model. Tasks are often categorized as discriminative (recognition)...

31 KB (2,770 words) - 17:17, 16 July 2025

Latent space

Analysis of the latent space geometry of diffusion models reveals a fractal structure of phase transitions in the latent space, characterized by abrupt changes...

11 KB (1,258 words) - 19:36, 23 July 2025

Flux (text-to-image model)

Retrieved 17 November 2024. "High-Resolution Image Synthesis with Latent Diffusion Models". Computer Vision & Learning Group. Archived from the original...

28 KB (2,279 words) - 20:11, 2 August 2025

Google DeepMind (redirect from Lyria (text-to-music model))

textual descriptions, images, or sketches. Built as an autoregressive latent diffusion model, Genie enables frame-by-frame interactivity without requiring labeled...

98 KB (9,516 words) - 10:17, 7 August 2025

Artificial intelligence visual art

than GANs in early 2021. Latent diffusion model was published in December 2021 and became the basis for the later Stable Diffusion (August 2022). In 2022...

121 KB (10,445 words) - 15:47, 7 August 2025

Latent Dirichlet allocation

In natural language processing, latent Dirichlet allocation (LDA) is a generative statistical model that explains how a collection of text documents can...

45 KB (7,462 words) - 00:03, 24 July 2025

Mixture model

be added to the graphical model defining the mixture model. For example, in the common latent Dirichlet allocation topic model, the observations are sets...

58 KB (7,865 words) - 10:48, 7 August 2025

Joe Penna

followed by the release of another paper detailing improvements to latent diffusion models for high-resolution image synthesis. Upon the peer review and acceptance...

16 KB (1,404 words) - 00:39, 25 April 2025

Vision-language-action model

directly output continuous actions. This is achieved through the use of diffusion models or flow-matching networks that act as the action decoder. π0 exploited...

25 KB (2,839 words) - 03:31, 25 July 2025

Topic model

latent tree analysis (HLTA) is an alternative to LDA, which models word co-occurrence using a tree of latent variables and the states of the latent variables...

23 KB (2,392 words) - 14:34, 12 July 2025

LDM

Project Latent diffusion model, in machine learning Latitude dependent mantle, a widespread layer of ice-rich material on Mars Liquid drop model of the...

1 KB (149 words) - 10:01, 27 June 2025

Two-alternative forced choice (redirect from Drift-diffusion model)

alternatives could be tracked separately. The drift-diffusion model (DDM) is a well defined model, that is proposed to implement an optimal decision policy...

24 KB (3,233 words) - 21:53, 19 August 2024

Generative artificial intelligence (section 3D modeling)

DeepSeek; text-to-image models such as Stable Diffusion, Midjourney, and DALL-E; and text-to-video models such as Veo, LTXV and Sora. Technology companies...

155 KB (13,956 words) - 04:40, 6 August 2025

Computer-generated imagery (redirect from Computer-generated anatomical models)

Text-to-image models are generally latent diffusion models, which combine a language model, which transforms the input text into a latent representation...

33 KB (4,115 words) - 19:48, 12 July 2025

Emad Mostaque

Stability AI's well-known AI image generator, Stable Diffusion, originated from a project called Latent Diffusion, developed by researchers at Ludwig Maximilian...

17 KB (1,646 words) - 19:03, 26 July 2025

Riffusion

files. While these files were only several seconds long, the model could also use latent space between outputs to interpolate different files together...

8 KB (391 words) - 20:45, 2 August 2025

Variational autoencoder (category Graphical models)

low-dimensional latent space, it is called the encoder. The decoder is the second neural network of this model. It is a function that maps from the latent space...

27 KB (3,967 words) - 21:16, 2 August 2025

Nonlinear dimensionality reduction (section Gaussian process latent variable models)

Process Latent Variable Model Locally Linear Embedding Relational Perspective Map DD-HDS homepage RankVisu homepage Short review of Diffusion Maps Nonlinear...

48 KB (6,119 words) - 04:01, 2 June 2025

Brain-reading

(1 December 2022). "High-resolution image reconstruction with latent diffusion models from human brain activity": 2022.11.18.517004. doi:10.1101/2022...

40 KB (4,661 words) - 05:26, 2 June 2025

Conditional random field (redirect from Discriminative probabilistic latent variable model)

discriminative probabilistic latent variable models (DPLVM) are a type of CRFs for sequence tagging tasks. They are latent variable models that are trained discriminatively...

17 KB (2,065 words) - 18:45, 20 June 2025

Fingerprint (redirect from Latent fingerprint)

called live scan. A "latent print" is the chance recording of friction ridges deposited on the surface of an object or a wall. Latent prints are invisible...

112 KB (12,138 words) - 23:10, 24 July 2025

Art of the My Little Pony: Friendship Is Magic fandom

He, D; Li, YU (2025). "SWA-LDM: Toward Stealthy Watermarks for Latent Diffusion Models". arXiv:2502.10495 [cs.CR]. Passananti, J; Wu, S; Shan, S; Zheng...

30 KB (3,015 words) - 13:13, 3 August 2025

Expectation–maximization algorithm

posteriori (MAP) estimates of parameters in statistical models, where the model depends on unobserved latent variables. The EM iteration alternates between performing...

50 KB (7,512 words) - 16:40, 23 June 2025

Rasch model

latent trait is equal to the difficulty of the item, there is by definition a 0.5 probability of a correct response in the Rasch model. A Rasch model...

41 KB (5,234 words) - 15:21, 2 August 2025