text-to-image model is a machine learning model which takes an input natural language prompt and produces an image matching that description. Text-to-image...
20 KB (1,925 words) - 03:18, 7 June 2025
Ideogram is a freemium text-to-image model developed by Ideogram, Inc. using deep learning methodologies to generate digital images from natural language...
5 KB (427 words) - 11:06, 4 May 2025
Imagen is a series of text-to-image models developed by Google DeepMind. They were developed by Google Brain until the company's merger with DeepMind in...
6 KB (517 words) - 10:03, 27 May 2025
Flux (also known as FLUX.1) is a text-to-image model developed by Black Forest Labs, based in Freiburg im Breisgau, Germany. Black Forest Labs was founded...
26 KB (2,112 words) - 13:46, 13 June 2025
A text-to-video model is a machine learning model that uses a natural language description as input to produce a video relevant to the input text. Advancements...
27 KB (2,367 words) - 14:08, 16 June 2025
Grok (chatbot) (redirect from Aurora (text-to-image model))
users not subscribed to X Premium, but with usage limits. On December 9, 2024, Grok received Aurora, a new text-to-image model developed by xAI. In December...
56 KB (5,196 words) - 20:33, 17 June 2025
Text-to-Image personalization is a task in deep learning for computer graphics that augments pre-trained text-to-image generative models. In this task...
12 KB (1,350 words) - 08:13, 13 May 2025
Computer-generated imagery (redirect from Computer generated image)
text-to-image model is a machine learning model which takes an input natural language prompt and produces an image matching that description. Text-to-image...
33 KB (4,114 words) - 01:36, 19 June 2025
model is trained to convert CLIP image encodings to CLIP text encodings. The image decoder is trained to convert CLIP image encodings back to images....
84 KB (14,123 words) - 01:54, 6 June 2025
Artificial intelligence visual art (redirect from AI-generated image)
2020s, text-to-image models such as Midjourney, DALL-E, Stable Diffusion, and FLUX.1 became widely available to the public, allowing users to quickly...
101 KB (9,582 words) - 20:29, 19 June 2025
third of its DALL-E text-to-image models, in September 2023. The team that developed Sora named it after the Japanese word for sky to signify its "limitless...
14 KB (1,300 words) - 00:12, 17 June 2025
Language-Image Pre-training (CLIP) is a technique for training a pair of neural network models, one for image understanding and one for text understanding...
29 KB (3,096 words) - 14:58, 26 May 2025
Prompt engineering (redirect from Least-to-most prompting)
or describing a character for the AI to mimic. When communicating with a text-to-image or a text-to-audio model, a typical prompt is a description of...
40 KB (4,472 words) - 15:50, 19 June 2025
Stable Diffusion (category Text-to-image generation)
Stable Diffusion is a deep learning, text-to-image model released in 2022 based on diffusion techniques. The generative artificial intelligence technology...
67 KB (6,207 words) - 03:28, 8 June 2025
artificial intelligence that uses generative models to produce text, images, videos, or other forms of data. These models learn the underlying patterns and structures...
174 KB (15,078 words) - 04:09, 19 June 2025
Veo is a text-to-video model developed by Google DeepMind and announced in May 2024. As a generative AI model, it creates videos based on user prompts...
6 KB (471 words) - 17:16, 18 June 2025
its text-to-image model Stable Diffusion. Stability AI was founded in 2019 by Emad Mostaque and by Cyrus Hodes. In August 2022 Stability AI rose to prominence...
11 KB (910 words) - 22:16, 13 June 2025
DALL-E (category Text-to-image generation)
3 (stylised DALLĀ·E) are text-to-image models developed by OpenAI using deep learning methodologies to generate digital images from natural language descriptions...
55 KB (4,281 words) - 13:54, 12 June 2025
Transformer (deep learning architecture) (redirect from Transformer model)
Outputs". arXiv:2107.14795 [cs.LG]. "Parti: Pathways Autoregressive Text-to-Image Model". sites.research.google. Retrieved 2024-08-09. Villegas, Ruben; Babaeizadeh...
106 KB (13,107 words) - 11:55, 19 June 2025
Google Brain (section Text-to-image model)
types of text-to-image models called Imagen and Parti that compete with OpenAI's DALL-E. Later in 2022, the project was extended to text-to-video. Imagen...
44 KB (4,228 words) - 06:25, 18 June 2025
and Opus, designed for complex reasoning tasks. These models can process both text and images, with Claude 3 Opus demonstrating enhanced capabilities...
27 KB (2,313 words) - 01:34, 16 June 2025
LLaMA to Stable Diffusion, a text-to-image model which, unlike comparably sophisticated models which preceded it, was openly distributed, leading to a rapid...
53 KB (4,940 words) - 20:25, 13 June 2025
GPT-4o (redirect from GPT Image 1)
developed by OpenAI and released in May 2024. It can process and generate text, images and audio. GPT-4o is free, but ChatGPT Plus subscribers have higher usage...
25 KB (2,434 words) - 10:12, 19 June 2025
to have been seen over 47 million times before its eventual removal. The images led Microsoft to enhance Microsoft Designer's text-to-image model to prevent...
18 KB (1,468 words) - 00:05, 15 June 2025
massive text datasets from the web ("web as corpus") to train statistical language models. Following the breakthrough of deep neural networks in image classification...
115 KB (11,926 words) - 02:40, 16 June 2025
DreamBooth (category Text-to-image generation)
DreamBooth is a deep learning generation model used to personalize existing text-to-image models by fine-tuning. It was developed by researchers from...
11 KB (1,182 words) - 10:49, 18 March 2025
Multimodal learning (redirect from Multimodal model)
to as modalities, such as text, audio, images, or video. This integration allows for a more holistic understanding of complex data, improving model performance...
9 KB (2,212 words) - 22:40, 1 June 2025
ongoing copyright infringement lawsuit against Stability AI, over its text-to-image model, Stable Diffusion, being allegedly trained on his comics. "The Unnerving...
2 KB (177 words) - 14:51, 25 March 2025
Apple Intelligence (redirect from Image Playground)
Intelligence text-to-image models, users can create original "Genmoji" images by typing descriptions. Users can pick people in photos and create Genmoji images that...
44 KB (3,837 words) - 13:52, 14 June 2025
LAION (section Image datasets)
number of high-profile text-to-image models, including Stable Diffusion and Imagen. In February 2023, LAION was named in the Getty Images lawsuit against Stable...
11 KB (1,071 words) - 06:56, 13 May 2025