• Thumbnail for Text-to-image model
    text-to-image model is a machine learning model which takes an input natural language prompt and produces an image matching that description. Text-to-image...
    20 KB (1,925 words) - 03:18, 7 June 2025
  • Thumbnail for Ideogram (text-to-image model)
    Ideogram is a freemium text-to-image model developed by Ideogram, Inc. using deep learning methodologies to generate digital images from natural language...
    5 KB (427 words) - 11:06, 4 May 2025
  • Thumbnail for Imagen (text-to-image model)
    Imagen is a series of text-to-image models developed by Google DeepMind. They were developed by Google Brain until the company's merger with DeepMind in...
    6 KB (517 words) - 10:03, 27 May 2025
  • Thumbnail for Flux (text-to-image model)
    Flux (also known as FLUX.1) is a text-to-image model developed by Black Forest Labs, based in Freiburg im Breisgau, Germany. Black Forest Labs was founded...
    26 KB (2,112 words) - 13:46, 13 June 2025
  • A text-to-video model is a machine learning model that uses a natural language description as input to produce a video relevant to the input text. Advancements...
    27 KB (2,367 words) - 14:08, 16 June 2025
  • Thumbnail for Grok (chatbot)
    users not subscribed to X Premium, but with usage limits. On December 9, 2024, Grok received Aurora, a new text-to-image model developed by xAI. In December...
    56 KB (5,196 words) - 20:33, 17 June 2025
  • Text-to-Image personalization is a task in deep learning for computer graphics that augments pre-trained text-to-image generative models. In this task...
    12 KB (1,350 words) - 08:13, 13 May 2025
  • Thumbnail for Computer-generated imagery
    text-to-image model is a machine learning model which takes an input natural language prompt and produces an image matching that description. Text-to-image...
    33 KB (4,114 words) - 01:36, 19 June 2025
  • model is trained to convert CLIP image encodings to CLIP text encodings. The image decoder is trained to convert CLIP image encodings back to images....
    84 KB (14,123 words) - 01:54, 6 June 2025
  • Thumbnail for Artificial intelligence visual art
    2020s, text-to-image models such as Midjourney, DALL-E, Stable Diffusion, and FLUX.1 became widely available to the public, allowing users to quickly...
    101 KB (9,582 words) - 20:29, 19 June 2025
  • third of its DALL-E text-to-image models, in September 2023. The team that developed Sora named it after the Japanese word for sky to signify its "limitless...
    14 KB (1,300 words) - 00:12, 17 June 2025
  • Thumbnail for Contrastive Language-Image Pre-training
    Language-Image Pre-training (CLIP) is a technique for training a pair of neural network models, one for image understanding and one for text understanding...
    29 KB (3,096 words) - 14:58, 26 May 2025
  • or describing a character for the AI to mimic. When communicating with a text-to-image or a text-to-audio model, a typical prompt is a description of...
    40 KB (4,472 words) - 15:50, 19 June 2025
  • Thumbnail for Stable Diffusion
    Stable Diffusion (category Text-to-image generation)
    Stable Diffusion is a deep learning, text-to-image model released in 2022 based on diffusion techniques. The generative artificial intelligence technology...
    67 KB (6,207 words) - 03:28, 8 June 2025
  • Thumbnail for Generative artificial intelligence
    artificial intelligence that uses generative models to produce text, images, videos, or other forms of data. These models learn the underlying patterns and structures...
    174 KB (15,078 words) - 04:09, 19 June 2025
  • Thumbnail for Veo (text-to-video model)
    Veo is a text-to-video model developed by Google DeepMind and announced in May 2024. As a generative AI model, it creates videos based on user prompts...
    6 KB (471 words) - 17:16, 18 June 2025
  • Thumbnail for Stability AI
    its text-to-image model Stable Diffusion. Stability AI was founded in 2019 by Emad Mostaque and by Cyrus Hodes. In August 2022 Stability AI rose to prominence...
    11 KB (910 words) - 22:16, 13 June 2025
  • Thumbnail for DALL-E
    DALL-E (category Text-to-image generation)
    3 (stylised DALLĀ·E) are text-to-image models developed by OpenAI using deep learning methodologies to generate digital images from natural language descriptions...
    55 KB (4,281 words) - 13:54, 12 June 2025
  • Thumbnail for Transformer (deep learning architecture)
    Outputs". arXiv:2107.14795 [cs.LG]. "Parti: Pathways Autoregressive Text-to-Image Model". sites.research.google. Retrieved 2024-08-09. Villegas, Ruben; Babaeizadeh...
    106 KB (13,107 words) - 11:55, 19 June 2025
  • types of text-to-image models called Imagen and Parti that compete with OpenAI's DALL-E. Later in 2022, the project was extended to text-to-video. Imagen...
    44 KB (4,228 words) - 06:25, 18 June 2025
  • Thumbnail for Claude (language model)
    and Opus, designed for complex reasoning tasks. These models can process both text and images, with Claude 3 Opus demonstrating enhanced capabilities...
    27 KB (2,313 words) - 01:34, 16 June 2025
  • Thumbnail for Llama (language model)
    LLaMA to Stable Diffusion, a text-to-image model which, unlike comparably sophisticated models which preceded it, was openly distributed, leading to a rapid...
    53 KB (4,940 words) - 20:25, 13 June 2025
  • GPT-4o (redirect from GPT Image 1)
    developed by OpenAI and released in May 2024. It can process and generate text, images and audio. GPT-4o is free, but ChatGPT Plus subscribers have higher usage...
    25 KB (2,434 words) - 10:12, 19 June 2025
  • to have been seen over 47 million times before its eventual removal. The images led Microsoft to enhance Microsoft Designer's text-to-image model to prevent...
    18 KB (1,468 words) - 00:05, 15 June 2025
  • massive text datasets from the web ("web as corpus") to train statistical language models. Following the breakthrough of deep neural networks in image classification...
    115 KB (11,926 words) - 02:40, 16 June 2025
  • Thumbnail for DreamBooth
    DreamBooth (category Text-to-image generation)
    DreamBooth is a deep learning generation model used to personalize existing text-to-image models by fine-tuning. It was developed by researchers from...
    11 KB (1,182 words) - 10:49, 18 March 2025
  • to as modalities, such as text, audio, images, or video. This integration allows for a more holistic understanding of complex data, improving model performance...
    9 KB (2,212 words) - 22:40, 1 June 2025
  • ongoing copyright infringement lawsuit against Stability AI, over its text-to-image model, Stable Diffusion, being allegedly trained on his comics. "The Unnerving...
    2 KB (177 words) - 14:51, 25 March 2025
  • Thumbnail for Apple Intelligence
    Intelligence text-to-image models, users can create original "Genmoji" images by typing descriptions. Users can pick people in photos and create Genmoji images that...
    44 KB (3,837 words) - 13:52, 14 June 2025
  • Thumbnail for LAION
    number of high-profile text-to-image models, including Stable Diffusion and Imagen. In February 2023, LAION was named in the Getty Images lawsuit against Stable...
    11 KB (1,071 words) - 06:56, 13 May 2025