• large language model (LLM) is a language model trained with self-supervised machine learning on a vast amount of text, designed for natural language processing...
    132 KB (14,012 words) - 14:30, 12 July 2025
  • A large language model (LLM) is a type of machine learning model designed for natural language processing tasks such as language generation. LLMs are language...
    64 KB (3,353 words) - 19:38, 17 June 2025
  • A language model is a model of the human brain's ability to produce natural language. Language models are useful for a variety of tasks, including speech...
    17 KB (2,417 words) - 21:27, 26 June 2025
  • Thumbnail for Llama (language model)
    Llama (Large Language Model Meta AI, formerly stylized as LLaMA) is a family of large language models (LLMs) released by Meta AI starting in February 2023...
    56 KB (5,410 words) - 14:17, 12 July 2025
  • Thumbnail for Claude (language model)
    Claude is a family of large language models developed by Anthropic. The first model was released in March 2023. The Claude 3 family, released in March...
    27 KB (2,345 words) - 23:35, 11 July 2025
  • Reasoning language models (RLMs) are large language models that have been further trained to solve multi-step reasoning tasks. These models perform better...
    24 KB (2,864 words) - 21:02, 11 July 2025
  • Gemini is a family of multimodal large language models (LLMs) developed by Google DeepMind, and the successor to LaMDA and PaLM 2. Comprising Gemini Ultra...
    60 KB (4,676 words) - 22:14, 13 July 2025
  • Generative AI applications like large language models (LLM) are common examples of foundation models. Building foundation models is often highly resource-intensive...
    54 KB (5,550 words) - 11:52, 1 July 2025
  • Chinchilla is a family of large language models (LLMs) developed by the research team at Google DeepMind, presented in March 2022. It is named "chinchilla"...
    8 KB (615 words) - 19:51, 6 December 2024
  • Thumbnail for Generative pre-trained transformer
    A generative pre-trained transformer (GPT) is a type of large language model (LLM) and a prominent framework for generative artificial intelligence. It...
    65 KB (5,276 words) - 21:27, 10 July 2025
  • Large language models have been used by officials and politicians in a wide variety of ways. The Conversation described ChatGPT described as a uniquely...
    17 KB (1,396 words) - 07:17, 26 April 2025
  • improved the state-of-the-art for large language models. As of 2020[update], BERT is a ubiquitous baseline in natural language processing (NLP) experiments...
    32 KB (3,623 words) - 12:46, 7 July 2025
  • Transformer) is a series of large language models developed by Google AI introduced in 2019. Like the original Transformer model, T5 models are encoder-decoder...
    20 KB (1,932 words) - 03:55, 7 May 2025
  • Large Open-science Open-access Multilingual Language Model (BLOOM) is a 176-billion-parameter transformer-based autoregressive large language model (LLM)...
    4 KB (506 words) - 11:48, 25 June 2025
  • language processing including language and text generation. Unlike large language models (LLMs), small language models are much smaller in scale and scope...
    3 KB (388 words) - 14:14, 13 July 2025
  • constructed by fine-tuning a vision-language model (VLM, i.e. a large language model extended with vision capabilities) on a large-scale dataset that pairs visual...
    28 KB (3,022 words) - 13:50, 11 July 2025
  • Thumbnail for Language model benchmark
    Language model benchmark is a standardized test designed to evaluate the performance of language model on various natural language processing tasks. These...
    101 KB (10,888 words) - 09:44, 12 July 2025
  • A 1.58-bit Large Language Model (1.58-bit LLM, also ternary LLM) is a version of a transformer large language model with weights using only three values:...
    6 KB (701 words) - 14:29, 10 July 2025
  • Thumbnail for Grok (chatbot)
    launched in November 2023 by Elon Musk as an initiative based on the large language model (LLM) of the same name. Grok is integrated with the social media...
    78 KB (7,497 words) - 04:10, 14 July 2025
  • Thumbnail for Transformer (deep learning architecture)
    Later variations have been widely adopted for training large language models (LLMs) on large (language) datasets. The modern version of the transformer was...
    106 KB (13,107 words) - 19:01, 26 June 2025
  • Thumbnail for PaLM
    PaLM (redirect from Pathways Language Model)
    PaLM (Pathways Language Model) is a 540 billion-parameter dense decoder-only transformer-based large language model (LLM) developed by Google AI. Researchers...
    13 KB (807 words) - 13:21, 13 April 2025
  • Thumbnail for Model Context Protocol
    to standardize the way artificial intelligence (AI) systems like large language models (LLMs) integrate and share data with external tools, systems, and...
    18 KB (1,630 words) - 19:03, 9 July 2025
  • Perplexity AI, or simply Perplexity, is a web search engine that uses a large language model to process queries and synthesize responses based on web search results...
    25 KB (2,193 words) - 20:21, 11 July 2025
  • intelligence (AI) model. A prompt is natural language text describing the task that an AI should perform. A prompt for a text-to-text language model can be a query...
    40 KB (4,482 words) - 03:58, 30 June 2025
  • Thumbnail for Generative artificial intelligence
    Generative artificial intelligence (category CS1 Japanese-language sources (ja))
    particularly large language models (LLMs). Major tools include chatbots such as ChatGPT, Copilot, Gemini, Claude, Grok, and DeepSeek; text-to-image models such...
    150 KB (13,385 words) - 03:53, 13 July 2025
  • Thumbnail for ChatGPT
    ChatGPT (category Large language models)
    developed by OpenAI and released on November 30, 2022. It uses large language models (LLMs) such as GPT-4o to generate human-like responses in text,...
    177 KB (15,530 words) - 14:00, 14 July 2025
  • DeepSeek (category Articles containing Chinese-language text)
    DeepSeek, is a Chinese artificial intelligence company that develops large language models (LLMs). Based in Hangzhou, Zhejiang, Deepseek is owned and funded...
    69 KB (6,445 words) - 19:23, 10 July 2025
  • Thumbnail for Meta AI
    Overleaf. In February 2023, Meta AI launched LLaMA (Large Language Model Meta AI), a large language model ranging from 7B to 65B parameters. On April 5, 2025...
    22 KB (1,903 words) - 03:43, 12 July 2025
  • GPT-4.5 (category Large language models)
    GPT-4.5 (codenamed "Orion") is a large language model developed by OpenAI as part of the GPT series. Officially released on February 27, 2025, GPT-4.5...
    6 KB (618 words) - 19:35, 10 July 2025
  • OpenAI o3 (category Large language models)
    the accuracy of o1. List of large language models Knight, Will (December 20, 2024). "OpenAI Upgrades Its Smartest AI Model With Improved Reasoning Skills"...
    9 KB (851 words) - 20:05, 10 July 2025