• A large language model (LLM) is a language model notable for its ability to achieve general-purpose language understanding and generation. LLMs acquire...
    128 KB (11,624 words) - 22:51, 28 May 2024
  • Llama (acronym for Large Language Model Meta AI, and formerly stylized as LLaMA) is a family of autoregressive large language models released by Meta AI...
    29 KB (2,977 words) - 11:36, 28 May 2024
  • A language model is a probabilistic model of a natural language. In 1980, the first significant statistical language model was proposed, and during the...
    14 KB (2,211 words) - 10:21, 28 April 2024
  • Claude is a family of large language models developed by Anthropic. The first model was released in March 2023. Claude 3, released in March 2024, can...
    10 KB (1,044 words) - 03:03, 28 May 2024
  • Large Open-science Open-access Multilingual Language Model (BLOOM) is a 176-billion-parameter transformer-based autoregressive large language model (LLM)...
    4 KB (496 words) - 20:56, 28 May 2024
  • (BERT) is a language model based on the transformer architecture, notable for its dramatic improvement over previous state of the art models. It was introduced...
    18 KB (2,144 words) - 09:13, 7 May 2024
  • Thumbnail for Gemini (language model)
    Gemini is a family of multimodal large language models developed by Google DeepMind, serving as the successor to LaMDA and PaLM 2. Comprising Gemini Ultra...
    42 KB (3,299 words) - 21:59, 17 May 2024
  • Chinchilla is a family of large language models developed by the research team at DeepMind, presented in March 2022. It is named "chinchilla" because...
    7 KB (548 words) - 19:36, 24 April 2024
  • Transfer Transformer) is a series of large language models developed by Google AI. Introduced in 2019, T5 models are trained on a massive dataset of text...
    6 KB (535 words) - 07:55, 12 May 2024
  • GPT-3 (redirect from GPT-3 (language model))
    Transformer 3 (GPT-3) is a large language model released by OpenAI in 2020. Like its predecessor, GPT-2, it is a decoder-only transformer model of deep neural network...
    54 KB (4,934 words) - 16:42, 28 May 2024
  • Thumbnail for Generative pre-trained transformer
    Generative pre-trained transformers (GPT) are a type of large language model (LLM) and a prominent framework for generative artificial intelligence. They...
    46 KB (4,098 words) - 18:01, 26 May 2024
  • produces open source large language models, citing the foundational importance of open-source software, and as a response to proprietary models. As of May 2024[update]...
    15 KB (1,527 words) - 07:32, 21 May 2024
  • freemium model; the free product uses its Perplexity model based on OpenAI's GPT-3.5 model combined with the company's standalone large language model (LLM)...
    6 KB (538 words) - 00:05, 25 May 2024
  • generative AI model. A prompt is natural language text describing the task that an AI should perform. A prompt for a text-to-text language model can be a query...
    61 KB (6,657 words) - 07:57, 27 May 2024
  • adequate, stating that "'(large) language model' was too narrow given [the] focus is not only language; 'self-supervised model' was too specific to the...
    46 KB (5,051 words) - 16:15, 19 May 2024
  • and distributed systems. A large number of modeling languages appear in the literature. Example of graphical modeling languages in the field of computer...
    22 KB (2,852 words) - 21:58, 19 May 2024
  • Thumbnail for Transformer (deep learning architecture)
    variation has been prevalently adopted for training large language models (LLM) on large (language) datasets, such as the Wikipedia corpus and Common Crawl...
    65 KB (8,122 words) - 18:54, 28 May 2024
  • Thumbnail for PaLM
    PaLM (redirect from Pathways Language Model)
    PaLM (Pathways Language Model) is a 540 billion parameter transformer-based large language model developed by Google AI. Researchers also trained smaller...
    13 KB (798 words) - 00:31, 26 April 2024
  • GPT-4 (category Large language models)
    Transformer 4 (GPT-4) is a multimodal large language model created by OpenAI, and the fourth in its series of GPT foundation models. It was launched on March 14...
    60 KB (5,834 words) - 13:29, 24 May 2024
  • GPT-4o (category Large language models)
    launched under a different name on the Large Model Systems Organization (LMSYS) as 3 different models. These 3 models were called gpt2-chatbot, im-a-good-gpt2-chatbot...
    13 KB (1,361 words) - 05:46, 26 May 2024
  • open-source large language model developed in the United Arab Emirates and launched in August 2023. It was trained on both English- and Arabic-language data...
    3 KB (261 words) - 08:42, 1 March 2024
  • Thumbnail for Model collapse
    time. In the context of large language models, research found that training LLMs on predecessor-generated text—language models are trained on the synthetic...
    4 KB (345 words) - 17:52, 12 May 2024
  • describe the theory that large language models, though able to generate plausible language, do not understand the meaning of the language they process. The term...
    22 KB (2,444 words) - 03:08, 25 May 2024
  • development. LLaMA is a family of large language models released by Meta AI starting in February 2023. Meta claims these models are open-source software, but...
    6 KB (505 words) - 00:39, 27 April 2024
  • an open-weights large language model (LLM) developed by AI21 Labs. It utilizes a Mamba-based model built on a novel state space model (SSM) and transformer...
    4 KB (328 words) - 02:46, 4 April 2024
  • Thumbnail for Microsoft Copilot
    developed by Microsoft and launched on February 7, 2023. Based on a large language model, it is able to cite sources, create poems, and write songs. It is...
    53 KB (4,791 words) - 19:26, 28 May 2024
  • Thumbnail for Generative artificial intelligence
    Generative artificial intelligence (category CS1 Italian-language sources (it))
    Improvements in transformer-based deep neural networks, particularly large language models (LLMs), enabled an AI boom of generative AI systems in the early...
    110 KB (9,792 words) - 13:03, 24 May 2024
  • (2023-01-01). "BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models". arXiv:2301.12597 [cs.CV]. Alayrac...
    7 KB (1,746 words) - 10:31, 3 April 2024
  • Thumbnail for History of artificial intelligence
    mechanism and later became widely used in large language models. Foundation models, which are large language models trained on vast quantities of unlabeled...
    133 KB (15,569 words) - 14:37, 14 May 2024
  • Thumbnail for ChatGPT
    ChatGPT (category Large language models)
    developed by OpenAI and launched on November 30, 2022. Based on large language models (LLMs), it enables users to refine and steer a conversation towards...
    180 KB (15,747 words) - 08:35, 26 May 2024