Large_Language_Model Search Results

Large language model

A large language model (LLM) is a language model trained with self-supervised machine learning on a vast amount of text, designed for natural language processing...

135 KB (14,248 words) - 17:13, 3 August 2025

List of large language models

A large language model (LLM) is a type of machine learning model designed for natural language processing tasks such as language generation. LLMs are language...

64 KB (3,353 words) - 15:04, 24 July 2025

Language model

A language model is a model of the human brain's ability to produce natural language. Language models are useful for a variety of tasks, including speech...

17 KB (2,424 words) - 12:05, 30 July 2025

Claude (language model)

Claude is a family of large language models developed by Anthropic. The first model was released in March 2023. The Claude 3 family, released in March...

26 KB (2,274 words) - 20:30, 2 August 2025

Llama (language model)

Llama (Large Language Model Meta AI) is a family of large language models (LLMs) released by Meta AI starting in February 2023. The latest version is Llama...

57 KB (5,448 words) - 20:35, 2 August 2025

Reasoning language model

Reasoning language models (RLMs) are large language models that are trained further to solve tasks that take several steps of reasoning. They tend to do...

26 KB (3,061 words) - 21:30, 31 July 2025

Foundation model

Generative AI applications like large language models (LLMs) are common examples of foundation models. Building foundation models is often highly resource-intensive...

54 KB (5,552 words) - 18:04, 25 July 2025

Large language models in government

Large language models have been used by officials and politicians in a wide variety of ways. The Conversation described ChatGPT as a uniquely...

17 KB (1,396 words) - 07:17, 26 April 2025

Gemini (language model)

Gemini is a family of multimodal large language models (LLMs) developed by Google DeepMind, and the successor to LaMDA and PaLM 2. Comprising Gemini Ultra...

64 KB (5,017 words) - 19:03, 2 August 2025

Chinchilla (language model)

Chinchilla is a family of large language models (LLMs) developed by the research team at Google DeepMind, presented in March 2022. It is named "chinchilla"...

8 KB (615 words) - 19:14, 2 August 2025

BLOOM (language model)

The BigScience Large Open-science Open-access Multilingual Language Model (BLOOM) is an open-access large language model (LLM). It was created by a volunteer-driven...

5 KB (528 words) - 10:30, 31 July 2025

Generative pre-trained transformer (redirect from GPT (language model))

A generative pre-trained transformer (GPT) is a type of large language model (LLM) that is widely used in generative AI chatbots. GPTs are based on a deep...

54 KB (4,304 words) - 18:45, 3 August 2025

Vision-language-action model

constructed by fine-tuning a vision-language model (VLM, i.e. a large language model extended with vision capabilities) on a large-scale dataset that pairs visual...

25 KB (2,839 words) - 03:31, 25 July 2025

BERT (language model)

improved the state-of-the-art for large language models. As of 2020, BERT is a ubiquitous baseline in natural language processing (NLP) experiments...

32 KB (3,623 words) - 20:01, 2 August 2025

T5 (language model)

Transformer) is a series of large language models developed by Google AI introduced in 2019. Like the original Transformer model, T5 models are encoder-decoder...

20 KB (1,932 words) - 20:50, 2 August 2025

Small language model

language processing including language and text generation. Unlike large language models (LLMs), small language models are much smaller in scale and scope...

3 KB (388 words) - 14:14, 13 July 2025

1.58-bit large language model

A 1.58-bit large language model (also known as a ternary LLM) is a type of large language model (LLM) designed to be computationally efficient. It achieves...

6 KB (720 words) - 01:58, 28 July 2025

Language model benchmark

A language model benchmark is a standardized test designed to evaluate the performance of language models on various natural language processing tasks. These...

103 KB (11,038 words) - 12:06, 30 July 2025

Transformer (deep learning architecture) (redirect from Transformer model)

Later variations have been widely adopted for training large language models (LLMs) on large (language) datasets. The modern version of the transformer was...

106 KB (13,107 words) - 01:38, 26 July 2025

Prompt engineering (redirect from In-context learning (natural language processing))

intelligence (AI) model. A prompt is natural language text describing the task that an AI should perform. A prompt for a text-to-text language model can be a query...

40 KB (4,480 words) - 21:07, 27 July 2025

Model Context Protocol

to standardize the way artificial intelligence (AI) systems like large language models (LLMs) integrate and share data with external tools, systems, and...

20 KB (1,834 words) - 14:09, 3 August 2025

Perplexity AI

Perplexity AI, or simply Perplexity, is a web search engine that uses a large language model to process queries and synthesize responses based on web search results...

32 KB (2,720 words) - 18:34, 3 August 2025

PaLM (redirect from Pathways Language Model)

PaLM (Pathways Language Model) is a 540 billion-parameter dense decoder-only transformer-based large language model (LLM) developed by Google AI. Researchers...

13 KB (807 words) - 19:02, 2 August 2025

Stochastic parrot (redirect from On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?)

by Emily M. Bender and colleagues in a 2021 paper, that frames large language models as systems that statistically mimic text without real understanding...

22 KB (2,359 words) - 14:00, 3 August 2025

Grok (chatbot) (redirect from Aurora (text-to-image model))

launched in November 2023 by Elon Musk as an initiative based on the large language model (LLM) of the same name. Grok is integrated with the social media...

83 KB (8,060 words) - 12:42, 3 August 2025

Generative artificial intelligence (category CS1 Japanese-language sources (ja))

particularly large language models (LLMs). Major tools include chatbots such as ChatGPT, Copilot, Gemini, Claude, Grok, and DeepSeek; text-to-image models such...

155 KB (13,950 words) - 05:14, 30 July 2025

Model collapse

obtained. In the context of large language models, research found that training LLMs on predecessor-generated text — language models are trained on the synthetic...

17 KB (2,466 words) - 23:18, 15 June 2025

Feedback neural network (redirect from Large reasoning model)

subsequent layers. This is notably used in large language models, specifically in reasoning language models (RLMs). This process is designed to mimic self-assessment...

8 KB (763 words) - 11:13, 20 July 2025

Jais (language model)

Jais is an open-source large language model launched in August 2023. Developed as a collaboration between Emirati AI company G42, the Mohamed bin Zayed...

5 KB (463 words) - 12:57, 1 August 2025

GPT-4.5 (category Large language models)

GPT-4.5 (codenamed "Orion") is a large language model developed by OpenAI as part of the GPT series. Officially released on February 27, 2025, GPT-4.5...

7 KB (630 words) - 15:27, 23 July 2025