Large_language_model Search Results

Large language model

A large language model (LLM) is a type of machine learning model designed for natural language processing tasks such as language generation. LLMs are language...

114 KB (11,942 words) - 05:35, 30 April 2025

List of large language models

A large language model (LLM) is a type of machine learning model designed for natural language processing tasks such as language generation. LLMs are language...

64 KB (3,361 words) - 09:20, 29 April 2025

Llama (language model)

Llama (Large Language Model Meta AI, formerly stylized as LLaMA) is a family of large language models (LLMs) released by Meta AI starting in February 2023...

53 KB (4,940 words) - 16:55, 22 April 2025

Language model

information retrieval. Large language models (LLMs), currently their most advanced form, are predominantly based on transformers trained on larger datasets (frequently...

16 KB (2,382 words) - 00:06, 17 April 2025

Claude (language model)

Claude is a family of large language models developed by Anthropic. The first model was released in March 2023. The Claude 3 family, released in March...

21 KB (1,894 words) - 20:08, 19 April 2025

Gemini (language model)

Gemini is a family of multimodal large language models developed by Google DeepMind, and the successor to LaMDA and PaLM 2. Comprising Gemini Ultra, Gemini...

52 KB (4,226 words) - 20:15, 19 April 2025

Large language models in government

Large language models have been used by officials and politicians in a wide variety of ways. The Conversation described ChatGPT described as a uniquely...

17 KB (1,396 words) - 07:17, 26 April 2025

Generative pre-trained transformer (redirect from GPT (language model))

A generative pre-trained transformer (GPT) is a type of large language model (LLM) and a prominent framework for generative artificial intelligence. It...

65 KB (5,342 words) - 13:55, 1 May 2025

BERT (language model)

improved the state-of-the-art for large language models. As of 2020[update], BERT is a ubiquitous baseline in natural language processing (NLP) experiments...

31 KB (3,528 words) - 01:20, 29 April 2025

Chinchilla (language model)

Chinchilla is a family of large language models (LLMs) developed by the research team at Google DeepMind, presented in March 2022. It is named "chinchilla"...

8 KB (615 words) - 19:51, 6 December 2024

BLOOM (language model)

Large Open-science Open-access Multilingual Language Model (BLOOM) is a 176-billion-parameter transformer-based autoregressive large language model (LLM)...

4 KB (506 words) - 02:26, 19 April 2025

T5 (language model)

Transformer) is a series of large language models developed by Google AI introduced in 2019. Like the original Transformer model, T5 models are encoder-decoder...

20 KB (1,932 words) - 22:58, 21 March 2025

Reasoning language model

Reasoning language models are artificial intelligence systems that combine natural language processing with structured reasoning capabilities. These models are...

24 KB (2,960 words) - 18:31, 16 April 2025

PaLM (redirect from Pathways Language Model)

PaLM (Pathways Language Model) is a 540 billion-parameter dense decoder-only transformer-based large language model (LLM) developed by Google AI. Researchers...

13 KB (807 words) - 13:21, 13 April 2025

Model Context Protocol

The Model Context Protocol (MCP) is an open standard developed by the artificial intelligence company Anthropic for enabling large language model (LLM)...

6 KB (615 words) - 09:47, 30 April 2025

1.58-bit large language model

A 1.58-bit Large Language Model (1.58-bit LLM, also ternary LLM) is a version of a transformer large language model with weights using only three values:...

5 KB (618 words) - 12:45, 1 May 2025

Foundation model

Generative AI applications like Large Language Models are common examples of foundation models. Building foundation models is often highly resource-intensive...

44 KB (4,714 words) - 05:57, 6 March 2025

Transformer (deep learning architecture) (redirect from Transformer model)

Later variations have been widely adopted for training large language models (LLM) on large (language) datasets. Transformers were first developed as an improvement...

106 KB (13,091 words) - 21:14, 29 April 2025

Small language model

processing including language and text generation. Unlike large language models (LLMs), small language models are much smaller in scale and scope. Typically, an...

2 KB (211 words) - 03:40, 29 April 2025

Meta AI (redirect from No Language Left Behind)

allow the model to follow instructions to manipulate LaTeX documents on Overleaf. In February 2023, Meta AI launched LLaMA (Large Language Model Meta AI)...

25 KB (2,148 words) - 09:26, 30 April 2025

Perplexity AI

simply Perplexity, is an American web search engine that uses a large language model to process queries and synthesize responses based on web search results...

22 KB (1,870 words) - 19:38, 30 April 2025

Grok (chatbot) (redirect from Aurora (text-to-image model))

generative artificial intelligence chatbot developed by xAI. Based on the large language model (LLM) of the same name, it was launched in 2023 as an initiative...

47 KB (4,189 words) - 11:15, 29 April 2025

Stochastic parrot (redirect from On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?)

describe the theory that large language models, though able to generate plausible language, do not understand the meaning of the language they process. The term...

22 KB (2,397 words) - 07:34, 27 March 2025

GPT-3 (redirect from GPT-3 (language model))

Transformer 3 (GPT-3) is a large language model released by OpenAI in 2020. Like its predecessor, GPT-2, it is a decoder-only transformer model of deep neural network...

54 KB (4,913 words) - 09:05, 8 April 2025

Generative artificial intelligence (category CS1 Japanese-language sources (ja))

improvements in transformer-based deep neural networks, particularly large language models (LLMs). Major tools include chatbots such as ChatGPT, Copilot, Gemini...

163 KB (13,826 words) - 19:09, 30 April 2025

Prompt engineering (redirect from In-context learning (natural language processing))

intelligence (AI) model. A prompt is natural language text describing the task that an AI should perform. A prompt for a text-to-text language model can be a query...

43 KB (4,790 words) - 18:46, 21 April 2025

DeepSeek (category Articles containing Chinese-language text)

DeepSeek, is a Chinese artificial intelligence company that develops large language models (LLMs). Based in Hangzhou, Zhejiang, it is owned and funded by the...

62 KB (6,059 words) - 16:53, 1 May 2025

Microsoft Copilot (section Languages)

intelligence chatbot developed by Microsoft. Based on the GPT-4 series of large language models, it was launched in 2023 as Microsoft's primary replacement for...

63 KB (5,651 words) - 15:18, 1 May 2025

Mistral AI (section Mistral Large 2)

startup, headquartered in Paris. It specializes in open-weight large language models (LLMs). The company is named after the mistral, a powerful, cold...

27 KB (1,716 words) - 03:22, 29 April 2025

ChatGPT (category Large language models)

the American company OpenAI and launched in 2022. It is based on large language models (LLMs) such as GPT-4o. ChatGPT can generate human-like conversational...

207 KB (17,890 words) - 15:22, 1 May 2025