language models with many parameters, and are trained with self-supervised learning on a vast amount of text. This page lists notable large language models...
64 KB (3,353 words) - 15:01, 4 August 2025
large language model (LLM) is a language model trained with self-supervised machine learning on a vast amount of text, designed for natural language processing...
137 KB (14,547 words) - 01:05, 5 August 2025
Llama (Large Language Model Meta AI) is a family of large language models (LLMs) released by Meta AI starting in February 2023. The latest version is Llama...
57 KB (5,448 words) - 20:35, 2 August 2025
of large language models developed by Anthropic. The first model was released in March 2023. The Claude 3 family, released in March 2024, consists of...
26 KB (2,274 words) - 19:08, 4 August 2025
A language model is a model of the human brain's ability to produce natural language. Language models are useful for a variety of tasks, including speech...
17 KB (2,424 words) - 12:05, 30 July 2025
Gemini is a family of multimodal large language models (LLMs) developed by Google DeepMind, and the successor to LaMDA and PaLM 2. Comprising Gemini Ultra...
64 KB (5,017 words) - 19:03, 2 August 2025
Mistral AI (category CS1 French-language sources (fr))
2023, it specializes in open-weight large language models (LLMs), with both open-source and proprietary AI models. The company is named after the mistral...
27 KB (1,626 words) - 21:59, 3 August 2025
DeepSeek (category Articles containing Chinese-language text)
DeepSeek is a Chinese artificial intelligence company that develops large language models (LLMs). Based in Hangzhou, Zhejiang, DeepSeek is owned and funded...
72 KB (6,633 words) - 17:50, 5 August 2025
comparing different models' capabilities in areas such as language understanding, generation, and reasoning. Benchmarks generally consist of a dataset and corresponding...
103 KB (11,038 words) - 21:27, 4 August 2025
Stochastic parrot (redirect from On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?)
frames large language models as systems that statistically mimic text without real understanding. The term was first used in the paper "On the Dangers of Stochastic...
22 KB (2,359 words) - 14:00, 3 August 2025
applications and web interfaces. List of large language models The Pile (dataset), public data used to train many research models "What is a chatbot?". techtarget...
29 KB (1,297 words) - 08:20, 15 July 2025
Qwen (category Large language models)
a family of large language models developed by Chinese company Alibaba Cloud. In July 2024, it was ranked as the top Chinese language model in some benchmarks...
22 KB (1,560 words) - 20:03, 2 August 2025
The BigScience Large Open-science Open-access Multilingual Language Model (BLOOM) is an open-access large language model (LLM). It was created by a volunteer-driven...
5 KB (528 words) - 10:30, 31 July 2025
models. Model transformation is a common example of such reasoning. Object modeling languages are modeling languages based on a standardized set of symbols...
23 KB (2,886 words) - 09:45, 29 July 2025
Generative artificial intelligence (category CS1 maint: numeric names: authors list)
particularly large language models (LLMs). Major tools include chatbots such as ChatGPT, Copilot, Gemini, Claude, Grok, and DeepSeek; text-to-image models such...
155 KB (13,968 words) - 04:29, 5 August 2025
ChatGPT (category Large language models)
used. This includes text-to-image models such as Stable Diffusion and large language models such as ChatGPT. As of 2023, there were several pending U...
168 KB (14,859 words) - 12:51, 5 August 2025
GPT-4 (category Large language models)
Transformer 4 (GPT-4) is a large language model trained and created by OpenAI and the fourth in its series of GPT foundation models. It was launched on March...
63 KB (6,044 words) - 12:11, 3 August 2025
Transformer) is a series of large language models developed by Google AI introduced in 2019. Like the original Transformer model, T5 models are encoder-decoder...
20 KB (1,932 words) - 20:50, 2 August 2025
GPT-4o (category Large language models)
agreeable to the point of supporting clearly delusional or dangerous ideas. Apple Intelligence List of large language models Wiggers, Kyle (May 13, 2024)...
25 KB (2,396 words) - 17:43, 21 July 2025
dramatically improved the state-of-the-art for large language models. As of 2020, BERT is a ubiquitous baseline in natural language processing (NLP) experiments...
32 KB (3,623 words) - 20:01, 2 August 2025
OpenAI o1 (category Large language models)
relatively rarely admitted deceptive action (in 20% of test cases). List of large language models Metz, Cade (September 12, 2024). "OpenAI Unveils New...
14 KB (1,421 words) - 20:12, 2 August 2025
GPT-3 (redirect from GPT-3 (language model))
3 (GPT-3) is a large language model released by OpenAI in 2020. Like its predecessor, GPT-2, it is a decoder-only transformer model of deep neural network...
55 KB (4,899 words) - 17:21, 5 August 2025
OpenAI o3 (category Large language models)
times the accuracy of o1. List of large language models Knight, Will (December 20, 2024). "OpenAI Upgrades Its Smartest AI Model With Improved Reasoning...
9 KB (851 words) - 20:12, 2 August 2025
Generative pre-trained transformer (redirect from GPT (language model))
released a series of open-source models, including GPT-J in 2021. Other major technology companies developed their own large language models, including Google's...
54 KB (4,304 words) - 18:45, 3 August 2025
GPT-4.1 (category Large language models)
misaligned than GPT-4o. List of large language models Weatherbed, Jess (2025-04-14). "OpenAI debuts its GPT-4.1 flagship AI model". The Verge. Retrieved...
6 KB (574 words) - 21:08, 23 July 2025
GPT-4.5 (category Large language models)
GPT-4.5 (codenamed "Orion") is a large language model developed by OpenAI as part of the GPT series. Officially released on February 27, 2025, GPT-4.5...
7 KB (630 words) - 15:27, 23 July 2025
OpenAI o4-mini (category Large language models)
assessment through automated document analysis and data processing. List of large language models "Introducing OpenAI o3 and o4-mini". OpenAI. Retrieved 17 April...
4 KB (306 words) - 20:06, 10 July 2025
GPT-2 (category Large language models)
(GPT-2) is a large language model by OpenAI and the second in their foundational series of GPT models. GPT-2 was pre-trained on a dataset of 8 million web...
44 KB (3,269 words) - 19:59, 2 August 2025
Grok (chatbot) (redirect from Aurora (text-to-image model))
intelligence models, in particular the Grok Large Language Models (LLMs). The inquiry considers a large range of issues concerning the use of a subset of the data...
85 KB (8,217 words) - 15:11, 5 August 2025
GPT-1 (category Large language models)
Pre-trained Transformer 1 (GPT-1) was the first of OpenAI's large language models following Google's invention of the transformer architecture in 2017. In June...
32 KB (1,069 words) - 19:58, 2 August 2025