Large language models have been used by officials and politicians in a wide variety of ways. The Conversation described ChatGPT as a uniquely...
17 KB (1,396 words) - 07:17, 26 April 2025
Llama (Large Language Model Meta AI) is a family of large language models (LLMs) released by Meta AI starting in February 2023. The latest version is...
57 KB (5,448 words) - 20:35, 2 August 2025
Generative AI applications like large language models (LLMs) are common examples of foundation models. Building foundation models is often highly resource-intensive...
54 KB (5,552 words) - 18:04, 25 July 2025
Transformer) is a series of large language models developed by Google AI introduced in 2019. Like the original Transformer model, T5 models are encoder-decoder...
20 KB (1,932 words) - 20:50, 2 August 2025
improved the state-of-the-art for large language models. As of 2020, BERT is a ubiquitous baseline in natural language processing (NLP) experiments...
32 KB (3,623 words) - 20:01, 2 August 2025
Gemini is a family of multimodal large language models (LLMs) developed by Google DeepMind, and the successor to LaMDA and PaLM 2. Comprising Gemini Ultra...
64 KB (5,017 words) - 19:03, 2 August 2025
Mistral AI (category CS1 French-language sources (fr))
headquartered in Paris. Founded in 2023, it specializes in open-weight large language models (LLMs), with both open-source and proprietary AI models. The company...
27 KB (1,626 words) - 21:59, 3 August 2025
Qwen (category Large language models)
family of large language models developed by Chinese company Alibaba Cloud. In July 2024, it was ranked as the top Chinese language model in some benchmarks...
22 KB (1,560 words) - 20:03, 2 August 2025
GPT-1 (category Large language models)
(GPT-1) was the first of OpenAI's large language models following Google's invention of the transformer architecture in 2017. In June 2018, OpenAI released a...
32 KB (1,069 words) - 19:58, 2 August 2025
Huawei PanGu (category Large language models)
a multimodal large language model developed by Huawei. It was announced on July 7, 2023. The name of the large language model, PanGu, was derived...
13 KB (1,325 words) - 19:11, 2 August 2025
boom was made possible by improvements in transformer-based deep neural networks, particularly large language models (LLMs). Major tools include chatbots...
155 KB (13,968 words) - 04:29, 5 August 2025
PaLM (redirect from Pathways Language Model)
PaLM (Pathways Language Model) is a 540 billion-parameter dense decoder-only transformer-based large language model (LLM) developed by Google AI. Researchers...
13 KB (807 words) - 19:02, 2 August 2025
Sarvam AI (section Sarvam Models)
startup focused on building large language models. These large language models (LLMs) are customised for Indian Languages and contexts. The company focuses...
11 KB (1,215 words) - 03:39, 4 June 2025
Kruti (category Large language models)
backend technology combines several open-source large language models with Ola's proprietary Krutrim V2 model. The system was developed to work primarily...
15 KB (1,066 words) - 07:37, 3 August 2025
GPT-4 (category Large language models)
Transformer 4 (GPT-4) is a large language model trained and created by OpenAI and the fourth in its series of GPT foundation models. It was launched on March...
63 KB (6,044 words) - 12:11, 3 August 2025
Grok (chatbot) (redirect from Aurora (text-to-image model))
developed by xAI. It was launched in November 2023 by Elon Musk as an initiative based on the large language model (LLM) of the same name. Grok is integrated...
83 KB (8,060 words) - 22:55, 4 August 2025
There are different models, including open source models. CogVideo, which takes Chinese-language input, is the earliest text-to-video model "of 9.4 billion parameters"...
29 KB (2,449 words) - 04:52, 26 July 2025
Scale AI (category 2016 establishments in California)
Evaluation and Alignment Lab, focuses on evaluating and aligning large language models (LLMs), including through initiatives such as Humanity's Last Exam...
25 KB (2,312 words) - 05:00, 2 August 2025
Cohere (category Large language models)
company focused on artificial intelligence. Cohere specializes in large language models and AI products for regulated industries, particularly the finance...
24 KB (2,208 words) - 08:58, 24 July 2025
Alexandr Wang (category Articles containing simplified Chinese-language text)
company that provides data labeling and large language model evaluation services to develop AI applications. In 2021, he was the world's youngest self-made...
15 KB (1,380 words) - 11:39, 4 August 2025
Artificial general intelligence (category CS1 Bulgarian-language sources (bg))
all cognitive tasks. Some researchers argue that state‑of‑the‑art large language models (LLMs) already exhibit signs of AGI‑level capability, while others...
135 KB (14,800 words) - 17:53, 2 August 2025
Dead Internet theory (category CS1 French-language sources (fr))
increase in content generated via large language models (LLMs) such as ChatGPT appearing in popular Internet spaces without mention of the full theory. In a...
35 KB (3,258 words) - 11:56, 1 August 2025
GPT-3 (redirect from GPT-3 (language model))
Transformer 3 (GPT-3) is a large language model released by OpenAI in 2020. Like its predecessor, GPT-2, it is a decoder-only transformer model of deep neural network...
55 KB (4,897 words) - 20:00, 2 August 2025
ChatGPT (category Large language models)
trained or used. This includes text-to-image models such as Stable Diffusion and large language models such as ChatGPT. As of 2023, there were several...
168 KB (14,859 words) - 05:35, 5 August 2025
Aleph Alpha (category CS1 French-language sources (fr))
and comply with European data protection regulations. It develops large language models (LLMs), which try to provide transparency about the sources used for...
12 KB (1,146 words) - 19:32, 25 July 2025
Anthropic (category 2021 establishments in California)
intelligence (AI) startup company founded in 2021. Anthropic has developed a family of large language models (LLMs) named Claude as a competitor to OpenAI's...
39 KB (3,620 words) - 05:10, 2 August 2025
Minerva is a large language model developed by an Italian research group, Sapienza NLP, at Sapienza University of Rome, led by Roberto Navigli. It is trained...
5 KB (379 words) - 06:44, 4 May 2025
AI boom (category CS1 Japanese-language sources (ja))
the 2020s. Examples include generative AI technologies, such as large language models and AI image generators by companies like OpenAI, as well as scientific...
64 KB (5,464 words) - 21:18, 26 July 2025
Ernie Bot (category Large language models)
company Baidu. It is built on the company's ERNIE series of large language models, which have been in development since 2019. The service was first launched...
22 KB (1,902 words) - 01:32, 31 July 2025
Recursive self-improvement (redirect from Self-rewarding language models)
the development of large language models capable of self-improvement. This includes their work on "Self-Rewarding Language Models" that studies how to...
12 KB (1,331 words) - 09:20, 4 June 2025