• Large language models have been used by officials and politicians in a wide variety of ways. The Conversation described ChatGPT described as a uniquely...
    17 KB (1,396 words) - 07:17, 26 April 2025
  • Thumbnail for Llama (language model)
    Llama (Large Language Model Meta AI) is a family of large language models (LLMs) released by Meta AI starting in February 2023. The latest version is...
    57 KB (5,448 words) - 20:35, 2 August 2025
  • Generative AI applications like large language models (LLM) are common examples of foundation models. Building foundation models is often highly resource-intensive...
    54 KB (5,552 words) - 18:04, 25 July 2025
  • Transformer) is a series of large language models developed by Google AI introduced in 2019. Like the original Transformer model, T5 models are encoder-decoder...
    20 KB (1,932 words) - 20:50, 2 August 2025
  • improved the state-of-the-art for large language models. As of 2020[update], BERT is a ubiquitous baseline in natural language processing (NLP) experiments...
    32 KB (3,623 words) - 20:01, 2 August 2025
  • Gemini is a family of multimodal large language models (LLMs) developed by Google DeepMind, and the successor to LaMDA and PaLM 2. Comprising Gemini Ultra...
    64 KB (5,017 words) - 19:03, 2 August 2025
  • Thumbnail for Mistral AI
    Mistral AI (category CS1 French-language sources (fr))
    headquartered in Paris. Founded in 2023, it specializes in open-weight large language models (LLMs), with both open-source and proprietary AI models. The company...
    27 KB (1,626 words) - 21:59, 3 August 2025
  • Thumbnail for Qwen
    Qwen (category Large language models)
    family of large language models developed by Chinese company Alibaba Cloud. In July 2024, it was ranked as the top Chinese language model in some benchmarks...
    22 KB (1,560 words) - 20:03, 2 August 2025
  • Thumbnail for GPT-1
    GPT-1 (category Large language models)
    (GPT-1) was the first of OpenAI's large language models following Google's invention of the transformer architecture in 2017. In June 2018, OpenAI released a...
    32 KB (1,069 words) - 19:58, 2 August 2025
  • Huawei PanGu (category Large language models)
    a multimodal large language model developed by Huawei. It was announced on July 7, 2023. The name of the large learning language model, PanGu, was derived...
    13 KB (1,325 words) - 19:11, 2 August 2025
  • Thumbnail for Generative artificial intelligence
    boom was made possible by improvements in transformer-based deep neural networks, particularly large language models (LLMs). Major tools include chatbots...
    155 KB (13,968 words) - 04:29, 5 August 2025
  • Thumbnail for PaLM
    PaLM (redirect from Pathways Language Model)
    PaLM (Pathways Language Model) is a 540 billion-parameter dense decoder-only transformer-based large language model (LLM) developed by Google AI. Researchers...
    13 KB (807 words) - 19:02, 2 August 2025
  • startup focused on building large language models. These large language models (LLMs) are customised for Indian Languages and contexts. The company focuses...
    11 KB (1,215 words) - 03:39, 4 June 2025
  • Kruti (category Large language models)
    backend technology combines several open-source large language models with Ola's proprietary Krutrim V2 model. The system was developed to work primarily...
    15 KB (1,066 words) - 07:37, 3 August 2025
  • GPT-4 (category Large language models)
    Transformer 4 (GPT-4) is a large language model trained and created by OpenAI and the fourth in its series of GPT foundation models. It was launched on March...
    63 KB (6,044 words) - 12:11, 3 August 2025
  • Thumbnail for Grok (chatbot)
    developed by xAI. It was launched in November 2023 by Elon Musk as an initiative based on the large language model (LLM) of the same name. Grok is integrated...
    83 KB (8,060 words) - 22:55, 4 August 2025
  • There are different models, including open source models. Chinese-language input CogVideo is the earliest text-to-video model "of 9.4 billion parameters"...
    29 KB (2,449 words) - 04:52, 26 July 2025
  • Scale AI (category 2016 establishments in California)
    Evaluation and Alignment Lab, focuses on evaluating and aligning large language models (LLMs), including through initiatives such as Humanity's Last Exam...
    25 KB (2,312 words) - 05:00, 2 August 2025
  • Cohere (category Large language models)
    company focused on artificial intelligence. Cohere specializes in large language models and AI products for regulated industries, particularly the finance...
    24 KB (2,208 words) - 08:58, 24 July 2025
  • Thumbnail for Alexandr Wang
    Alexandr Wang (category Articles containing simplified Chinese-language text)
    company that provides data labeling and large language model evaluation services to develop AI applications. In 2021, he was the world's youngest self-made...
    15 KB (1,380 words) - 11:39, 4 August 2025
  • Artificial general intelligence (category CS1 Bulgarian-language sources (bg))
    all cognitive tasks. Some researchers argue that state‑of‑the‑art large language models (LLMs) already exhibit signs of AGI‑level capability, while others...
    135 KB (14,800 words) - 17:53, 2 August 2025
  • Dead Internet theory (category CS1 French-language sources (fr))
    increase in content generated via large language models (LLMs) such as ChatGPT appearing in popular Internet spaces without mention of the full theory. In a...
    35 KB (3,258 words) - 11:56, 1 August 2025
  • GPT-3 (redirect from GPT-3 (language model))
    Transformer 3 (GPT-3) is a large language model released by OpenAI in 2020. Like its predecessor, GPT-2, it is a decoder-only transformer model of deep neural network...
    55 KB (4,897 words) - 20:00, 2 August 2025
  • Thumbnail for ChatGPT
    ChatGPT (category Large language models)
    trained or used. This includes text-to-image models such as Stable Diffusion and large language models such as ChatGPT. As of 2023, there were several...
    168 KB (14,859 words) - 05:35, 5 August 2025
  • Thumbnail for Aleph Alpha
    Aleph Alpha (category CS1 French-language sources (fr))
    and comply with European data protection regulations. It develops large language models (LLM), which try to provide transparency of its sources used for...
    12 KB (1,146 words) - 19:32, 25 July 2025
  • Anthropic (category 2021 establishments in California)
    intelligence (AI) startup company founded in 2021. Anthropic has developed a family of large language models (LLMs) named Claude as a competitor to OpenAI's...
    39 KB (3,620 words) - 05:10, 2 August 2025
  • Minerva is a large language model developed by an Italian research group, Sapienza NLP, at Sapienza University of Rome, led by Roberto Navigli. It is trained...
    5 KB (379 words) - 06:44, 4 May 2025
  • Thumbnail for AI boom
    AI boom (category CS1 Japanese-language sources (ja))
    the 2020s. Examples include generative AI technologies, such as large language models and AI image generators by companies like OpenAI, as well as scientific...
    64 KB (5,464 words) - 21:18, 26 July 2025
  • Thumbnail for Ernie Bot
    Ernie Bot (category Large language models)
    company Baidu. It is built on the company's ERNIE series of large language models, which have been in development since 2019. The service was first launched...
    22 KB (1,902 words) - 01:32, 31 July 2025
  • the development of large language models capable of self-improvement. This includes their work on "Self-Rewarding Language Models" that studies how to...
    12 KB (1,331 words) - 09:20, 4 June 2025