Small language model Search Results

Small language model

Small language models (SLMs) or compact language models are artificial intelligence language models designed for human natural language processing including...

3 KB (388 words) - 14:14, 13 July 2025

Large language model

A large language model (LLM) is a language model trained with self-supervised machine learning on a vast amount of text, designed for natural language processing...

125 KB (13,357 words) - 06:28, 8 August 2025

Claude (language model)

Claude is a family of large language models developed by Anthropic. The first model was released in March 2023. The Claude 3 family, released in March...

27 KB (2,366 words) - 06:42, 6 August 2025

BERT (language model)

Bidirectional encoder representations from transformers (BERT) is a language model introduced in October 2018 by researchers at Google. It learns to represent...

32 KB (3,622 words) - 20:01, 2 August 2025

Reasoning language model

Reasoning language models (RLMs) are large language models that are trained further to solve tasks that take several steps of reasoning. They tend to do...

26 KB (3,063 words) - 05:37, 8 August 2025

List of large language models

A large language model (LLM) is a type of machine learning model designed for natural language processing tasks such as language generation. LLMs are language...

67 KB (3,494 words) - 04:55, 8 August 2025

Llama (language model)

Llama (Large Language Model Meta AI) is a family of large language models (LLMs) released by Meta AI starting in February 2023. The latest version is Llama...

58 KB (5,590 words) - 10:48, 7 August 2025

Gemini (language model)

Gemini is a family of multimodal large language models (LLMs) developed by Google DeepMind, and the successor to LaMDA and PaLM 2. Comprising Gemini Ultra...

64 KB (5,017 words) - 06:33, 8 August 2025

Systems modeling language

The systems modeling language (SysML) is a general-purpose modeling language for systems engineering applications. It supports the specification, analysis...

14 KB (1,568 words) - 07:28, 21 January 2025

T5 (language model)

T5 is a series of large language models developed by Google AI introduced in 2019. Like the original Transformer model, T5 models are encoder-decoder Transformers...

20 KB (1,932 words) - 20:50, 2 August 2025

Unified Modeling Language

The Unified Modeling Language (UML) is a general-purpose, object-oriented, visual modeling language that provides a way to visualize the architecture...

28 KB (3,032 words) - 15:09, 7 August 2025

Language model benchmark

A language model benchmark is a standardized test designed to evaluate the performance of language models on various natural language processing tasks. These...

103 KB (11,102 words) - 02:46, 8 August 2025

Foundation model

typically requires only fine-tuning on smaller, task-specific datasets. Early examples of foundation models are language models (LMs) like OpenAI's GPT series...

54 KB (5,552 words) - 18:04, 25 July 2025

Vision-language-action model

In robot learning, a vision-language-action model (VLA) is a class of multimodal foundation models that integrates vision, language and actions. Given an input...

25 KB (2,839 words) - 03:31, 25 July 2025

Generative pre-trained transformer (redirect from GPT (language model))

A generative pre-trained transformer (GPT) is a type of large language model (LLM) that is widely used in generative AI chatbots. GPTs are based on a deep...

54 KB (4,304 words) - 21:37, 7 August 2025

PaLM (redirect from Pathways Language Model)

PaLM (Pathways Language Model) is a 540 billion-parameter dense decoder-only transformer-based large language model (LLM) developed by Google AI. Researchers...

13 KB (807 words) - 19:02, 2 August 2025

Mistral AI (category CS1 French-language sources (fr))

2023, it specializes in open-weight large language models (LLMs), with both open-source and proprietary AI models. The company is named after the mistral...

27 KB (1,626 words) - 21:59, 3 August 2025

Cache language model

A cache language model is a type of statistical language model. These occur in the natural language processing subfield of computer science and assign...

9 KB (1,067 words) - 02:33, 22 March 2024

SLM

optical projection; Standard litre per minute, a unit; Small language model, a small-scale language model in generative artificial intelligence; Š-L-M (Shin-Lamedh-Mem)...

1 KB (171 words) - 02:09, 16 April 2025

Word n-gram language model

A word n-gram language model is a purely statistical model of language. It has been superseded by recurrent neural network–based models, which have been...

20 KB (2,647 words) - 17:27, 25 July 2025

Sarvam AI (section Sarvam Models)

startup focused on building large language models. These large language models (LLMs) are customised for Indian languages and contexts. The company focuses...

11 KB (1,215 words) - 03:39, 4 June 2025

Model

software; Economic model, a theoretical construct representing economic processes; Language model, a probabilistic model of a natural language, used for speech...

16 KB (1,698 words) - 20:08, 25 May 2025

Transformer (deep learning architecture) (redirect from Transformer model)

variations have been widely adopted for training large language models (LLMs) on large (language) datasets. The modern version of the transformer was proposed...

106 KB (13,105 words) - 18:15, 6 August 2025

Prompt engineering (redirect from In-context learning (natural language processing))

intelligence (AI) model. A prompt is natural language text describing the task that an AI should perform. A prompt for a text-to-text language model can be a query...

40 KB (4,480 words) - 21:07, 27 July 2025

Human AI Labs

Grounded Transformer (GGT-1) and Small Language Models (SLMs), which are customized to individual users' data. These models incorporate memory-stacking functions...

8 KB (740 words) - 15:14, 24 July 2025

Neural scaling law (section Size of the model)

resources and time required for model training. With the "pretrain, then finetune" method used for most large language models, there are two kinds of training...

44 KB (5,854 words) - 22:47, 13 July 2025

Alloy (specification language)

specification language for expressing complex structural constraints and behavior in a software system. Alloy provides a simple structural modeling tool based...

6 KB (695 words) - 22:11, 24 July 2023

Stochastic parrot (redirect from On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?)

Emily M. Bender and colleagues in a 2021 paper, that frames large language models as systems that statistically mimic text without real understanding...

22 KB (2,359 words) - 14:00, 3 August 2025

Data model

programming languages. Data models are often complemented by function models, especially in the context of enterprise models. A data model explicitly determines...

40 KB (5,059 words) - 09:43, 29 July 2025

Knowledge distillation (redirect from Model distillation)

distillation or model distillation is the process of transferring knowledge from a large model to a smaller one. While large models (such as very deep...

17 KB (2,568 words) - 07:24, 24 June 2025