Mamba_(deep_learning_architecture) Search Results

Mamba (deep learning architecture)

Mamba is a deep learning architecture focused on sequence modeling. It was developed by researchers from Carnegie Mellon University and Princeton University...

11 KB (1,159 words) - 21:14, 2 August 2025

Transformer (deep learning architecture)

In deep learning, transformer is an architecture based on the multi-head attention mechanism, in which text is converted to numerical representations...

106 KB (13,107 words) - 01:38, 26 July 2025

Multimodal learning

Multimodal learning is a type of deep learning that integrates and processes multiple types of data, referred to as modalities, such as text, audio, images...

9 KB (2,212 words) - 22:40, 1 June 2025

Multilayer perceptron (category Neural network architectures)

In deep learning, a multilayer perceptron (MLP) is a name for a modern feedforward neural network consisting of fully connected neurons with nonlinear...

16 KB (1,932 words) - 03:01, 30 June 2025

Convolutional neural network (redirect from CNN (machine learning model))

that learns features via filter (or kernel) optimization. This type of deep learning network has been applied to process and make predictions from many different...

138 KB (15,555 words) - 03:37, 31 July 2025

Self-supervised learning

using autoencoders, which are a type of neural network architecture used for representation learning. Autoencoders consist of an encoder network that maps...

18 KB (2,047 words) - 06:56, 1 August 2025

Reinforcement learning

Reinforcement learning is one of the three basic machine learning paradigms, alongside supervised learning and unsupervised learning. Reinforcement learning differs...

69 KB (8,200 words) - 18:16, 17 July 2025

Q-learning

Q-learning algorithm. In 2014, Google DeepMind patented an application of Q-learning to deep learning, titled "deep reinforcement learning" or "deep Q-learning"...

29 KB (3,871 words) - 08:25, 31 July 2025

Machine learning

explicit instructions. Within a subdiscipline in machine learning, advances in the field of deep learning have allowed neural networks, a class of statistical...

140 KB (15,517 words) - 04:44, 31 July 2025

Neural network (machine learning)

regarding local vs non-local learning, as well as shallow vs deep architecture. Biological brains use both shallow and deep circuits as reported by brain...

168 KB (17,613 words) - 12:10, 26 July 2025

Mamba (disambiguation)

Kerala, India Mamba Point, zone in Monrovia, Liberia, north of Cape Mesurado Kobe Bryant Mamba (deep learning), a deep learning architecture Mamba (website)...

2 KB (231 words) - 04:16, 5 May 2025

Normalization (machine learning)

nanometers. Activation normalization, on the other hand, is specific to deep learning, and includes methods that rescale the activation of hidden neurons...

35 KB (5,361 words) - 05:48, 19 June 2025

Generative pre-trained transformer

is widely used in generative AI chatbots. GPTs are based on a deep learning architecture called the transformer. They are pre-trained on large data sets...

54 KB (4,320 words) - 20:33, 2 August 2025

U-Net (category Deep learning software applications)

Jian, Cheng-Yuan; Yang, Yong-Cheng; Lin, Chun-Liang (2023-02-14). "Deep learning based atomic defect detection framework for two-dimensional materials"...

12 KB (1,285 words) - 15:27, 26 June 2025

Variational autoencoder (category Unsupervised learning)

In machine learning, a variational autoencoder (VAE) is an artificial neural network architecture introduced by Diederik P. Kingma and Max Welling. It...

27 KB (3,967 words) - 21:16, 2 August 2025

Attention (machine learning)

self-attention". Recurrent neural network seq2seq Transformer (deep learning architecture) Attention Dynamic neural network Cherry, E. Colin (1953). "Some...

41 KB (3,641 words) - 13:27, 26 July 2025

Feature learning

autoencoders. Self-supervised learning has since been applied to many modalities through the use of deep neural network architectures such as convolutional neural...

45 KB (5,114 words) - 09:22, 4 July 2025

Neural field (category Deep learning)

Ian; Bengio, Yoshua; Courville, Aaron (2016). Deep learning. Adaptive computation and machine learning. Cambridge, Mass: The MIT press. ISBN 978-0-262-03561-3...

21 KB (2,336 words) - 12:11, 19 July 2025

Topological deep learning

Topological deep learning (TDL) is a research field that extends deep learning to handle complex, non-Euclidean data structures. Traditional deep learning models...

31 KB (3,296 words) - 17:16, 24 June 2025

Feature engineering (redirect from Feature extraction (machine learning))

Multi-relational decision tree learning (MRDTL) uses a supervised algorithm that is similar to a decision tree. Deep Feature Synthesis uses simpler methods...

20 KB (2,184 words) - 08:08, 17 July 2025

PyTorch (category Deep learning software)

an open-source machine learning library based on the Torch library, used for applications such as computer vision, deep learning research and natural language...

18 KB (1,540 words) - 20:40, 23 July 2025

GPT-1 (section Architecture)

large language models following Google's invention of the transformer architecture in 2017. In June 2018, OpenAI released a paper entitled "Improving Language...

32 KB (1,069 words) - 19:58, 2 August 2025

History of artificial neural networks (section Deep learning)

ongoing AI spring, and further increasing interest in deep learning. The transformer architecture was first described in 2017 as a method to teach ANNs...

85 KB (8,625 words) - 20:54, 10 June 2025

Deep belief network

In machine learning, a deep belief network (DBN) is a generative graphical model, or alternatively a class of deep neural network, composed of multiple...

11 KB (1,280 words) - 17:04, 13 August 2024

Generative adversarial network (category Unsupervised learning)

Realistic artificially generated media Deep learning – Branch of machine learning Diffusion model – Deep learning algorithm Generative artificial intelligence –...

95 KB (13,885 words) - 21:17, 2 August 2025

Long short-term memory (category Deep learning)

Decade of Deep Learning / Outlook on the 2020s". AI Blog. IDSIA, Switzerland. Retrieved 2022-04-30. Calin, Ovidiu (14 February 2020). Deep Learning Architectures...

52 KB (5,822 words) - 21:03, 2 August 2025

TensorFlow (category Deep learning software)

training and inference of neural networks. It is one of the most popular deep learning frameworks, alongside others such as PyTorch. It is free and open-source...

52 KB (4,064 words) - 07:28, 17 July 2025

Automated machine learning

of their model. If deep learning is used, the architecture of the neural network must also be chosen manually by the machine learning expert. Each of these...

9 KB (1,034 words) - 10:43, 30 June 2025

Neural architecture search

the field of machine learning. NAS has been used to design networks that are on par with or outperform hand-designed architectures. Methods for NAS can...

26 KB (2,980 words) - 15:27, 18 November 2024

Mixture of experts (category Machine learning algorithms)

previous section described MoE as it was used before the era of deep learning. After deep learning, MoE found applications in running the largest models, as...

44 KB (5,634 words) - 08:30, 12 July 2025