Mamba is a deep learning architecture focused on sequence modeling. It was developed by researchers from Carnegie Mellon University and Princeton University...
11 KB (1,159 words) - 21:14, 2 August 2025
In deep learning, transformer is an architecture based on the multi-head attention mechanism, in which text is converted to numerical representations...
106 KB (13,107 words) - 01:38, 26 July 2025
Multimodal learning is a type of deep learning that integrates and processes multiple types of data, referred to as modalities, such as text, audio, images...
9 KB (2,212 words) - 22:40, 1 June 2025
Multilayer perceptron (category Neural network architectures)
In deep learning, a multilayer perceptron (MLP) is a name for a modern feedforward neural network consisting of fully connected neurons with nonlinear...
16 KB (1,932 words) - 03:01, 30 June 2025
Convolutional neural network (redirect from CNN (machine learning model))
that learns features via filter (or kernel) optimization. This type of deep learning network has been applied to process and make predictions from many different...
138 KB (15,555 words) - 03:37, 31 July 2025
using autoencoders, which are a type of neural network architecture used for representation learning. Autoencoders consist of an encoder network that maps...
18 KB (2,047 words) - 06:56, 1 August 2025
Reinforcement learning is one of the three basic machine learning paradigms, alongside supervised learning and unsupervised learning. Reinforcement learning differs...
69 KB (8,200 words) - 18:16, 17 July 2025
Q-learning algorithm. In 2014, Google DeepMind patented an application of Q-learning to deep learning, titled "deep reinforcement learning" or "deep Q-learning"...
29 KB (3,871 words) - 08:25, 31 July 2025
explicit instructions. Within a subdiscipline in machine learning, advances in the field of deep learning have allowed neural networks, a class of statistical...
140 KB (15,517 words) - 04:44, 31 July 2025
regarding local vs non-local learning, as well as shallow vs deep architecture. Biological brains use both shallow and deep circuits as reported by brain...
168 KB (17,613 words) - 12:10, 26 July 2025
Kerala, India Mamba Point, zone in Monrovia, Liberia, north of Cape Mesurado Kobe Bryant Mamba (deep learning), a deep learning architecture Mamba (website)...
2 KB (231 words) - 04:16, 5 May 2025
nanometers. Activation normalization, on the other hand, is specific to deep learning, and includes methods that rescale the activation of hidden neurons...
35 KB (5,361 words) - 05:48, 19 June 2025
is widely used in generative AI chatbots. GPTs are based on a deep learning architecture called the transformer. They are pre-trained on large data sets...
54 KB (4,320 words) - 20:33, 2 August 2025
U-Net (category Deep learning software applications)
Jian, Cheng-Yuan; Yang, Yong-Cheng; Lin, Chun-Liang (2023-02-14). "Deep learning based atomic defect detection framework for two-dimensional materials"...
12 KB (1,285 words) - 15:27, 26 June 2025
Variational autoencoder (category Unsupervised learning)
In machine learning, a variational autoencoder (VAE) is an artificial neural network architecture introduced by Diederik P. Kingma and Max Welling. It...
27 KB (3,967 words) - 21:16, 2 August 2025
self-attention". Recurrent neural network seq2seq Transformer (deep learning architecture) Attention Dynamic neural network Cherry, E. Colin (1953). "Some...
41 KB (3,641 words) - 13:27, 26 July 2025
autoencoders. Self-supervised learning has since been applied to many modalities through the use of deep neural network architectures such as convolutional neural...
45 KB (5,114 words) - 09:22, 4 July 2025
Neural field (category Deep learning)
Ian; Bengio, Yoshua; Courville, Aaron (2016). Deep learning. Adaptive computation and machine learning. Cambridge, Mass: The MIT press. ISBN 978-0-262-03561-3...
21 KB (2,336 words) - 12:11, 19 July 2025
Topological deep learning (TDL) is a research field that extends deep learning to handle complex, non-Euclidean data structures. Traditional deep learning models...
31 KB (3,296 words) - 17:16, 24 June 2025
Feature engineering (redirect from Feature extraction (machine learning))
Multi-relational decision tree learning (MRDTL) uses a supervised algorithm that is similar to a decision tree. Deep Feature Synthesis uses simpler methods...
20 KB (2,184 words) - 08:08, 17 July 2025
PyTorch (category Deep learning software)
an open-source machine learning library based on the Torch library, used for applications such as computer vision, deep learning research and natural language...
18 KB (1,540 words) - 20:40, 23 July 2025
GPT-1 (section Architecture)
large language models following Google's invention of the transformer architecture in 2017. In June 2018, OpenAI released a paper entitled "Improving Language...
32 KB (1,069 words) - 19:58, 2 August 2025
ongoing AI spring, and further increasing interest in deep learning. The transformer architecture was first described in 2017 as a method to teach ANNs...
85 KB (8,625 words) - 20:54, 10 June 2025
In machine learning, a deep belief network (DBN) is a generative graphical model, or alternatively a class of deep neural network, composed of multiple...
11 KB (1,280 words) - 17:04, 13 August 2024
Generative adversarial network (category Unsupervised learning)
Realistic artificially generated media Deep learning – Branch of machine learning Diffusion model – Deep learning algorithm Generative artificial intelligence –...
95 KB (13,885 words) - 21:17, 2 August 2025
Long short-term memory (category Deep learning)
Decade of Deep Learning / Outlook on the 2020s". AI Blog. IDSIA, Switzerland. Retrieved 2022-04-30. Calin, Ovidiu (14 February 2020). Deep Learning Architectures...
52 KB (5,822 words) - 21:03, 2 August 2025
TensorFlow (category Deep learning software)
training and inference of neural networks. It is one of the most popular deep learning frameworks, alongside others such as PyTorch. It is free and open-source...
52 KB (4,064 words) - 07:28, 17 July 2025
of their model. If deep learning is used, the architecture of the neural network must also be chosen manually by the machine learning expert. Each of these...
9 KB (1,034 words) - 10:43, 30 June 2025
the field of machine learning. NAS has been used to design networks that are on par with or outperform hand-designed architectures. Methods for NAS can...
26 KB (2,980 words) - 15:27, 18 November 2024
Mixture of experts (category Machine learning algorithms)
previous section described MoE as it was used before the era of deep learning. After deep learning, MoE found applications in running the largest models, as...
44 KB (5,634 words) - 08:30, 12 July 2025