• Mamba is a deep learning architecture focused on sequence modeling. It was developed by researchers from Carnegie Mellon University and Princeton University...
    11 KB (1,159 words) - 21:14, 2 August 2025
  • Thumbnail for Transformer (deep learning architecture)
    In deep learning, transformer is an architecture based on the multi-head attention mechanism, in which text is converted to numerical representations...
    106 KB (13,107 words) - 01:38, 26 July 2025
  • Multimodal learning is a type of deep learning that integrates and processes multiple types of data, referred to as modalities, such as text, audio, images...
    9 KB (2,212 words) - 22:40, 1 June 2025
  • Multilayer perceptron (category Neural network architectures)
    In deep learning, a multilayer perceptron (MLP) is a name for a modern feedforward neural network consisting of fully connected neurons with nonlinear...
    16 KB (1,932 words) - 03:01, 30 June 2025
  • that learns features via filter (or kernel) optimization. This type of deep learning network has been applied to process and make predictions from many different...
    138 KB (15,555 words) - 03:37, 31 July 2025
  • using autoencoders, which are a type of neural network architecture used for representation learning. Autoencoders consist of an encoder network that maps...
    18 KB (2,047 words) - 06:56, 1 August 2025
  • Thumbnail for Reinforcement learning
    Reinforcement learning is one of the three basic machine learning paradigms, alongside supervised learning and unsupervised learning. Reinforcement learning differs...
    69 KB (8,200 words) - 18:16, 17 July 2025
  • Q-learning algorithm. In 2014, Google DeepMind patented an application of Q-learning to deep learning, titled "deep reinforcement learning" or "deep Q-learning"...
    29 KB (3,871 words) - 08:25, 31 July 2025
  • explicit instructions. Within a subdiscipline in machine learning, advances in the field of deep learning have allowed neural networks, a class of statistical...
    140 KB (15,517 words) - 04:44, 31 July 2025
  • Thumbnail for Neural network (machine learning)
    regarding local vs non-local learning, as well as shallow vs deep architecture. Biological brains use both shallow and deep circuits as reported by brain...
    168 KB (17,613 words) - 12:10, 26 July 2025
  • Kerala, India Mamba Point, zone in Monrovia, Liberia, north of Cape Mesurado Kobe Bryant Mamba (deep learning), a deep learning architecture Mamba (website)...
    2 KB (231 words) - 04:16, 5 May 2025
  • nanometers. Activation normalization, on the other hand, is specific to deep learning, and includes methods that rescale the activation of hidden neurons...
    35 KB (5,361 words) - 05:48, 19 June 2025
  • Thumbnail for Generative pre-trained transformer
    is widely used in generative AI chatbots. GPTs are based on a deep learning architecture called the transformer. They are pre-trained on large data sets...
    54 KB (4,320 words) - 20:33, 2 August 2025
  • U-Net (category Deep learning software applications)
    Jian, Cheng-Yuan; Yang, Yong-Cheng; Lin, Chun-Liang (2023-02-14). "Deep learning based atomic defect detection framework for two-dimensional materials"...
    12 KB (1,285 words) - 15:27, 26 June 2025
  • Thumbnail for Variational autoencoder
    Variational autoencoder (category Unsupervised learning)
    In machine learning, a variational autoencoder (VAE) is an artificial neural network architecture introduced by Diederik P. Kingma and Max Welling. It...
    27 KB (3,967 words) - 21:16, 2 August 2025
  • Thumbnail for Attention (machine learning)
    self-attention". Recurrent neural network seq2seq Transformer (deep learning architecture) Attention Dynamic neural network Cherry, E. Colin (1953). "Some...
    41 KB (3,641 words) - 13:27, 26 July 2025
  • Thumbnail for Feature learning
    autoencoders. Self-supervised learning has since been applied to many modalities through the use of deep neural network architectures such as convolutional neural...
    45 KB (5,114 words) - 09:22, 4 July 2025
  • Neural field (category Deep learning)
    Ian; Bengio, Yoshua; Courville, Aaron (2016). Deep learning. Adaptive computation and machine learning. Cambridge, Mass: The MIT press. ISBN 978-0-262-03561-3...
    21 KB (2,336 words) - 12:11, 19 July 2025
  • Topological deep learning (TDL) is a research field that extends deep learning to handle complex, non-Euclidean data structures. Traditional deep learning models...
    31 KB (3,296 words) - 17:16, 24 June 2025
  • Multi-relational decision tree learning (MRDTL) uses a supervised algorithm that is similar to a decision tree. Deep Feature Synthesis uses simpler methods...
    20 KB (2,184 words) - 08:08, 17 July 2025
  • PyTorch (category Deep learning software)
    an open-source machine learning library based on the Torch library, used for applications such as computer vision, deep learning research and natural language...
    18 KB (1,540 words) - 20:40, 23 July 2025
  • Thumbnail for GPT-1
    GPT-1 (section Architecture)
    large language models following Google's invention of the transformer architecture in 2017. In June 2018, OpenAI released a paper entitled "Improving Language...
    32 KB (1,069 words) - 19:58, 2 August 2025
  • ongoing AI spring, and further increasing interest in deep learning. The transformer architecture was first described in 2017 as a method to teach ANNs...
    85 KB (8,625 words) - 20:54, 10 June 2025
  • Thumbnail for Deep belief network
    In machine learning, a deep belief network (DBN) is a generative graphical model, or alternatively a class of deep neural network, composed of multiple...
    11 KB (1,280 words) - 17:04, 13 August 2024
  • Thumbnail for Generative adversarial network
    Generative adversarial network (category Unsupervised learning)
    Realistic artificially generated media Deep learning – Branch of machine learning Diffusion model – Deep learning algorithm Generative artificial intelligence –...
    95 KB (13,885 words) - 21:17, 2 August 2025
  • Thumbnail for Long short-term memory
    Long short-term memory (category Deep learning)
    Decade of Deep Learning / Outlook on the 2020s". AI Blog. IDSIA, Switzerland. Retrieved 2022-04-30. Calin, Ovidiu (14 February 2020). Deep Learning Architectures...
    52 KB (5,822 words) - 21:03, 2 August 2025
  • Thumbnail for TensorFlow
    TensorFlow (category Deep learning software)
    training and inference of neural networks. It is one of the most popular deep learning frameworks, alongside others such as PyTorch. It is free and open-source...
    52 KB (4,064 words) - 07:28, 17 July 2025
  • of their model. If deep learning is used, the architecture of the neural network must also be chosen manually by the machine learning expert. Each of these...
    9 KB (1,034 words) - 10:43, 30 June 2025
  • the field of machine learning. NAS has been used to design networks that are on par with or outperform hand-designed architectures. Methods for NAS can...
    26 KB (2,980 words) - 15:27, 18 November 2024
  • Mixture of experts (category Machine learning algorithms)
    previous section described MoE as it was used before the era of deep learning. After deep learning, MoE found applications in running the largest models, as...
    44 KB (5,634 words) - 08:30, 12 July 2025