• Deep reinforcement learning (DRL) is a subfield of machine learning that combines principles of reinforcement learning (RL) and deep learning. It involves...
    12 KB (1,658 words) - 13:16, 21 July 2025
  • Thumbnail for Reinforcement learning
    Reinforcement learning (RL) is an interdisciplinary area of machine learning and optimal control concerned with how an intelligent agent should take actions...
    69 KB (8,200 words) - 18:16, 17 July 2025
  • Q-learning is a reinforcement learning algorithm that trains an agent to assign values to its possible actions based on its current state, without requiring...
    30 KB (3,871 words) - 14:53, 3 August 2025
  • Thumbnail for Multi-agent reinforcement learning
    Multi-agent reinforcement learning (MARL) is a sub-field of reinforcement learning. It focuses on studying the behavior of multiple learning agents that...
    29 KB (3,030 words) - 12:25, 24 May 2025
  • In machine learning, reinforcement learning from human feedback (RLHF) is a technique to align an intelligent agent with human preferences. It involves...
    62 KB (8,617 words) - 14:51, 3 August 2025
  • losing. Reinforcement learning is used heavily in the field of machine learning and can be seen in methods such as Q-learning, policy search, Deep Q-networks...
    35 KB (4,209 words) - 05:48, 3 August 2025
  • co‑authored Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels (Yarats, Kostrikov & Fergus, ICLR 2021), which introduced...
    4 KB (413 words) - 22:02, 30 July 2025
  • In reinforcement learning (RL), a model-free algorithm is an algorithm which does not estimate the transition probability distribution (and the reward...
    6 KB (614 words) - 16:21, 27 January 2025
  • Thumbnail for Pieter Abbeel
    his cutting-edge research in robotics and machine learning, particularly in deep reinforcement learning. In 2021, he joined AIX Ventures as an Investment...
    9 KB (839 words) - 06:36, 26 June 2025
  • Proximal policy optimization (category Reinforcement learning)
    is a reinforcement learning (RL) algorithm for training an intelligent agent. Specifically, it is a policy gradient method, often used for deep RL when...
    17 KB (2,504 words) - 14:52, 3 August 2025
  • Thumbnail for Deep learning
    In machine learning, deep learning focuses on utilizing multilayered neural networks to perform tasks such as classification, regression, and representation...
    183 KB (18,116 words) - 23:26, 2 August 2025
  • Imitation learning is a paradigm in reinforcement learning, where an agent learns to perform a task by supervised learning from expert demonstrations....
    13 KB (1,339 words) - 00:05, 21 July 2025
  • predictions. A deep Q-network (DQN) is a type of deep learning model that combines a deep neural network with Q-learning, a form of reinforcement learning. Unlike...
    138 KB (15,555 words) - 03:37, 31 July 2025
  • Thumbnail for Chelsea Finn
    Chelsea Finn (category Machine learning researchers)
    worked on robot learning algorithms from deep predictive models. She delivered a massive open online course on deep reinforcement learning. She was the first...
    9 KB (759 words) - 21:07, 25 July 2025
  • Actor-critic algorithm (category Reinforcement learning)
    The actor-critic algorithm (AC) is a family of reinforcement learning (RL) algorithms that combine policy-based RL algorithms such as policy gradient methods...
    11 KB (1,872 words) - 20:51, 25 July 2025
  • (Japanese chess) after a few days of play against itself using reinforcement learning. DeepMind has since trained models for game-playing (MuZero, AlphaStar)...
    98 KB (9,516 words) - 13:20, 4 August 2025
  • Paul Christiano (category Machine learning researchers)
    co-authored the paper "Deep Reinforcement Learning from Human Preferences" (2017) and other works developing reinforcement learning from human feedback (RLHF)...
    14 KB (1,221 words) - 00:26, 6 June 2025
  • David Silver (computer scientist) (category Google DeepMind)
    research scientist at Google DeepMind and a professor at University College London. He has led research on reinforcement learning with AlphaGo, AlphaZero and...
    8 KB (713 words) - 18:17, 3 May 2025
  • Starting in 2013, significant progress was made following the deep reinforcement learning approach, including the development of programs that can learn...
    32 KB (3,040 words) - 05:57, 3 August 2025
  • Thumbnail for Demis Hassabis
    Demis Hassabis (category DeepMind people)
    made significant advances in deep learning and reinforcement learning, and pioneered the field of deep reinforcement learning which combines these two methods...
    82 KB (6,328 words) - 05:55, 5 August 2025
  • AlphaGo Zero (category Applied machine learning)
    Furthermore, AlphaGo Zero performed better than standard deep reinforcement learning models (such as Deep Q-Network implementations) due to its integration of...
    23 KB (2,160 words) - 09:09, 4 August 2025
  • four of the world's best Gran Turismo drivers using deep reinforcement learning. In 2024, Google DeepMind introduced SIMA, a type of AI capable of autonomously...
    285 KB (29,145 words) - 07:39, 1 August 2025
  • Thumbnail for Neural network (machine learning)
    Alternative to Reinforcement Learning". arXiv:1703.03864 [stat.ML]. Such FP, Madhavan V, Conti E, Lehman J, Stanley KO, Clune J (20 April 2018). "Deep Neuroevolution:...
    168 KB (17,613 words) - 12:10, 26 July 2025
  • Wierstra, Daan; Riedmiller, Martin (2013). "Playing Atari with Deep Reinforcement Learning". arXiv:1312.5602 [cs.LG]. Mnih, Volodymyr; Kavukcuoglu, Koray;...
    18 KB (1,251 words) - 18:41, 1 July 2025
  • Jian; Han, Jiawei (2018). Curriculum learning for heterogeneous star network embedding via deep reinforcement learning. pp. 468–476. doi:10.1145/3159652...
    13 KB (1,389 words) - 19:53, 17 July 2025
  • peer-to-peer networks, Internet privacy, social networks, and deep reinforcement learning. He is the Dean of Engineering and Computer Science at NYU Shanghai...
    5 KB (401 words) - 07:30, 13 September 2024
  • classification benchmarks and to policy-gradient-based reinforcement learning. Variational Bayes-Adaptive Deep RL (VariBAD) was introduced in 2019. While MAML...
    23 KB (2,496 words) - 16:53, 17 April 2025
  • Thumbnail for Federated learning
    Guo, Weisi; Nallanathan, Arumugam; Wu, Qihui (2021). "Green Deep Reinforcement Learning for Radio Resource Management: Architecture, Algorithm Compression...
    51 KB (5,875 words) - 19:26, 21 July 2025
  • resembles Ridge regression. Adversarial deep reinforcement learning is an active area of research in reinforcement learning focusing on vulnerabilities of learned...
    70 KB (7,938 words) - 02:14, 25 June 2025
  • Mamba is a deep learning architecture focused on sequence modeling. It was developed by researchers from Carnegie Mellon University and Princeton University...
    11 KB (1,159 words) - 21:14, 2 August 2025