• Deep reinforcement learning (DRL) is a subfield of machine learning that combines principles of reinforcement learning (RL) and deep learning. It involves...
    12 KB (1,658 words) - 12:58, 11 June 2025
  • Thumbnail for Reinforcement learning
    Reinforcement learning (RL) is an interdisciplinary area of machine learning and optimal control concerned with how an intelligent agent should take actions...
    69 KB (8,194 words) - 13:01, 17 June 2025
  • Q-learning is a reinforcement learning algorithm that trains an agent to assign values to its possible actions based on its current state, without requiring...
    29 KB (3,835 words) - 15:13, 21 April 2025
  • Thumbnail for Multi-agent reinforcement learning
    Multi-agent reinforcement learning (MARL) is a sub-field of reinforcement learning. It focuses on studying the behavior of multiple learning agents that...
    29 KB (3,030 words) - 12:25, 24 May 2025
  • losing. Reinforcement learning is used heavily in the field of machine learning and can be seen in methods such as Q-learning, policy search, Deep Q-networks...
    34 KB (4,184 words) - 21:43, 2 May 2025
  • In machine learning, reinforcement learning from human feedback (RLHF) is a technique to align an intelligent agent with human preferences. It involves...
    62 KB (8,617 words) - 19:50, 11 May 2025
  • In reinforcement learning (RL), a model-free algorithm is an algorithm which does not estimate the transition probability distribution (and the reward...
    6 KB (614 words) - 16:21, 27 January 2025
  • Proximal policy optimization (category Reinforcement learning)
    is a reinforcement learning (RL) algorithm for training an intelligent agent. Specifically, it is a policy gradient method, often used for deep RL when...
    17 KB (2,504 words) - 18:57, 11 April 2025
  • Thumbnail for Chelsea Finn
    Chelsea Finn (category Machine learning researchers)
    worked on robot learning algorithms from deep predictive models. She delivered a massive open online course on deep reinforcement learning. She was the first...
    9 KB (740 words) - 22:52, 17 April 2025
  • Imitation learning is a paradigm in reinforcement learning, where an agent learns to perform a task by supervised learning from expert demonstrations....
    13 KB (1,339 words) - 15:09, 2 June 2025
  • Thumbnail for Deep learning
    Deep learning is a subset of machine learning that focuses on utilizing multilayered neural networks to perform tasks such as classification, regression...
    180 KB (17,775 words) - 21:04, 10 June 2025
  • predictions. A deep Q-network (DQN) is a type of deep learning model that combines a deep neural network with Q-learning, a form of reinforcement learning. Unlike...
    138 KB (15,585 words) - 07:00, 4 June 2025
  • (Japanese chess) after a few days of play against itself using reinforcement learning. In 2020, DeepMind made significant advances in the problem of protein...
    94 KB (9,162 words) - 06:06, 18 June 2025
  • Thumbnail for Pieter Abbeel
    his cutting-edge research in robotics and machine learning, particularly in deep reinforcement learning. In 2021, he joined AIX Ventures as an Investment...
    9 KB (796 words) - 06:48, 4 June 2025
  • Paul Christiano (category Machine learning researchers)
    co-authored the paper "Deep Reinforcement Learning from Human Preferences" (2017) and other works developing reinforcement learning from human feedback (RLHF)...
    14 KB (1,221 words) - 00:26, 6 June 2025
  • Starting in 2013, significant progress was made following the deep reinforcement learning approach, including the development of programs that can learn...
    32 KB (3,018 words) - 22:04, 20 May 2025
  • four of the world's best Gran Turismo drivers using deep reinforcement learning. In 2024, Google DeepMind introduced SIMA, a type of AI capable of autonomously...
    280 KB (28,636 words) - 01:05, 8 June 2025
  • Actor-critic algorithm (category Reinforcement learning)
    The actor-critic algorithm (AC) is a family of reinforcement learning (RL) algorithms that combine policy-based RL algorithms such as policy gradient methods...
    11 KB (1,868 words) - 20:22, 25 May 2025
  • David Silver (computer scientist) (category Google DeepMind)
    research scientist at Google DeepMind and a professor at University College London. He has led research on reinforcement learning with AlphaGo, AlphaZero and...
    8 KB (712 words) - 18:17, 3 May 2025
  • Jian; Han, Jiawei (2018). Curriculum learning for heterogeneous star network embedding via deep reinforcement learning. pp. 468–476. doi:10.1145/3159652...
    13 KB (1,367 words) - 04:26, 25 May 2025
  • explicit instructions. Within a subdiscipline in machine learning, advances in the field of deep learning have allowed neural networks, a class of statistical...
    140 KB (15,573 words) - 11:13, 9 June 2025
  • Thumbnail for Neural network (machine learning)
    Alternative to Reinforcement Learning". arXiv:1703.03864 [stat.ML]. Such FP, Madhavan V, Conti E, Lehman J, Stanley KO, Clune J (20 April 2018). "Deep Neuroevolution:...
    169 KB (17,641 words) - 00:21, 11 June 2025
  • AlphaGo Zero (category Applied machine learning)
    Furthermore, AlphaGo Zero performed better than standard deep reinforcement learning models (such as Deep Q-Network implementations) due to its integration of...
    23 KB (2,147 words) - 23:53, 29 November 2024
  • nor the robot. In 2017, OpenAI and DeepMind applied deep learning to the cooperative inverse reinforcement learning in simple domains such as Atari games...
    11 KB (1,336 words) - 19:23, 14 July 2024
  • Wierstra, Daan; Riedmiller, Martin (2013). "Playing Atari with Deep Reinforcement Learning". arXiv:1312.5602 [cs.LG]. Mnih, Volodymyr; Kavukcuoglu, Koray;...
    18 KB (1,252 words) - 17:21, 16 April 2025
  • resembles Ridge regression. Adversarial deep reinforcement learning is an active area of research in reinforcement learning focusing on vulnerabilities of learned...
    69 KB (7,819 words) - 08:26, 24 May 2025
  • Thumbnail for Demis Hassabis
    Demis Hassabis (category DeepMind people)
    made significant advances in deep learning and reinforcement learning, and pioneered the field of deep reinforcement learning which combines these two methods...
    80 KB (6,210 words) - 14:49, 10 June 2025
  • peer-to-peer networks, Internet privacy, social networks, and deep reinforcement learning. He is the Dean of Engineering and Computer Science at NYU Shanghai...
    5 KB (401 words) - 07:30, 13 September 2024
  • classification benchmarks and to policy-gradient-based reinforcement learning. Variational Bayes-Adaptive Deep RL (VariBAD) was introduced in 2019. While MAML...
    23 KB (2,496 words) - 16:53, 17 April 2025
  • Thumbnail for Fusion power
    address fusion heating, measurement, and power production. A deep reinforcement learning system has been used to control a tokamak-based reactor. The...
    218 KB (22,686 words) - 22:41, 10 June 2025