Deep reinforcement learning (DRL) is a subfield of machine learning that combines principles of reinforcement learning (RL) and deep learning. It involves...
12 KB (1,658 words) - 13:16, 21 July 2025
Reinforcement learning (RL) is an interdisciplinary area of machine learning and optimal control concerned with how an intelligent agent should take actions...
69 KB (8,200 words) - 18:16, 17 July 2025
Q-learning is a reinforcement learning algorithm that trains an agent to assign values to its possible actions based on its current state, without requiring...
30 KB (3,871 words) - 14:53, 3 August 2025
Multi-agent reinforcement learning (MARL) is a sub-field of reinforcement learning. It focuses on studying the behavior of multiple learning agents that...
29 KB (3,030 words) - 12:25, 24 May 2025
In machine learning, reinforcement learning from human feedback (RLHF) is a technique to align an intelligent agent with human preferences. It involves...
62 KB (8,617 words) - 14:51, 3 August 2025
losing. Reinforcement learning is used heavily in the field of machine learning and can be seen in methods such as Q-learning, policy search, Deep Q-networks...
35 KB (4,209 words) - 05:48, 3 August 2025
co‑authored Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels (Yarats, Kostrikov & Fergus, ICLR 2021), which introduced...
4 KB (413 words) - 22:02, 30 July 2025
In reinforcement learning (RL), a model-free algorithm is an algorithm which does not estimate the transition probability distribution (and the reward...
6 KB (614 words) - 16:21, 27 January 2025
his cutting-edge research in robotics and machine learning, particularly in deep reinforcement learning. In 2021, he joined AIX Ventures as an Investment...
9 KB (839 words) - 06:36, 26 June 2025
Proximal policy optimization (category Reinforcement learning)
is a reinforcement learning (RL) algorithm for training an intelligent agent. Specifically, it is a policy gradient method, often used for deep RL when...
17 KB (2,504 words) - 14:52, 3 August 2025
In machine learning, deep learning focuses on utilizing multilayered neural networks to perform tasks such as classification, regression, and representation...
183 KB (18,116 words) - 23:26, 2 August 2025
Imitation learning is a paradigm in reinforcement learning, where an agent learns to perform a task by supervised learning from expert demonstrations....
13 KB (1,339 words) - 00:05, 21 July 2025
Convolutional neural network (redirect from CNN (machine learning model))
predictions. A deep Q-network (DQN) is a type of deep learning model that combines a deep neural network with Q-learning, a form of reinforcement learning. Unlike...
138 KB (15,555 words) - 03:37, 31 July 2025
Chelsea Finn (category Machine learning researchers)
worked on robot learning algorithms from deep predictive models. She delivered a massive open online course on deep reinforcement learning. She was the first...
9 KB (759 words) - 21:07, 25 July 2025
Actor-critic algorithm (category Reinforcement learning)
The actor-critic algorithm (AC) is a family of reinforcement learning (RL) algorithms that combine policy-based RL algorithms such as policy gradient methods...
11 KB (1,872 words) - 20:51, 25 July 2025
(Japanese chess) after a few days of play against itself using reinforcement learning. DeepMind has since trained models for game-playing (MuZero, AlphaStar)...
98 KB (9,516 words) - 13:20, 4 August 2025
Paul Christiano (category Machine learning researchers)
co-authored the paper "Deep Reinforcement Learning from Human Preferences" (2017) and other works developing reinforcement learning from human feedback (RLHF)...
14 KB (1,221 words) - 00:26, 6 June 2025
David Silver (computer scientist) (category Google DeepMind)
research scientist at Google DeepMind and a professor at University College London. He has led research on reinforcement learning with AlphaGo, AlphaZero and...
8 KB (713 words) - 18:17, 3 May 2025
General game playing (section Reinforcement learning)
Starting in 2013, significant progress was made following the deep reinforcement learning approach, including the development of programs that can learn...
32 KB (3,040 words) - 05:57, 3 August 2025
Demis Hassabis (category DeepMind people)
made significant advances in deep learning and reinforcement learning, and pioneered the field of deep reinforcement learning which combines these two methods...
82 KB (6,328 words) - 05:55, 5 August 2025
AlphaGo Zero (category Applied machine learning)
Furthermore, AlphaGo Zero performed better than standard deep reinforcement learning models (such as Deep Q-Network implementations) due to its integration of...
23 KB (2,160 words) - 09:09, 4 August 2025
Artificial intelligence (redirect from Probabilistic machine learning)
four of the world's best Gran Turismo drivers using deep reinforcement learning. In 2024, Google DeepMind introduced SIMA, a type of AI capable of autonomously...
285 KB (29,145 words) - 07:39, 1 August 2025
Alternative to Reinforcement Learning". arXiv:1703.03864 [stat.ML]. Such FP, Madhavan V, Conti E, Lehman J, Stanley KO, Clune J (20 April 2018). "Deep Neuroevolution:...
168 KB (17,613 words) - 12:10, 26 July 2025
Wierstra, Daan; Riedmiller, Martin (2013). "Playing Atari with Deep Reinforcement Learning". arXiv:1312.5602 [cs.LG]. Mnih, Volodymyr; Kavukcuoglu, Koray;...
18 KB (1,251 words) - 18:41, 1 July 2025
Jian; Han, Jiawei (2018). Curriculum learning for heterogeneous star network embedding via deep reinforcement learning. pp. 468–476. doi:10.1145/3159652...
13 KB (1,389 words) - 19:53, 17 July 2025
peer-to-peer networks, Internet privacy, social networks, and deep reinforcement learning. He is the Dean of Engineering and Computer Science at NYU Shanghai...
5 KB (401 words) - 07:30, 13 September 2024
classification benchmarks and to policy-gradient-based reinforcement learning. Variational Bayes-Adaptive Deep RL (VariBAD) was introduced in 2019. While MAML...
23 KB (2,496 words) - 16:53, 17 April 2025
Guo, Weisi; Nallanathan, Arumugam; Wu, Qihui (2021). "Green Deep Reinforcement Learning for Radio Resource Management: Architecture, Algorithm Compression...
51 KB (5,875 words) - 19:26, 21 July 2025
resembles Ridge regression. Adversarial deep reinforcement learning is an active area of research in reinforcement learning focusing on vulnerabilities of learned...
70 KB (7,938 words) - 02:14, 25 June 2025
Mamba is a deep learning architecture focused on sequence modeling. It was developed by researchers from Carnegie Mellon University and Princeton University...
11 KB (1,159 words) - 21:14, 2 August 2025