Deep reinforcement learning (DRL) is a subfield of machine learning that combines principles of reinforcement learning (RL) and deep learning. It involves...
12 KB (1,658 words) - 12:58, 11 June 2025
Reinforcement learning (RL) is an interdisciplinary area of machine learning and optimal control concerned with how an intelligent agent should take actions...
69 KB (8,194 words) - 13:01, 17 June 2025
Q-learning is a reinforcement learning algorithm that trains an agent to assign values to its possible actions based on its current state, without requiring...
29 KB (3,835 words) - 15:13, 21 April 2025
Multi-agent reinforcement learning (MARL) is a sub-field of reinforcement learning. It focuses on studying the behavior of multiple learning agents that...
29 KB (3,030 words) - 12:25, 24 May 2025
losing. Reinforcement learning is used heavily in the field of machine learning and can be seen in methods such as Q-learning, policy search, Deep Q-networks...
34 KB (4,184 words) - 21:43, 2 May 2025
In machine learning, reinforcement learning from human feedback (RLHF) is a technique to align an intelligent agent with human preferences. It involves...
62 KB (8,617 words) - 19:50, 11 May 2025
In reinforcement learning (RL), a model-free algorithm is an algorithm which does not estimate the transition probability distribution (and the reward...
6 KB (614 words) - 16:21, 27 January 2025
Proximal policy optimization (category Reinforcement learning)
is a reinforcement learning (RL) algorithm for training an intelligent agent. Specifically, it is a policy gradient method, often used for deep RL when...
17 KB (2,504 words) - 18:57, 11 April 2025
Chelsea Finn (category Machine learning researchers)
worked on robot learning algorithms from deep predictive models. She delivered a massive open online course on deep reinforcement learning. She was the first...
9 KB (740 words) - 22:52, 17 April 2025
Imitation learning is a paradigm in reinforcement learning, where an agent learns to perform a task by supervised learning from expert demonstrations....
13 KB (1,339 words) - 15:09, 2 June 2025
Deep learning is a subset of machine learning that focuses on utilizing multilayered neural networks to perform tasks such as classification, regression...
180 KB (17,775 words) - 21:04, 10 June 2025
Convolutional neural network (redirect from CNN (machine learning model))
predictions. A deep Q-network (DQN) is a type of deep learning model that combines a deep neural network with Q-learning, a form of reinforcement learning. Unlike...
138 KB (15,585 words) - 07:00, 4 June 2025
(Japanese chess) after a few days of play against itself using reinforcement learning. In 2020, DeepMind made significant advances in the problem of protein...
94 KB (9,162 words) - 06:06, 18 June 2025
his cutting-edge research in robotics and machine learning, particularly in deep reinforcement learning. In 2021, he joined AIX Ventures as an Investment...
9 KB (796 words) - 06:48, 4 June 2025
Paul Christiano (category Machine learning researchers)
co-authored the paper "Deep Reinforcement Learning from Human Preferences" (2017) and other works developing reinforcement learning from human feedback (RLHF)...
14 KB (1,221 words) - 00:26, 6 June 2025
General game playing (section Reinforcement learning)
Starting in 2013, significant progress was made following the deep reinforcement learning approach, including the development of programs that can learn...
32 KB (3,018 words) - 22:04, 20 May 2025
Artificial intelligence (redirect from Probabilistic machine learning)
four of the world's best Gran Turismo drivers using deep reinforcement learning. In 2024, Google DeepMind introduced SIMA, a type of AI capable of autonomously...
280 KB (28,636 words) - 01:05, 8 June 2025
Actor-critic algorithm (category Reinforcement learning)
The actor-critic algorithm (AC) is a family of reinforcement learning (RL) algorithms that combine policy-based RL algorithms such as policy gradient methods...
11 KB (1,868 words) - 20:22, 25 May 2025
David Silver (computer scientist) (category Google DeepMind)
research scientist at Google DeepMind and a professor at University College London. He has led research on reinforcement learning with AlphaGo, AlphaZero and...
8 KB (712 words) - 18:17, 3 May 2025
Jian; Han, Jiawei (2018). Curriculum learning for heterogeneous star network embedding via deep reinforcement learning. pp. 468–476. doi:10.1145/3159652...
13 KB (1,367 words) - 04:26, 25 May 2025
explicit instructions. Within a subdiscipline in machine learning, advances in the field of deep learning have allowed neural networks, a class of statistical...
140 KB (15,573 words) - 11:13, 9 June 2025
Alternative to Reinforcement Learning". arXiv:1703.03864 [stat.ML]. Such FP, Madhavan V, Conti E, Lehman J, Stanley KO, Clune J (20 April 2018). "Deep Neuroevolution:...
169 KB (17,641 words) - 00:21, 11 June 2025
AlphaGo Zero (category Applied machine learning)
Furthermore, AlphaGo Zero performed better than standard deep reinforcement learning models (such as Deep Q-Network implementations) due to its integration of...
23 KB (2,147 words) - 23:53, 29 November 2024
nor the robot. In 2017, OpenAI and DeepMind applied deep learning to the cooperative inverse reinforcement learning in simple domains such as Atari games...
11 KB (1,336 words) - 19:23, 14 July 2024
Wierstra, Daan; Riedmiller, Martin (2013). "Playing Atari with Deep Reinforcement Learning". arXiv:1312.5602 [cs.LG]. Mnih, Volodymyr; Kavukcuoglu, Koray;...
18 KB (1,252 words) - 17:21, 16 April 2025
resembles Ridge regression. Adversarial deep reinforcement learning is an active area of research in reinforcement learning focusing on vulnerabilities of learned...
69 KB (7,819 words) - 08:26, 24 May 2025
Demis Hassabis (category DeepMind people)
made significant advances in deep learning and reinforcement learning, and pioneered the field of deep reinforcement learning which combines these two methods...
80 KB (6,210 words) - 14:49, 10 June 2025
peer-to-peer networks, Internet privacy, social networks, and deep reinforcement learning. He is the Dean of Engineering and Computer Science at NYU Shanghai...
5 KB (401 words) - 07:30, 13 September 2024
classification benchmarks and to policy-gradient-based reinforcement learning. Variational Bayes-Adaptive Deep RL (VariBAD) was introduced in 2019. While MAML...
23 KB (2,496 words) - 16:53, 17 April 2025
Fusion power (section Machine learning)
address fusion heating, measurement, and power production. A deep reinforcement learning system has been used to control a tokamak-based reactor. The...
218 KB (22,686 words) - 22:41, 10 June 2025