• Thumbnail for Multi-agent reinforcement learning
    Multi-agent reinforcement learning (MARL) is a sub-field of reinforcement learning. It focuses on studying the behavior of multiple learning agents that...
    29 KB (3,030 words) - 12:25, 24 May 2025
  • Thumbnail for Multi-agent system
    Microbial intelligence Multi-agent planning Multi-agent reinforcement learning Pattern-oriented modeling PlatBox Project Reinforcement learning Scientific community...
    28 KB (2,918 words) - 13:53, 4 July 2025
  • Q-learning is a reinforcement learning algorithm that trains an agent to assign values to its possible actions based on its current state, without requiring...
    30 KB (3,871 words) - 14:53, 3 August 2025
  • Thumbnail for Reinforcement learning
    Reinforcement learning (RL) is an interdisciplinary area of machine learning and optimal control concerned with how an intelligent agent should take actions...
    69 KB (8,200 words) - 18:16, 17 July 2025
  • Deep reinforcement learning (DRL) is a subfield of machine learning that combines principles of reinforcement learning (RL) and deep learning. It involves...
    12 KB (1,658 words) - 13:16, 21 July 2025
  • In machine learning, reinforcement learning from human feedback (RLHF) is a technique to align an intelligent agent with human preferences. It involves...
    62 KB (8,617 words) - 14:51, 3 August 2025
  • onto the server.: v–vi  A mobile agent is a type of software agent, with the feature of autonomy, social ability, learning, and most significantly, mobility...
    6 KB (737 words) - 04:04, 18 April 2025
  • telecommunications and reinforcement learning. Reinforcement learning utilizes the MDP framework to model the interaction between a learning agent and its environment...
    35 KB (5,169 words) - 20:19, 22 July 2025
  • Particularly, reinforcement learning (RL) is essential in assisting agentic AI in making self-directed choices by supporting agents in learning best actions...
    10 KB (1,027 words) - 10:57, 30 July 2025
  • Imitation learning is a paradigm in reinforcement learning, where an agent learns to perform a task by supervised learning from expert demonstrations....
    13 KB (1,339 words) - 00:05, 21 July 2025
  • DAI is closely related to and a predecessor of the field of multi-agent systems. Multi-agent systems and distributed problem solving are the two main DAI...
    13 KB (1,534 words) - 12:41, 13 April 2025
  • (30 October 2019). "Grandmaster level in StarCraft II using multi-agent reinforcement learning". Nature. 575 (7782): 350–354. doi:10.1038/S41586-019-1724-Z...
    8 KB (713 words) - 18:17, 3 May 2025
  • Oriol Vinyals (category Machine learning researchers)
    Junhyuk (2019-11-14). "Grandmaster level in StarCraft II using multi-agent reinforcement learning". Nature. 575 (7782): 350–354. Bibcode:2019Natur.575..350V...
    7 KB (516 words) - 23:20, 25 May 2025
  • reinforcement learning agents. Intuitively, agents learn to improve their performance by playing "against themselves". In multi-agent reinforcement learning...
    4 KB (501 words) - 17:55, 25 June 2025
  • Standards and Scaleable Agencies". Infrastructure for Agents, Multi-Agent Systems, and Scalable Multi-Agent Systems. Lecture Notes in Computer Science. Vol...
    5 KB (467 words) - 00:25, 26 April 2024
  • integrations of reinforcement learning and deep learning architectures have enabled simulation of AI-driven agents in complex multi-agent economic models...
    22 KB (2,023 words) - 14:03, 3 August 2025
  • Ten Cent Diet". "A structured prediction approach for generalization in cooperative multi-agent reinforcement learning". GLOP home page GLOP source code...
    2 KB (134 words) - 10:39, 29 April 2025
  • Google DeepMind (category Deep learning)
    II using multi-agent reinforcement learning". DeepMind Blog. 31 October 2019. Retrieved 31 October 2019. Gao, Jim (2014). "Machine Learning Applications...
    98 KB (9,531 words) - 05:53, 3 August 2025
  • of multi-agent systems or multi-agent simulation in that the goal of ABM is to search for explanatory insight into the collective behavior of agents obeying...
    90 KB (9,330 words) - 02:23, 2 August 2025
  • Thumbnail for Multi-armed bandit
    by the end of a finite number of rounds. The multi-armed bandit problem is a classic reinforcement learning problem that exemplifies the exploration–exploitation...
    67 KB (7,668 words) - 23:00, 30 July 2025
  • in the field of multi-agent reinforcement learning for a dual purpose: A proof-of-concept to show that modern reinforcement learning algorithms can compete...
    35 KB (3,355 words) - 00:52, 19 April 2025
  • single-policy coverage—sufficient for single-agent reinforcement learning—is inadequate for multi-agent settings, requiring unilateral dataset coverage...
    48 KB (6,093 words) - 17:51, 31 July 2025
  • Proximal policy optimization (category Reinforcement learning)
    Proximal policy optimization (PPO) is a reinforcement learning (RL) algorithm for training an intelligent agent. Specifically, it is a policy gradient...
    17 KB (2,504 words) - 14:52, 3 August 2025
  • Temporal difference (TD) learning refers to a class of model-free reinforcement learning methods which learn by bootstrapping from the current estimate...
    12 KB (1,565 words) - 14:57, 3 August 2025
  • objectives), distributed agents (being executed on physically distinct computers), multi-agent systems (distributed agents that work together to achieve...
    24 KB (2,918 words) - 17:58, 20 May 2025
  • Hidden Agenda is used in the field of multi-agent reinforcement learning to show that artificial intelligence agents are able to learn a variety of social...
    139 KB (12,147 words) - 19:50, 30 July 2025
  • A multi-agent system is a system created from multiple autonomous elements interacting with each other. These are called agents. In a multi-agent system...
    11 KB (1,190 words) - 01:15, 19 December 2024
  • simulation-based optimisation, multi-agent systems, swarm intelligence, statistics and genetic algorithms. In reinforcement learning, the environment is typically...
    140 KB (15,517 words) - 12:17, 3 August 2025
  • problem solving and Coordination Multi-agent systems and Software agent and Self-organization Multi-agent reinforcement learning Task Analysis, Environment...
    3 KB (289 words) - 03:23, 22 June 2024
  • Agents in a Multi-Agent World (MAAMAW-96). Rodriguez, Sebastian; Gaud, Nicolas; Galland, Stéphane (2014). "SARL: A General-Purpose Agent-Oriented Programming...
    8 KB (874 words) - 13:21, 10 February 2025