• Temporal difference (TD) learning refers to a class of model-free reinforcement learning methods which learn by bootstrapping from the current estimate...
    12 KB (1,565 words) - 20:36, 20 October 2024
  • Thumbnail for Richard S. Sutton
    Richard S. Sutton (category Machine learning researchers)
    computational reinforcement learning, having several significant contributions to the field, including temporal difference learning and policy gradient methods...
    16 KB (1,347 words) - 06:22, 9 June 2025
  • Thumbnail for Reinforcement learning
    2018, §6. Temporal-Difference Learning. Bradtke, Steven J.; Barto, Andrew G. (1996). "Learning to predict by the method of temporal differences". Machine...
    69 KB (8,193 words) - 11:38, 2 June 2025
  • value ⏟ new value (temporal difference target) ) {\displaystyle Q^{new}(S_{t},A_{t})\leftarrow (1-\underbrace {\alpha } _{\text{learning rate}})\cdot \underbrace...
    29 KB (3,835 words) - 15:13, 21 April 2025
  • Thumbnail for Backgammon
    near the expert level. Its neural network was trained using temporal difference learning applied to data generated from self-play. According to assessments...
    83 KB (10,489 words) - 00:37, 6 June 2025
  • TD-Gammon (category Applied machine learning)
    fact that it is an artificial neural net trained by a form of temporal-difference learning, specifically TD-Lambda. It explored strategies that humans had...
    15 KB (1,659 words) - 07:09, 25 May 2025
  • Generalization Meta-learning Inductive bias Metadata Reinforcement learning Q-learning State–action–reward–state–action (SARSA) Temporal difference learning (TD) Learning...
    39 KB (3,386 words) - 19:51, 2 June 2025
  • Thumbnail for 2048 (video game)
    search for better parameter values; some papers used temporal difference reinforcement learning. Dickey, Megan Rose (23 March 2014). "Puzzle Game 2048...
    28 KB (2,480 words) - 10:55, 12 June 2025
  • Thumbnail for Learning disability
    Therefore, some people can be more accurately described as having a "learning difference", thus avoiding any misconception of being disabled with a possible...
    92 KB (10,781 words) - 20:53, 14 June 2025
  • Thumbnail for KnightCap
    KnightCap, introduced in the late 1990s, was an experiment in temporal difference learning as applied to chess. This technique allowed KnightCap to automatically...
    3 KB (248 words) - 12:04, 25 January 2025
  • visual cortex (ConvNet) and reinforcement learning inspired by the basal ganglia (Temporal difference learning). Notable affinity groups have emerged from...
    13 KB (1,236 words) - 09:03, 19 February 2025
  • Times. Retrieved 8 June 2016. Tesauro, Gerald (March 1995). "Temporal difference learning and TD-Gammon". Communications of the ACM. 38 (3): 58–68. doi:10...
    33 KB (1,764 words) - 05:08, 20 May 2025
  • accessed again, the time difference will be sent to the reuse distance predictor. The RDP uses temporal difference learning, where the new RDP value will...
    38 KB (4,883 words) - 21:33, 6 June 2025
  • play world-class backgammon partly by playing against itself (temporal difference learning with neural networks). Serenata de Amor, project for the analysis...
    40 KB (3,541 words) - 11:08, 21 May 2025
  • by ESRO Technical drawing, a term used in the design process Temporal difference learning, a prediction method Terrestrial Dynamical time, an obsolete...
    6 KB (884 words) - 15:55, 28 February 2025
  • Thumbnail for Monte Carlo method
    Multilevel Monte Carlo method Quasi-Monte Carlo method Sobol sequence Temporal difference learning Kalos & Whitlock 2008. Kroese, D. P.; Brereton, T.; Taimre, T...
    91 KB (10,690 words) - 23:18, 29 April 2025
  • Alexander WH, Brown JW (June 2010). "Hyperbolically discounted temporal difference learning". Neural Computation. 22 (6): 1511–1527. doi:10.1162/neco.2010...
    110 KB (10,169 words) - 15:04, 12 June 2025
  • {\displaystyle u(x)} . The critic and actor are trained iteratively using temporal difference learning or gradient descent to satisfy the Hamilton-Jacobi-Bellman (HJB)...
    8 KB (995 words) - 01:19, 17 April 2025
  • Temporal anti-aliasing (TAA), also known as TXAA (a proprietary technology) or TMAA/TSSAA (Temporal Super-Sampling Anti-Aliasing), is a spatial anti-aliasing...
    5 KB (598 words) - 23:08, 29 May 2025
  • Proximal policy optimization (category Machine learning algorithms)
    collection and computation can be costly. Reinforcement learning Temporal difference learning Game theory Schulman, John; Levine, Sergey; Moritz, Philipp;...
    17 KB (2,504 words) - 18:57, 11 April 2025
  • Viking. ISBN 9781101218884. Tesauro, Gerald (1 March 1995). "Temporal difference learning and TD-Gammon". Communications of the ACM. 38 (3): 58–68. doi:10...
    26 KB (3,081 words) - 21:13, 8 May 2025
  • State–action–reward–state–action (category Machine learning algorithms)
    mapping Constructing skill trees Q-learning Temporal difference learning Reinforcement learning Online Q-Learning using Connectionist Systems" by Rummery...
    6 KB (716 words) - 19:17, 6 December 2024
  • In statistics and machine learning, ensemble methods use multiple learning algorithms to obtain better predictive performance than could be obtained from...
    53 KB (6,685 words) - 14:14, 8 June 2025
  • Gerald Tesauro (category Reinforcement learning)
    world-championship level through self-play and temporal difference learning, an early success in reinforcement learning and neural networks. He subsequently researched...
    18 KB (1,636 words) - 21:39, 6 June 2025
  • Thumbnail for Feature learning
    the same/similar information. Therefore, for a dynamic system, a temporal difference in its embeddings may be explained by misalignment of embeddings...
    45 KB (5,114 words) - 02:41, 2 June 2025
  • Thumbnail for Evaluation function
    1126/science.aar6404. PMID 30523106. Tesauro, Gerald (March 1995). "Temporal Difference Learning and TD-Gammon". Communications of the ACM. 38 (3): 58–68. doi:10...
    19 KB (2,436 words) - 11:41, 25 May 2025
  • Tesauro, Gerald (May 1, 1992). "Practical issues in temporal difference learning". Machine Learning. 8 (3–4): 257–277. doi:10.1007/BF00992697. Shi-Jim...
    37 KB (2,837 words) - 00:34, 31 May 2025
  • Thumbnail for Dopamine
    neuroscientists, because an influential computational-learning method known as temporal difference learning makes heavy use of a signal that encodes prediction...
    140 KB (14,357 words) - 03:16, 14 June 2025
  • language (ISO 639-3 code: tdl), a Plateau language of Nigeria Temporal difference learning (TD), a prediction method Tunneled Direct Link Setup (TDLS) Two...
    891 bytes (131 words) - 03:37, 8 February 2025
  • Thumbnail for Transverse temporal gyrus
    Additionally this difference in processing rate was found to be related to the volume of rate-related cortex in the gyri; right transverse temporal gyri were...
    8 KB (882 words) - 22:39, 29 April 2025