In machine learning, a neural scaling law is an empirical scaling law that describes how neural network performance changes as key factors are scaled up or down....
44 KB (5,830 words) - 06:29, 26 May 2025
power laws are Pareto's law of income distribution, structural self-similarity of fractals, scaling laws in biological systems, and scaling laws in cities...
63 KB (8,193 words) - 12:34, 24 May 2025
performance; see Neural scaling law Tooth scaling, in dentistry, the removal of plaque and calculus Fouling, i.e., formation of a deposit layer (scale) on a solid...
2 KB (316 words) - 14:54, 25 October 2024
laws – Adages and sayings named after a person List of laws § Technology Microprocessor chronology – Timeline of microprocessors Neural scaling law –...
104 KB (10,705 words) - 23:56, 28 May 2025
been confirmed numerically. The scaling behavior of double descent has been found to follow a broken neural scaling law functional form. Grokking (machine...
10 KB (923 words) - 10:43, 24 May 2025
Large language model (section Scaling laws)
"Scaling laws" are empirical statistical laws that predict LLM performance based on such factors. One particular scaling law ("Chinchilla scaling") for...
113 KB (11,794 words) - 05:10, 31 May 2025
Language Models". arXiv:2203.15556 [cs.CL]. Table 20 and page 66 of PaLM: Scaling Language Modeling with Pathways Archived 2023-06-10 at the Wayback Machine...
64 KB (3,361 words) - 16:05, 24 May 2025
Machine Intelligence Research Institute Machine learning Neural scaling law – Statistical law in machine learning Noosphere – Philosophical concept of...
43 KB (4,588 words) - 06:28, 26 May 2025
scaling may refer to: Applying a scale ratio to create a scale model, a physical representation of an object Scaling up a neural network; see neural scaling...
204 bytes (61 words) - 14:44, 25 October 2024
these applications. The prior knowledge of general physical laws acts in the training of neural networks (NNs) as a regularization agent that limits the...
38 KB (4,808 words) - 05:32, 19 May 2025
with infinite-dimensional noise. Backtracking line search Broken Neural Scaling Law Coordinate descent – changes one coordinate at a time, rather than...
52 KB (7,016 words) - 09:28, 13 April 2025
Solving multiple machine learning tasks at the same time Neural scaling law – Statistical law in machine learning Outline of artificial intelligence –...
129 KB (14,171 words) - 19:53, 27 May 2025
In machine learning, a neural network (also artificial neural network or neural net, abbreviated ANN or NN) is a computational model inspired by the structure...
169 KB (17,645 words) - 09:43, 1 June 2025
Ethan; Gupta, Kshitij; Rish, Irina; Krueger, David (2022). Broken Neural Scaling Laws. International Conference on Learning Representations (ICLR), 2023...
64 KB (6,200 words) - 06:52, 1 June 2025
Foundation model (section Scaling)
Rewon; Gray, Scott; Radford, Alec; Wu, Jeffrey (22 January 2020), Scaling Laws for Neural Language Models, arXiv:2001.08361 Jo, Eun Seo; Gebru, Timnit (27...
44 KB (4,719 words) - 15:41, 30 May 2025
Prompt engineering (category Pages using multiple image with auto scaled images)
language models. It is an emergent property of model scale, meaning that breaks in downstream scaling laws occur, leading to its efficacy increasing at a different...
40 KB (4,473 words) - 11:29, 27 May 2025
AI alignment (section Scalable oversight)
Ethan; Gupta, Kshitij; Rish, Irina; Krueger, David (2022). "Broken Neural Scaling Laws". arXiv:2210.14891 [cs.LG]. Dominguez, Daniel (May 19, 2022). "DeepMind...
132 KB (12,973 words) - 21:46, 25 May 2025
behind NTK is not specific to neural networks and can be observed in generic nonlinear models, usually by a suitable scaling. Let f ( x ; θ ) {\displaystyle...
35 KB (5,146 words) - 10:08, 16 April 2025
applications do not scale horizontally. Network function virtualization defines these terms differently: scaling out/in is the ability to scale by adding/removing...
17 KB (2,132 words) - 22:25, 14 December 2024
Perception Sone Buchsbaum, M.; Stevens, S. S. (1971-04-30). "Neural Events and Psychophysical Law". Science. 170 (3962): 1043. Bibcode:1971Sci...172..502B...
12 KB (1,326 words) - 09:48, 30 January 2025
Deep learning (redirect from Deep neural network)
is a subset of machine learning that focuses on utilizing multilayered neural networks to perform tasks such as classification, regression, and representation...
180 KB (17,772 words) - 15:04, 30 May 2025
Normalization (statistics) (section Feature Scaling)
different units, hatching feature scaling – a method used to rescale data to a fixed range – like min-max scaling and robust scaling. This modern normalization...
12 KB (1,180 words) - 21:28, 25 May 2025
An optical neural network is a physical implementation of an artificial neural network with optical components. Early optical neural networks used a photorefractive...
15 KB (1,761 words) - 15:56, 19 January 2025
Neural adaptation or sensory adaptation is a gradual decrease over time in the responsiveness of the sensory system to a constant stimulus. It is usually...
29 KB (3,694 words) - 19:53, 24 May 2025
argued that 1/f scaling in EEG recordings is inconsistent with critical states, and whether SOC is a fundamental property of neural systems remains an...
27 KB (3,042 words) - 09:39, 5 May 2025
Neural Network Quantum States (NQS or NNQS) is a general class of variational quantum states parameterized in terms of an artificial neural network. It...
5 KB (809 words) - 18:54, 16 April 2025
least approximately) fall into the class of scale-free networks, meaning that they have power-law (or scale-free) degree distributions, while random graph...
21 KB (2,744 words) - 11:12, 6 February 2025
LeNet (category Artificial neural networks)
LeNet is a series of convolutional neural network architectures created by a research group in AT&T Bell Laboratories during the 1988 to 1998 period, centered...
30 KB (3,887 words) - 01:19, 29 May 2025
A scale-free network is a network whose degree distribution follows a power law, at least asymptotically. That is, the fraction P(k) of nodes in the network...
47 KB (6,013 words) - 05:15, 12 April 2025
Spatial heterogeneity (section Law of geography)
re-phrased as scaling hierarchy of far more small things than large ones. It has been formulated as a scaling law. Spatial heterogeneity or scaling hierarchy...
12 KB (1,282 words) - 17:55, 25 May 2025