AI_capability_control Search Results

AI capability control

intelligence (AI) design, AI capability control proposals, also referred to as AI confinement, aim to increase our ability to monitor and control the behavior...

25 KB (3,182 words) - 09:56, 20 July 2025

AI takeover

to align AI goal systems with human values, and capability control, which aims to reduce an AI system's capacity to harm humans or gain control. An example...

39 KB (4,196 words) - 18:24, 3 August 2025

AI alignment

subfield of AI safety, the study of how to build safe AI systems. Other subfields of AI safety include robustness, monitoring, and capability control. Research...

133 KB (13,069 words) - 15:35, 21 July 2025

Existential risk from artificial intelligence (redirect from Existential risk of AI)

greater chance that human inability to control AI will cause an existential catastrophe. In 2023, hundreds of AI experts and other notable figures signed...

127 KB (13,309 words) - 09:56, 20 July 2025

Human Compatible (redirect from Human compatible AI and the problem of control)

progress in AI capability is inevitable because of economic pressures. Such pressures can already be seen in the development of existing AI technologies...

12 KB (1,133 words) - 09:57, 20 July 2025

Superintelligence: Paths, Dangers, Strategies

an existential catastrophe, it is necessary to successfully solve the "AI control problem" for the first superintelligence. The solution might involve instilling...

13 KB (1,273 words) - 09:58, 20 July 2025

Artificial intelligence (redirect from AI)

Artificial intelligence (AI) is the capability of computational systems to perform tasks typically associated with human intelligence, such as learning...

285 KB (29,145 words) - 07:39, 1 August 2025

Effective accelerationism

decelerationists). The movement carries utopian undertones and advocates for faster AI progress to ensure human survival and propagate consciousness throughout the...

25 KB (2,112 words) - 09:57, 20 July 2025

Claude (language model) (redirect from Claude.ai)

@AnthropicAI (October 22, 2024). "Introducing an upgraded Claude 3.5 Sonnet, and a new model, Claude 3.5 Haiku. We're also introducing a new capability in beta:...

26 KB (2,274 words) - 20:30, 2 August 2025

Technology

against the hypothetical risk of an AI takeover, and have advocated for the use of AI capability control in addition to AI alignment methods. Other fields...

106 KB (10,332 words) - 20:06, 18 July 2025

AI safety

subfield of AI safety, the study of how to build safe AI systems. Other subfields of AI safety include robustness, monitoring, and capability control. Research...

88 KB (10,513 words) - 22:49, 31 July 2025

Artificial general intelligence (redirect from Hard AI)

Artificial general intelligence (AGI)—sometimes called human‑level intelligence AI—is a type of artificial intelligence that would match or surpass human capabilities...

135 KB (14,800 words) - 17:53, 2 August 2025

Machine Intelligence Research Institute

from artificial general intelligence. MIRI's work has focused on a friendly AI approach to system design and on predicting the rate of technology development...

17 KB (1,160 words) - 23:07, 2 August 2025

Roman Yampolskiy (category AI safety scientists)

ISBN 978-1482234435 AI: Unexplainable, Unpredictable, Uncontrollable. Chapman & Hall/CRC Press, 2024, ISBN 978-1032576268 AI capability control AI-complete Machine...

11 KB (864 words) - 03:58, 29 May 2025

OpenAI

first DGX-1 supercomputer to OpenAI in August 2016 to help it train larger and more complex AI models with the capability of reducing processing time from...

150 KB (13,364 words) - 14:39, 3 August 2025

Anthropic (redirect from Anthropic AI)

intelligence (AI) startup company founded in 2021. Anthropic has developed a family of large language models (LLMs) named Claude as a competitor to OpenAI's ChatGPT...

39 KB (3,620 words) - 05:10, 2 August 2025

Mira Murati (category OpenAI people)

She launched an AI startup called Thinking Machines Lab in February 2025. She previously served as chief technology officer of OpenAI. Murati was born...

23 KB (1,798 words) - 02:33, 2 August 2025

Object-capability model

The object-capability model is a computer security model. A capability describes a transferable right to perform one (or more) operations on a given object...

8 KB (1,009 words) - 12:37, 12 June 2025

Shield AI

(including Shield AI) on its export control list, barring the export of dual-use commodities to that business. On March 12, 2025, Shield AI's then-current...

14 KB (1,469 words) - 10:57, 20 July 2025

Music and artificial intelligence (redirect from AI-generated music)

feature is the capability of an AI algorithm to learn based on past data, such as in computer accompaniment technology, wherein the AI is capable of listening...

59 KB (6,455 words) - 08:51, 23 July 2025

Alignment Research Center

former OpenAI researcher Paul Christiano, ARC focuses on recognizing and comprehending the potentially harmful capabilities of present-day AI models. ARC's...

8 KB (683 words) - 09:56, 20 July 2025

OpenAI Five

against and showing the capability to defeat professional teams. By choosing a game as complex as Dota 2 to study machine learning, OpenAI thought they could...

23 KB (2,279 words) - 20:58, 2 August 2025

Regulation of artificial intelligence (redirect from Regulation of AI)

numerous AI ethics guidelines have been published in order to maintain social control over the technology. Regulation is deemed necessary to both foster AI innovation...

127 KB (13,023 words) - 14:17, 3 August 2025

Grok (chatbot) (redirect from Grok AI)

Grok is a generative artificial intelligence chatbot developed by xAI. It was launched in November 2023 by Elon Musk as an initiative based on the large...

83 KB (8,060 words) - 12:42, 3 August 2025

Geoffrey Hinton (redirect from Godfather of AI)

AI in order to avoid the worst outcomes. After receiving the Nobel Prize, he called for urgent research into AI safety to figure out how to control AI...

67 KB (5,797 words) - 04:41, 29 July 2025

Thinking Machines Lab (category AI software)

an American artificial intelligence (AI) startup led by Mira Murati, the former chief technology officer of OpenAI. The company was founded in February...

9 KB (692 words) - 22:33, 2 August 2025

AI boom

The AI boom is an ongoing period of rapid progress in the field of artificial intelligence (AI) that started in the late 2010s before gaining international...

64 KB (5,464 words) - 21:18, 26 July 2025

Superintelligence (redirect from Superhuman AI)

proposed various approaches to mitigate risks associated with ASI: Capability control – Limiting an ASI's ability to influence the world, such as through...

43 KB (4,700 words) - 05:51, 31 July 2025

History of artificial intelligence (redirect from History of AI)

as little as 15 seconds of audio to reproduce a voice—a capability later corroborated by OpenAI in 2024. The service went viral on social media platforms...

172 KB (19,994 words) - 17:08, 22 July 2025

Applications of artificial intelligence (redirect from Applications of AI)

Artificial intelligence is the capability of computational systems to perform tasks typically associated with human intelligence, such as learning, reasoning...

194 KB (19,251 words) - 21:11, 2 August 2025