intelligence (AI) design, AI capability control proposals, also referred to as AI confinement, aim to increase our ability to monitor and control the behavior...
24 KB (3,055 words) - 13:03, 14 April 2024
monitoring, and capability control. Research challenges in alignment include instilling complex values in AI, developing honest AI, scalable oversight...
119 KB (11,653 words) - 02:50, 7 June 2024
robots effectively take control of the planet away from the human species, which relies on human intelligence. Stories of AI takeovers remain popular...
39 KB (4,272 words) - 03:39, 23 May 2024
monitoring, and capability control. Research challenges in alignment include instilling complex values in AI, developing honest AI, scalable oversight...
80 KB (9,538 words) - 20:10, 2 June 2024
Human Compatible (redirect from Human compatible AI and the problem of control)
progress in AI capability is inevitable because of economic pressures. Such pressures can already be seen in the development of existing AI technologies...
12 KB (1,133 words) - 03:28, 14 May 2024
greater chance that our inability to control AI will cause an existential catastrophe. In 2023, hundreds of AI experts and other notable figures signed...
120 KB (12,508 words) - 19:34, 6 June 2024
exercise caution in dealing with AI, stating "that's too dangerous. You can't break things when you are talking about AI". In a similar vein, Ellen Huet...
18 KB (1,586 words) - 11:23, 5 June 2024
an existential catastrophe, it is necessary to successfully solve the "AI control problem" for the first superintelligence. The solution might involve instilling...
13 KB (1,270 words) - 22:01, 12 September 2023
Artificial Intelligence Act (redirect from General-purpose AI)
regulated. For general-purpose AI, transparency requirements are imposed, with additional evaluations for high-capability models. There are different risk...
32 KB (3,021 words) - 06:36, 31 May 2024
against the hypothetical risk of an AI takeover, and have advocated for the use of AI capability control in addition to AI alignment methods. Other fields...
106 KB (10,282 words) - 03:36, 4 June 2024