• intelligence (AI) design, AI capability control proposals, also referred to as AI confinement, aim to increase our ability to monitor and control the behavior...
    24 KB (3,055 words) - 13:03, 14 April 2024
  • monitoring, and capability control. Research challenges in alignment include instilling complex values in AI, developing honest AI, scalable oversight...
    119 KB (11,653 words) - 02:50, 7 June 2024
  • Thumbnail for AI takeover
    robots effectively take control of the planet away from the human species, which relies on human intelligence. Stories of AI takeovers remain popular...
    39 KB (4,272 words) - 03:39, 23 May 2024
  • monitoring, and capability control. Research challenges in alignment include instilling complex values in AI, developing honest AI, scalable oversight...
    80 KB (9,538 words) - 20:10, 2 June 2024
  • progress in AI capability is inevitable because of economic pressures. Such pressures can already be seen in the development of existing AI technologies...
    12 KB (1,133 words) - 03:28, 14 May 2024
  • greater chance that our inability to control AI will cause an existential catastrophe. In 2023, hundreds of AI experts and other notable figures signed...
    120 KB (12,508 words) - 19:34, 6 June 2024
  • exercise caution in dealing with AI, stating "that's too dangerous. You can't break things when you are talking about AI". In a similar vein, Ellen Huet...
    18 KB (1,586 words) - 11:23, 5 June 2024
  • an existential catastrophe, it is necessary to successfully solve the "AI control problem" for the first superintelligence. The solution might involve instilling...
    13 KB (1,270 words) - 22:01, 12 September 2023
  • Thumbnail for Artificial Intelligence Act
    regulated. For general-purpose AI, transparency requirements are imposed, with additional evaluations for high-capability models. There are different risk...
    32 KB (3,021 words) - 06:36, 31 May 2024
  • Thumbnail for Technology
    against the hypothetical risk of an AI takeover, and have advocated for the use of AI capability control in addition to AI alignment methods. Other fields...
    106 KB (10,282 words) - 03:36, 4 June 2024