Difference between revisions of "AI safety"

From GISAXS
(Key Concepts)
Line 7:
 
* [https://www.alignmentforum.org/w/mesa-optimization Mesa-optimization]
 
 
* [https://www.lesswrong.com/posts/N6vZEnCn6A95Xn39p/are-we-in-an-ai-overhang Overhang]
 
+ * [https://www.alignmentforum.org/posts/pdaGN6pQyQarFHXF4/reward-is-not-the-optimization-target Reward is not the optimization target] (Alex Turner)
  
 
==Medium-term Risks==
 

Revision as of 13:54, 14 February 2025

Description of Safety Concerns

Key Concepts

Medium-term Risks

Long-term (x-risk)

Learning Resources

Research