Provably safe AGI, with Steve Omohundro
London Futurists London Futurists
4.92K subscribers
114 views
2

 Published On Feb 13, 2024

AI systems have become more powerful in the last few years, and are expected to become even more powerful in the years ahead. The question naturally arises: what, if anything, should humanity be doing to increase the likelihood that these forthcoming powerful systems will be safe, rather than destructive?

Our guest in this episode has a long and distinguished history of analysing that question, and he has some new proposals to share with us. He is Steve Omohundro, the CEO of Beneficial AI Research, an organisation which is working to ensure that artificial intelligence is safe and beneficial for humanity.

Steve has degrees in Physics and Mathematics from Stanford and a Ph.D. in Physics from U.C. Berkeley. He went on to be an award-winning computer science professor at the University of Illinois. At that time, he developed the notion of basic AI drives, which we talk about shortly, as well as a number of potential key AI safety mechanisms.

Among many other roles which are too numerous to mention here, Steve served as a Research Scientist at Meta, the parent company of Facebook, where he worked on generative models and AI-based simulation, and he is an advisor to MIRI, the Machine Intelligence Research Institute.

Selected follow-ups:
Steve Omohundro: Innovative ideas for a better world (https://steveomohundro.com/)
Metaculus forecast for the date of weak AGI (https://www.metaculus.com/questions/3...)
"The Basic AI Drives" (PDF, 2008) (https://selfawaresystems.files.wordpr...)
TED Talk by Max Tegmark: How to Keep AI Under Control (   • How to Keep AI Under Control | Max Te...  )
Apple Secure Enclave (https://support.apple.com/en-gb/guide...)
Meta Research: Teaching AI advanced mathematical reasoning (https://ai.meta.com/blog/ai-math-theo...)
DeepMind AlphaGeometry (https://deepmind.google/discover/blog...)
Microsoft Lean theorem prover (https://www.microsoft.com/en-us/resea...)
Terence Tao (Wikipedia) (https://en.wikipedia.org/wiki/Terence...)
NeurIPS Tutorial on Machine Learning for Theorem Proving (2023) (https://machine-learning-for-theorem-...)
The team at MIRI (https://intelligence.org/team/)

Music: Spike Protein, by Koi Discovery, available under CC0 1.0 Public Domain Declaration

show more

Share/Embed