THE FUTURE IS HERE

New OpenAI Model 'Imminent' and AI Stakes Get Raised (plus Med Gemini, GPT 2 Chatbot and Scale AI)

Altman ‘knows the release date’, Politico calls it ‘imminent’ according to Insiders, and then the mystery GPT-2 chatbot [made by the phi team at Microsoft] causes mass confusion and hysteria. I break it all down and cover two papers – MedGemini and Scale AI Contamination – released in the last 24 hours. I’ve read them in full and they might be more important than all the rest. Let’s hope life wins over death in the deployment of AI.

AI Insiders: https://www.patreon.com/AIExplained

Politico Article: https://www.politico.eu/article/rishi-sunak-ai-testing-tech-ai-safety-institute/
Sam Altman Talk: https://www.youtube.com/watch?v=GLKoDkbS1Cg
MIT Interview: https://www.technologyreview.com/2024/05/01/1091979/sam-altman-says-helpful-agents-are-poised-to-become-ais-killer-function/
Logan Kilpatrick Tweet: https://twitter.com/OfficialLoganK/status/1785834464804794820
Bubeck Response: https://twitter.com/SebastienBubeck/status/1785888787484291440
GPT2: https://twitter.com/sama/status/1785107943664566556
Where it used to be hosted: https://arena.lmsys.org/
Unicorns?; https://twitter.com/phill__1/status/1784969111430103494
No Unicorns: https://twitter.com/suchenzang/status/1785159370512421201
GPT2 chatbot logic fail: https://twitter.com/VictorTaelin/status/1785367736157175859
And language fails: https://twitter.com/gblazex/status/1785101624475537813
James Betker Blog: https://nonint.com/2023/06/10/the-it-in-ai-models-is-the-dataset/
Scale AI Benchmark Paper: https://arxiv.org/pdf/2405.00332
Dwarkesh Zuckerberg Interview: https://www.youtube.com/watch?v=bc6uFV9CJGg
Lavander Misuse: https://www.972mag.com/lavender-ai-israeli-army-gaza/
Autonomous Tank: https://www.techspot.com/news/102769-darpa-unleashes-20-foot-autonomous-robo-tank-glowing.html
Claude 3 GPQA: https://www.anthropic.com/news/claude-3-family
Med Gemini: https://arxiv.org/pdf/2404.18416
Medical Mistakes: https://www.cnbc.com/2018/02/22/medical-errors-third-leading-cause-of-death-in-america.html
MedPrompt Microsoft: https://www.microsoft.com/en-us/research/blog/the-power-of-prompting/
My Benchmark Flaws Tweet: https://twitter.com/AIExplainedYT/status/1782716249639670000
My Stargate Video: https://www.youtube.com/watch?v=KXG2f-So9oo
My GPT-5 Video: https://www.youtube.com/watch?v=Zc03IYnnuIA

Non-hype Newsletter: https://signaltonoise.beehiiv.com/

AI Insiders: https://www.patreon.com/AIExplained