Tag: AI
-
Testing AI in the open world, not just on benchmarks (normaltech.ai)AI · · May 17, 2026
-
Ai2's open robotics model beats a proprietary baseline (allenai.org)AI · · May 17, 2026
-
arXiv will ban authors for a year if they let AI do all the work (techcrunch.com)Research · · May 16, 2026
-
How you pick benchmarks decides whether open models are far behind (interconnects.ai)AI · · May 16, 2026
-
Engineering · · May 14, 2026
-
Engineering · · May 13, 2026
-
Security · · May 12, 2026
-
Why China's open-model lead is about process, not the models (interconnects.ai)AI · · May 12, 2026
-
A model says 13 percent automation is enough to tip growth into the explosive zone (importai.substack.com)AI · · May 11, 2026
-
GitLab restructures around the agent thesis, and Simon Willison checks the incentive (simonwillison.net)AI · · May 11, 2026
-
Ai2's EMO trains a mixture of experts you can run at one-eighth size (huggingface.co)AI · · May 8, 2026
-
AI · · May 7, 2026
-
Mozilla says it fixed 423 Firefox security bugs in one month with AI help (simonwillison.net)Security · · May 7, 2026
-
Notes from inside China's AI labs: less ego, more students, build don't buy (interconnects.ai)AI · · May 7, 2026
-
Infrastructure · · May 6, 2026
-
Jack Clark puts a number on AI that trains its own successor (importai.substack.com)AI · · May 4, 2026
-
Calling it a distillation attack blurs a normal technique with API abuse (interconnects.ai)AI · · May 4, 2026
-
AI · · April 29, 2026
-
AI · · April 24, 2026
-
Ethan Mollick on GPT-5.5: strong where work is verifiable, weak where taste is the point (oneusefulthing.org)AI · · April 23, 2026
-
Anthropic's AI alignment researchers closed most of the human gap in five days (importai.substack.com)AI · · April 20, 2026
-
One benchmark number hides which jobs a model is actually good at (interconnects.ai)AI · · April 20, 2026
-
AI · · April 16, 2026
-
Nathan Lambert bets open models win on economics, not benchmarks (interconnects.ai)AI · · April 15, 2026
-
Jack Clark sorts attacks on AI agents into six kinds (importai.substack.com)AI · · April 13, 2026
-
AI · · April 2, 2026
-
The chatbot window is the bottleneck, not the model (oneusefulthing.org)AI · · March 31, 2026
-
DeepMind built a way to measure when AI manipulates people (deepmind.google)AI · · March 26, 2026
-
Nathan Lambert's counter-take: self-improvement will be lossy, not explosive (interconnects.ai)AI · · March 22, 2026
-
Anthropic asked 81,000 people what they want from AI (anthropic.com)AI · · March 18, 2026
-
AI · · March 17, 2026
-
Ethan Mollick: the job is shifting from talking to AI to managing it (oneusefulthing.org)AI · · March 12, 2026
-
Demis Hassabis on what AlphaGo's Move 37 set in motion, ten years on (deepmind.google)AI · · March 10, 2026
-
Are AI Datacenters Raising Your Electric Bill? (newsletter.semianalysis.com)AI · · March 3, 2026
-
Import AI 445: Bostrom on when to race, and a math benchmark AI can't beat yet (importai.substack.com)AI · · February 16, 2026
-
Gemini Deep Think solved open math problems and got a paper into ICLR (deepmind.google)AI · · February 11, 2026
-
Anthropic commits to keeping Claude permanently ad-free (anthropic.com)AI · · February 4, 2026
-
AI · · January 29, 2026
-
Ethan Mollick: Claude Code is a preview of agentic work everywhere (oneusefulthing.org)AI · · January 7, 2026
-
Code Execution Cuts MCP Agent Token Costs (anthropic.com)AI · · November 4, 2025
-
What Anthropic Learned Building a Multi-Agent Researcher (anthropic.com)AI · · June 13, 2025
-
Qwen3 Puts Reasoning on a Switch (qwenlm.github.io)AI · · April 29, 2025
-
AI as Normal Technology (normaltech.ai)AI · · April 15, 2025
-
OLMo 2 32B: Fully Open Catches Up to Closed (allenai.org)AI · · March 13, 2025
-
DeepSeek-R1: An Open Model Matches a Closed Reasoner (huggingface.co)AI · · January 20, 2025
-
Reward Hacking: Why Better Models Game You More (lilianweng.github.io)AI · · November 28, 2024
-
Tülu 3 Opens Up the Post-Training Recipe (allenai.org)AI · · November 21, 2024
-
OpenAI o1 and the Start of Test-Time Reasoning (openai.com)AI · · September 12, 2024
-
Can AI Scaling Continue Through 2030? (epoch.ai)AI · · August 20, 2024
-
A Build Order for a Production GenAI Platform (huyenchip.com)AI · · July 25, 2024
-
Llama 3.1 405B and the License That Mattered (ai.meta.com)AI · · July 23, 2024
-
Situational Awareness: One Insider's Case for Fast AGI (situational-awareness.ai)AI · · June 4, 2024
-
The OpenAI Board Fight, Reconstructed (thezvi.substack.com)AI · · November 22, 2023
-
Crash Testing GPT-4: The First Dangerous-Capability Eval (asteriskmag.com)AI · · June 1, 2023
-
A Field Guide to the AI Safety Camps (asteriskmag.com)AI · · June 1, 2023
-
GPT-4: The Reference Model, and What It Withheld (openai.com)AI · · March 14, 2023
-
The Original LLaMA Release and What It Set Off (ai.meta.com)AI · · February 24, 2023
-
The Bitter Lesson: Why General Methods Win (incompleteideas.net)AI · · March 13, 2019