A report on the benefits of AI was reportedly full of AI hallucinations
KPMG published a paper about the benefits of AI last year. An investigation found that it was full of AI hallucinations.
Read Source
Show HN: AgentNexus – coordinate LLM agents by service boundary, not role
Article URL: https://github.com/dugubuyan/agent-nexus Comments URL: https://news.ycombinator.com/item?id=48516614 Points: 3 # Comments: 0
Read SourceThe Week That Was
Your weekly summary of everything on the site.
Read SourceVisa is handling AI-prompted transactions for OpenAI - but can you trust it?
A new partnership between Visa and OpenAI takes the next step in AI-led purchasing. Here's what an expert wants you to know.
Read SourcePutting no effort into your dating app profile is unattractive
Why you should always put effort into writing your dating app bio. Do not leave your Tinder bio or Hinge prompts blank.
Read Source
OpenAI is facing investigation from a group of state attorneys general
A coalition of state attorneys general is asking OpenAI for documents about its activities.
Read Source
Anthropic blocks all customers' access to Fable 5 and Mythos 5
Anthropic has suspended all access to its new AI models Fable 5 and Mythos 5 to comply with a government order citing national security concerns.
Read Source
Andrew Yang thinks the next big startup opportunity is lowering the cost of living
Andrew Yang made a list of everything Americans overpay for — housing, food, wireless — and thinks the next startup gold rush is giving that money back.
Read SourceAnthropic’s safety warnings may have just backfired — the government has pulled the plug on its most powerful AI
Anthropic isn't hiding its frustration. "We disagree that the finding of a narrow potential jailbreak should be cause for recalling a commercial model deployed to hundreds of millions of people," the company wrote in a blog post.
Read SourceStatement on the US government directive to suspend access to Fable 5 and Mythos 5
<p><strong><a href="https://www.anthropic.com/news/fable-mythos-access">Statement on the US government directive to suspend access to Fable 5 and Mythos 5</a></strong></p> Well this is <em>nuts</em>:</p> <blockquote> <p>The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, in
Read SourceOpenAI WebRTC Audio Session, now with document context
<p><strong><a href="https://tools.simonwillison.net/openai-webrtc">OpenAI WebRTC Audio Session, now with document context</a></strong></p> I built the first version of this tool <a href="https://simonwillison.net/2024/Dec/17/openai-webrtc/">in December 2024</a> to try out the then-new OpenAI WebRTC API for interacting with their realtime audio models.</p> <p>Last month OpenAI <a href="https://openai.com/index/advancing-voice-intellig
Read SourceSpaceX IPO: Live updates on everything you need to know
TechCrunch has followed SpaceX's start, struggles, and successes from the early days. And we're here for what happens next too. This package of SpaceX IPO coverage includes who stands to win (and maybe some who won't), pre-IPO deals, and what's tucked inside its S-1 registration document.
Read SourceMeta’s months-old AI unit is a soul-crushing gulag, say the engineers stuck inside it
A new report suggests the unit, which employs 6,500 people, is on the verge of revolt.
Read SourceNVIDIA Blackwell Leads on First Agentic AI Infrastructure Benchmark
AgentPerf from Artificial Analysis, the industry’s first agentic AI benchmark, gives developers, enterprises and infrastructure providers a clear way to compare systems for agentic AI. In the first round of published results, the NVIDIA Blackwell Ultra NVL72 platform delivers leading performance across the agentic AI workloads tested, running 20x more agents per megawatt than NVIDIA […]
Read Source
Chinese cybercrime operation that used AI to scam ‘hundreds of thousands of victims’ sued by Google
The tech giant said a group called "Outsider Enterprise" used AI to scam hundreds of thousands of victims, sending 2.5 million text messages over a span of two weeks.
Read SourceIre identifies another LOTUSLITE specimen
Project Ire examined a timely malware sample and determined its intent through reverse engineering—identifying LOTUSLITE characteristics even as most major EDR tools did not detect it. The post Ire identifies another LOTUSLITE specimen appeared first on Microsoft Research.
Read Source
Show HN: Rubric – test what your LLM agent did, not just what it said
Article URL: https://github.com/Kareem-Rashed/rubric-eval Comments URL: https://news.ycombinator.com/item?id=48509073 Points: 1 # Comments: 0
Read SourceImpactArbiter – A PyTorch autograd trap for LLM-generated vLLM/SGLang invariants
Article URL: https://github.com/msunda17/impactarbiter-cli Comments URL: https://news.ycombinator.com/item?id=48509033 Points: 1 # Comments: 0
Read SourceWhat Is an LLM Control Plane?
Article URL: https://blog.mozilla.ai/what-is-an-llm-control-plane/ Comments URL: https://news.ycombinator.com/item?id=48508913 Points: 3 # Comments: 0
Read SourceReasoning as Pattern Matching: Shared Mechanisms in Human and LLM Reasoning
Article URL: https://arxiv.org/abs/2606.13607 Comments URL: https://news.ycombinator.com/item?id=48508899 Points: 1 # Comments: 0
Read SourceShow HN: Cortex – Agent-native knowledge OS on Markdown (Karpathy's LLM Wiki)
Article URL: https://github.com/synpulse8-opensource/pulse8-ai-cortex-knowledge-vault Comments URL: https://news.ycombinator.com/item?id=48508143 Points: 2 # Comments: 0
Read SourceClaude Fable 5 and Mythos 5: The System Card
First things first: Claude Fable 5 is the new best publicly available model.
Read Source
Quoting Andrew Singleton
<blockquote cite="https://www.mcsweeneys.net/articles/ai-economics-for-dummies"><p>Jenny owns a crematorium. John’s propane company gives her a $20 billion investment in return for 5 percent of her operation. Jenny throws $10 billion into the incinerator, then pays John $10 billion to buy propane to burn that money to ashes. John reports that his AI investments have generated $10 billion in revenue this quarter and that he owns 5 percent of a $100 billion business. A reporter from &l
Read SourceResearch into how AI can help users understand skin conditions
Health & Bioscience
Read Source
Mistral is rumored to be raising €3B at €20B valuation
The funding round would value the company at around €20 billion (about $23.15 billion), nearly double its Series C valuation of €11.7 billion.
Read SourceA low-carbon computing platform from your retired phones
Climate & Sustainability
Read Source
Claude Fable 5 secretly throttled AI researchers, and the internet went wild
Claude Fable 5 gave users access to Mythos-class power, but its hidden safeguards turned a safety feature into a trust problem for Anthropic.
Read SourceUndersea Cables and the Material Politics of Digital Connectivity
A review of Samanth Subramanian, &ldquo;The Web Beneath the Waves: The Fragile Cables That Connect Our World&rdquo; (Columbia Global Reports, 2025).
Read SourceRational Security: The “Forbidden Fruit” Edition
Scott Anderson, Benjamin Wittes, Michael Feinberg, and Molly Roberts talked through the week&rsquo;s big news in national security.
Read SourceSpaceX, Anthropic, and OpenAI’s hot IPO summer
The IPO market is back, and it’s not the same companies leading the charge. FAANG had a good run, but a new acronym is taking over: MANGOS — Meta (or Microsoft, depending on who you ask), Anthropic, Nvidia, Google, OpenAI, and SpaceX. Half of that bunch is heading to public markets in the same window, and it’s a stress test for investors, for valuations, and for […]
Read Source40% of enterprises will scrap AI agents - 3 ways to ensure yours don't fail
How do you create real ROI from autonomous AI? Three digital leaders share lessons they've learned in the field.
Read SourceSiri AI is not into you, Apples Craig Federighi says
Siri is not for companionship and Apple says it designed the AI assistant that way.
Read Source
Ask HN: Favorite prompts for improving LLM output?
I use Claude Code a lot and GPT 5.5 as well, and find that they are simultaneously extremely useful and also fall into common poor-performance basins. For example, writing performance -- perhaps my biggest issue with them is writing style -- such as well documented stylistic tics (em-dash, it's not x it's y), less commented stylistic tics ("the honest framing", "the earned XYZ"), cryptic and coined jargon, overabbreviation and sentences replaces by arrow constructions (this -> that -> this other
Read SourceIt’s hot IPO summer, and the MANGOS are ripe
The IPO market is back, and it’s not the same companies leading the charge. FAANG had a good run, but a new acronym is taking over: MANGOS — Meta (or Microsoft, depending on who you ask), Anthropic, Nvidia, Google, OpenAI, and SpaceX. Half of that bunch is heading to public markets in the same window, and it’s a stress test for investors, for valuations, and for […]
Read SourceFuture of Privacy Forum Announces 2026 Career Achievement Award Recipients
WASHINGTON, D.C. — The Future of Privacy Forum, a global non-profit focused on data protection, AI, and emerging technologies, announced new recipients of its Career Achievement Award, recognizing exceptional leaders whose work has advanced privacy, responsible data governance, and AI leadership worldwide. The 2026 recipients of the FPF Career Achievement Award are:  Alan Raul, FPF’s […]
Read Source
Should you buy a robot vacuum before Prime Day? I test them for a living and found some early deals to eye.
Some of my top robot vacuum recommendations are on sale two weeks before Prime Day, like the Eufy C28 and Shark UV Reveal 2-in-1.
Read Source
AI Regulation and the Looming Problem of the Takings Clause
Regulations that force developers to disclose trade secrets to the public could violate the Constitution. How can regulators respond?
Read SourceSpotify adds editor videos to New Music Friday. We asked its curators why.
As algorithms increasingly shape how people discover music, Spotify is giving its editors a larger role in explaining the artists, trends, and stories behind each week's biggest releases.
Read Source

