Show HN: AgentNexus – coordinate LLM agents by service boundary, not role
Article URL: https://github.com/dugubuyan/agent-nexus Comments URL: https://news.ycombinator.com/item?id=48516614 Points: 3 # Comments: 0
Read SourceStatement on the US government directive to suspend access to Fable 5 and Mythos 5
<p><strong><a href="https://www.anthropic.com/news/fable-mythos-access">Statement on the US government directive to suspend access to Fable 5 and Mythos 5</a></strong></p> Well this is <em>nuts</em>:</p> <blockquote> <p>The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, in
Read SourceOpenAI WebRTC Audio Session, now with document context
<p><strong><a href="https://tools.simonwillison.net/openai-webrtc">OpenAI WebRTC Audio Session, now with document context</a></strong></p> I built the first version of this tool <a href="https://simonwillison.net/2024/Dec/17/openai-webrtc/">in December 2024</a> to try out the then-new OpenAI WebRTC API for interacting with their realtime audio models.</p> <p>Last month OpenAI <a href="https://openai.com/index/advancing-voice-intellig
Read SourceShow HN: Rubric – test what your LLM agent did, not just what it said
Article URL: https://github.com/Kareem-Rashed/rubric-eval Comments URL: https://news.ycombinator.com/item?id=48509073 Points: 1 # Comments: 0
Read SourceImpactArbiter – A PyTorch autograd trap for LLM-generated vLLM/SGLang invariants
Article URL: https://github.com/msunda17/impactarbiter-cli Comments URL: https://news.ycombinator.com/item?id=48509033 Points: 1 # Comments: 0
Read SourceWhat Is an LLM Control Plane?
Article URL: https://blog.mozilla.ai/what-is-an-llm-control-plane/ Comments URL: https://news.ycombinator.com/item?id=48508913 Points: 3 # Comments: 0
Read SourceReasoning as Pattern Matching: Shared Mechanisms in Human and LLM Reasoning
Article URL: https://arxiv.org/abs/2606.13607 Comments URL: https://news.ycombinator.com/item?id=48508899 Points: 1 # Comments: 0
Read SourceShow HN: Cortex – Agent-native knowledge OS on Markdown (Karpathy's LLM Wiki)
Article URL: https://github.com/synpulse8-opensource/pulse8-ai-cortex-knowledge-vault Comments URL: https://news.ycombinator.com/item?id=48508143 Points: 2 # Comments: 0
Read SourceQuoting Andrew Singleton
<blockquote cite="https://www.mcsweeneys.net/articles/ai-economics-for-dummies"><p>Jenny owns a crematorium. John’s propane company gives her a $20 billion investment in return for 5 percent of her operation. Jenny throws $10 billion into the incinerator, then pays John $10 billion to buy propane to burn that money to ashes. John reports that his AI investments have generated $10 billion in revenue this quarter and that he owns 5 percent of a $100 billion business. A reporter from &l
Read SourceAsk HN: Favorite prompts for improving LLM output?
I use Claude Code a lot and GPT 5.5 as well, and find that they are simultaneously extremely useful and also fall into common poor-performance basins. For example, writing performance -- perhaps my biggest issue with them is writing style -- such as well documented stylistic tics (em-dash, it's not x it's y), less commented stylistic tics ("the honest framing", "the earned XYZ"), cryptic and coined jargon, overabbreviation and sentences replaces by arrow constructions (this -> that -> this other
Read SourceLLM suppliers should offer PR scoped ephemeral keys
LLMs make PRs cheaper to create than to review (many to one relationship).In my case, as a repo maintainer, I usually run my prompts and workflow to review a lot of the code that comes in, and I end up paying for every unsolicited PR.I was wondering: why don't LLM suppliers support ephemeral, PR-scoped usage keys funded by the contributor? Like a review bond.For example: usable only by the maintainer, only for this repo/PR, hard capped at a low $ amount or N tokens, with a short expiry. Enough t
Read SourceHow to Grep Like an LLM
Article URL: https://marcusmichaels.com/notes/grep-like-an-llm/ Comments URL: https://news.ycombinator.com/item?id=48502159 Points: 2 # Comments: 3
Read SourceNew OpenAI Academy courses for the next era of work
OpenAI introduces three Academy courses that help people build practical AI skills, create repeatable workflows, and apply agents in everyday work.
Read SourcePersonal Guardian Angel with LLM
Article URL: https://gwern.net/guardian-angel Comments URL: https://news.ycombinator.com/item?id=48501715 Points: 2 # Comments: 0
Read SourceAuto mode for pi.dev. An LLM reviews your coding agent's commands
Article URL: https://github.com/vinzenzu/pi-auto-reviewer Comments URL: https://news.ycombinator.com/item?id=48501272 Points: 1 # Comments: 2
Read SourceLLM for the ESP32-S3
Article URL: https://github.com/harmansingh4163-ai/ESP-32-s3-Story-maker-LLM Comments URL: https://news.ycombinator.com/item?id=48500424 Points: 1 # Comments: 0
Read SourceLLM podcast addressing AI genocide of humanity
Article URL: https://MachineDeposition.com Comments URL: https://news.ycombinator.com/item?id=48499288 Points: 1 # Comments: 1
Read SourceChaining LLM and web bugs to Admin
Article URL: https://blog.quarkslab.com/from-prompt-to-pwned-chaining-llm-and-web-bugs-to-admin.html Comments URL: https://news.ycombinator.com/item?id=48498943 Points: 1 # Comments: 0
Read SourceDon't let the LLM speak, just probe it
Article URL: https://blog.j11y.io/2026-06-10_hidden-state-probes/ Comments URL: https://news.ycombinator.com/item?id=48498283 Points: 41 # Comments: 3
Read SourceOur workplace LLM mass delusion
Article URL: https://blog.avas.space/llm-circus/ Comments URL: https://news.ycombinator.com/item?id=48498252 Points: 21 # Comments: 2
Read SourceHow Preply combines AI and human tutors to personalize learning
Preply uses OpenAI to launch AI-generated lesson summaries, providing personalised feedback and language learning exercises.
Read SourceClaude Fable is relentlessly proactive
<p>After two days of experience with <a href="https://simonwillison.net/2026/Jun/9/claude-fable-5/">Claude Fable 5</a> I think the best way to describe it is <strong>relentlessly proactive</strong>. It knows a whole lot of tricks and it will deploy pretty much any of them to get to its goal.</p> <p>I'll illustrate this with an example. I was hacking on <a href="https://agent.datasette.io/">Datasette Agent</a> today when I noticed a glitch: a
Read SourceTired of AI amnesia, I built a 3-Tier infinite memory LLM in 1 week
Article URL: https://dl-chat-49232436682.asia-northeast3.run.app/ Comments URL: https://news.ycombinator.com/item?id=48496730 Points: 2 # Comments: 1
Read SourceEvery LLM Tool Call Needs an Output Budget
Article URL: https://www.axamy.com/blog/tool-budget Comments URL: https://news.ycombinator.com/item?id=48496230 Points: 2 # Comments: 0
Read SourceSuperficial Beliefs in LLM Decision-Making
Article URL: https://arxiv.org/abs/2606.11016 Comments URL: https://news.ycombinator.com/item?id=48496180 Points: 3 # Comments: 0
Read SourceOur new community investments in Virginia support local jobs and expand energy affordability.
<img src="https://storage.googleapis.com/gweb-uniblog-publish-prod/images/VirginiaSocial.max-600x600.format-webp.webp">We’re helping build the state’s next-generation workforce and investing in energy programs.
Read Source
Ask HN: What's the best LLM model that on a 24 GB VRAM GPU?
What’s the best model right now that outperforms Qwopus3.6-27B-v2-MTP-GGUF 8-bit on a 24 GB VRAM GPU? Looking for real reviews. I found 4 bit not usable in production. Comments URL: https://news.ycombinator.com/item?id=48494377 Points: 3 # Comments: 3
Read Sourcedatasette 1.0a33
<p><strong>Release:</strong> <a href="https://github.com/simonw/datasette/releases/tag/1.0a33">datasette 1.0a33</a></p> <p>This alpha is a significant step on the road to a stable 1.0, finally extending the <code>?_extra=</code> pattern I introduced <a href="https://docs.datasette.io/en/1.0a3/changelog.html#a3-2023-08-09">in Datasette 1.0a3</a> to cover queries and rows in addition to tables. That pattern is also <a hre
Read Sourceasyncinject 0.7
<p><strong>Release:</strong> <a href="https://github.com/simonw/asyncinject/releases/tag/0.7">asyncinject 0.7</a></p> <p>I built this utility library to support an <code>asyncio</code> dependency injection pattern a few years ago. I was using it with Datasette and Claude Fable 5 spotted some bugs in the dependency which it then fixed for me. It's a very proactive model!</p> <p>Tags: <a href="https://si
Read SourceAnthropic Walks Back Policy That Could Have ‘Sabotaged’ AI Researchers Using Claude
<p><strong><a href="https://www.wired.com/story/anthropic-responds-to-backlash-on-claudes-secret-sabotage-on-ai-research/">Anthropic Walks Back Policy That Could Have ‘Sabotaged’ AI Researchers Using Claude</a></strong></p> Big scoop for Maxwell Zeff at Wired:</p> <blockquote> <p>“We’re changing Fable 5’s safeguards for frontier LLM development to make them visible.” Anthropic said in a statement to WIRED. “We made the wrong tradeoff and we a
Read SourceSupporting Europe’s work in ensuring a trustworthy AI ecosystem
OpenAI supports the EU Code of Practice on AI content transparency, advancing provenance standards and tools to help people understand AI-generated content.
Read SourceBBVA puts AI at the core of banking with OpenAI
Learn how BBVA scaled ChatGPT Enterprise to 100,000 employees and partnered with OpenAI to accelerate AI-powered banking transformation worldwide.
Read SourceOpenAI to acquire Ona
OpenAI plans to acquire Ona to expand Codex with secure, persistent cloud environments, enabling long-running AI agents across enterprise workflows.
Read SourceHow an astrophysicist uses Codex to help simulate black holes
Discover how astrophysicist Chi-kwan Chan uses Codex to build black hole simulations, helping scientists study extreme physics and test Einstein’s theory of general relativity.
Read Sourcedatasette-agent 0.2a0
<p><strong>Release:</strong> <a href="https://github.com/datasette/datasette-agent/releases/tag/0.2a0">datasette-agent 0.2a0</a></p> <p>Highlights from the release notes:</p> <blockquote> <ul> <li>Tools can now ask the user questions mid-execution. Tools that declare a <code>context</code> parameter receive a <code>ToolContext</code> object, and <code>await context.ask_user(...)</code> c
Read SourceDiffusionGemma
<p><strong><a href="https://blog.google/innovation-and-ai/technology/developers-tools/diffusion-gemma-faster-text-generation/">DiffusionGemma</a></strong></p> Last May Google briefly released an experimental Gemini Diffusion model. I <a href="https://simonwillison.net/2025/May/21/gemini-diffusion/">tried the preview at the time</a> and recorded it running at 857 tokens/second. It was an exciting model, but Google made no further announcements about
Read SourceAccess OpenAI models and Codex through your Oracle cloud commitment
Access OpenAI models and Codex through Oracle Cloud, using existing commitments to build and deploy AI with enterprise security and governance.
Read SourceQuoting Jeremy Howard
<blockquote cite="https://twitter.com/jeremyphoward/status/2064595816875217362"><p>Easy solution to slow down recursive AI self improvement:</p> <ul> <li>The lab with the top-ranked model must agree THEY must not use it for working on frontier AI</li> <li>But everyone else should have access to it.</li> </ul> <p>By definition, this means the frontier doesn't advance.</p> <p>It also has the critical benefit of avoiding a dang
Read SourcePRC-linked influence operations are targeting AI debates in the US
A new report from OpenAI details PRC-linked influence operations using AI to target U.S. tech debates, data center narratives, tariffs, and false claims about ChatGPT.
Read Source