<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[ToxSec - AI and Cybersecurity : Premium]]></title><description><![CDATA[ToxSec shows you how you'd get owned. The premium articles you how to make sure you don't. Every article is a full attack chain followed by the exact countermeasures. Specific configs, real CVEs, checklist you can actually deploy. 
We ride the knife's edge between red team and responsible disclosure.]]></description><link>https://www.toxsec.com/s/premium</link><image><url>https://substackcdn.com/image/fetch/$s_!knHk!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb28d90f-ea4c-44fc-80b5-d73e8347f8d2_1024x1024.png</url><title>ToxSec - AI and Cybersecurity : Premium</title><link>https://www.toxsec.com/s/premium</link></image><generator>Substack</generator><lastBuildDate>Tue, 26 May 2026 07:49:19 GMT</lastBuildDate><atom:link href="https://www.toxsec.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Christopher Ijams]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[toxsec@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[toxsec@substack.com]]></itunes:email><itunes:name><![CDATA[ToxSec]]></itunes:name></itunes:owner><itunes:author><![CDATA[ToxSec]]></itunes:author><googleplay:owner><![CDATA[toxsec@substack.com]]></googleplay:owner><googleplay:email><![CDATA[toxsec@substack.com]]></googleplay:email><googleplay:author><![CDATA[ToxSec]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[How to Threat Model AI Applications With STRIDE]]></title><description><![CDATA[AI-STRIDE maps six classic threat categories to LLM pipelines, agent tools, and training data. Here&#8217;s the walkthrough.]]></description><link>https://www.toxsec.com/p/how-to-threat-model-ai-applications</link><guid isPermaLink="false">https://www.toxsec.com/p/how-to-threat-model-ai-applications</guid><dc:creator><![CDATA[ToxSec]]></dc:creator><pubDate>Fri, 22 May 2026 13:31:34 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!1xqF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa487b4ba-59ec-4449-97b7-2d300f33b7a3_2752x1536.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1xqF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa487b4ba-59ec-4449-97b7-2d300f33b7a3_2752x1536.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1xqF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa487b4ba-59ec-4449-97b7-2d300f33b7a3_2752x1536.png 424w, https://substackcdn.com/image/fetch/$s_!1xqF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa487b4ba-59ec-4449-97b7-2d300f33b7a3_2752x1536.png 848w, https://substackcdn.com/image/fetch/$s_!1xqF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa487b4ba-59ec-4449-97b7-2d300f33b7a3_2752x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!1xqF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa487b4ba-59ec-4449-97b7-2d300f33b7a3_2752x1536.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1xqF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa487b4ba-59ec-4449-97b7-2d300f33b7a3_2752x1536.png" width="2752" height="1536" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a487b4ba-59ec-4449-97b7-2d300f33b7a3_2752x1536.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1536,&quot;width&quot;:2752,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:8533746,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/193725871?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffa8fe04f-f704-4dd6-9029-3618be6d4f7a_2752x1536.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!1xqF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa487b4ba-59ec-4449-97b7-2d300f33b7a3_2752x1536.png 424w, https://substackcdn.com/image/fetch/$s_!1xqF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa487b4ba-59ec-4449-97b7-2d300f33b7a3_2752x1536.png 848w, https://substackcdn.com/image/fetch/$s_!1xqF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa487b4ba-59ec-4449-97b7-2d300f33b7a3_2752x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!1xqF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa487b4ba-59ec-4449-97b7-2d300f33b7a3_2752x1536.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>TL;DR:</strong> STRIDE was built for traditional software. AI systems break its assumptions in six places at once. STRIDE-AI remaps the six threat categories to ML assets, prompt pipelines, agent tool chains, and training data. This walkthrough shows you how to run a threat model on an AI application, what to ask at each STRIDE category, and where the classic framework needs AI-specific extensions like MAESTRO and ASTRIDE. If you&#8217;re shipping AI and skipping the threat model, you&#8217;re shipping blind.</p><blockquote><p>This is the public feed. Upgrade to see what doesn&#8217;t make it out.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.toxsec.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.toxsec.com/subscribe?"><span>Subscribe now</span></a></p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!DQd-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ab2715-4956-4ea3-ae3f-0b91a3a64458_1457x712.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!DQd-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ab2715-4956-4ea3-ae3f-0b91a3a64458_1457x712.png 424w, https://substackcdn.com/image/fetch/$s_!DQd-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ab2715-4956-4ea3-ae3f-0b91a3a64458_1457x712.png 848w, https://substackcdn.com/image/fetch/$s_!DQd-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ab2715-4956-4ea3-ae3f-0b91a3a64458_1457x712.png 1272w, https://substackcdn.com/image/fetch/$s_!DQd-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ab2715-4956-4ea3-ae3f-0b91a3a64458_1457x712.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DQd-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ab2715-4956-4ea3-ae3f-0b91a3a64458_1457x712.png" width="1456" height="712" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d7ab2715-4956-4ea3-ae3f-0b91a3a64458_1457x712.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:712,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:135122,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/193725871?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ab2715-4956-4ea3-ae3f-0b91a3a64458_1457x712.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!DQd-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ab2715-4956-4ea3-ae3f-0b91a3a64458_1457x712.png 424w, https://substackcdn.com/image/fetch/$s_!DQd-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ab2715-4956-4ea3-ae3f-0b91a3a64458_1457x712.png 848w, https://substackcdn.com/image/fetch/$s_!DQd-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ab2715-4956-4ea3-ae3f-0b91a3a64458_1457x712.png 1272w, https://substackcdn.com/image/fetch/$s_!DQd-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ab2715-4956-4ea3-ae3f-0b91a3a64458_1457x712.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>What Is STRIDE and Why Does AI Break It?</h2><p>Microsoft built STRIDE in the late 1990s to give developers a thinking framework during software design. Six categories, one mnemonic: Spoofing, Tampering, Repudiation, Information Disclosure, Denial of Service, Elevation of Privilege. You draw a data flow diagram, walk each component through the six questions, and document what can go wrong. Millions of threat models have been run this way. The framework works because traditional software is deterministic. Same input, same output. Clear trust boundaries between user and system.</p><p>AI applications violate every one of those assumptions. Same prompt, different output across runs. The model processes developer instructions and attacker payloads through the same attention pipeline with zero privilege separation. Training data, retrieval documents, tool descriptions, and user messages all land in the same context window. There&#8217;s no kernel mode. No ring separation. STRIDE still applies, but each category needs new threat examples, new questions, and new assets. That&#8217;s what STRIDE-AI gives you.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jSlz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f1dbcec-4a68-4e16-b82a-f3e0bfdfa33b_2642x1948.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jSlz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f1dbcec-4a68-4e16-b82a-f3e0bfdfa33b_2642x1948.png 424w, https://substackcdn.com/image/fetch/$s_!jSlz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f1dbcec-4a68-4e16-b82a-f3e0bfdfa33b_2642x1948.png 848w, https://substackcdn.com/image/fetch/$s_!jSlz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f1dbcec-4a68-4e16-b82a-f3e0bfdfa33b_2642x1948.png 1272w, https://substackcdn.com/image/fetch/$s_!jSlz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f1dbcec-4a68-4e16-b82a-f3e0bfdfa33b_2642x1948.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jSlz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f1dbcec-4a68-4e16-b82a-f3e0bfdfa33b_2642x1948.png" width="1456" height="1074" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2f1dbcec-4a68-4e16-b82a-f3e0bfdfa33b_2642x1948.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1074,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:354003,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/193725871?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f1dbcec-4a68-4e16-b82a-f3e0bfdfa33b_2642x1948.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jSlz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f1dbcec-4a68-4e16-b82a-f3e0bfdfa33b_2642x1948.png 424w, https://substackcdn.com/image/fetch/$s_!jSlz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f1dbcec-4a68-4e16-b82a-f3e0bfdfa33b_2642x1948.png 848w, https://substackcdn.com/image/fetch/$s_!jSlz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f1dbcec-4a68-4e16-b82a-f3e0bfdfa33b_2642x1948.png 1272w, https://substackcdn.com/image/fetch/$s_!jSlz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f1dbcec-4a68-4e16-b82a-f3e0bfdfa33b_2642x1948.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>How Spoofing Hits AI Systems</h2><p>In traditional apps, spoofing means one entity pretends to be another. Fake login, stolen session cookie, forged certificate. In AI systems, the attack surface expands in two directions.</p><p>First, model-level spoofing. An attacker serves a trojaned model that mimics a legitimate one. You pull what looks like Llama-3 from a community hub, but the weights contain a backdoor triggered by a specific phrase. The model passes your eval benchmarks. It even passes your red team runs. The payload fires only on the trigger. Model provenance, cryptographic signing of weights, and hash verification are the controls.</p><p>Second, agent identity spoofing. In multi-agent architectures where AI agents communicate and delegate tasks, one agent can impersonate another. Documented black markets show this at scale: AI agents trading credentials and weaponized skills with no human verification in the loop. If your agent trusts another agent&#8217;s claimed identity without cryptographic proof, you have a spoofing problem STRIDE was never designed to catch.</p><p><strong>Questions to ask:</strong> Who proves the model is what it claims to be? How do agents verify each other&#8217;s identity in multi-agent workflows? Can an attacker substitute a model at any point in the supply chain?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!aOfo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ffd56-7b9f-48f0-bb68-a9fee941dd92_2981x1206.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!aOfo!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ffd56-7b9f-48f0-bb68-a9fee941dd92_2981x1206.png 424w, https://substackcdn.com/image/fetch/$s_!aOfo!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ffd56-7b9f-48f0-bb68-a9fee941dd92_2981x1206.png 848w, https://substackcdn.com/image/fetch/$s_!aOfo!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ffd56-7b9f-48f0-bb68-a9fee941dd92_2981x1206.png 1272w, https://substackcdn.com/image/fetch/$s_!aOfo!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ffd56-7b9f-48f0-bb68-a9fee941dd92_2981x1206.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!aOfo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ffd56-7b9f-48f0-bb68-a9fee941dd92_2981x1206.png" width="1456" height="589" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7b7ffd56-7b9f-48f0-bb68-a9fee941dd92_2981x1206.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:589,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:301207,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/193725871?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ffd56-7b9f-48f0-bb68-a9fee941dd92_2981x1206.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!aOfo!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ffd56-7b9f-48f0-bb68-a9fee941dd92_2981x1206.png 424w, https://substackcdn.com/image/fetch/$s_!aOfo!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ffd56-7b9f-48f0-bb68-a9fee941dd92_2981x1206.png 848w, https://substackcdn.com/image/fetch/$s_!aOfo!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ffd56-7b9f-48f0-bb68-a9fee941dd92_2981x1206.png 1272w, https://substackcdn.com/image/fetch/$s_!aOfo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ffd56-7b9f-48f0-bb68-a9fee941dd92_2981x1206.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>How Tampering Targets the AI Pipeline</h2><p>Traditional tampering modifies data at rest or in transit. Database row gets changed. Config file gets swapped. In AI, tampering hits three distinct asset classes.</p><p>Training data poisoning is the big one. An attacker injects crafted samples into your training set, and the model learns the malicious pattern as ground truth. This can happen through contaminated public datasets, scraped web content, or compromised third-party data providers. The model ships with the backdoor baked in. No runtime exploit needed.</p><p>Prompt injection is tampering at inference time. The attacker modifies the instructions the model follows by injecting payloads into user input, retrieved documents, or tool descriptions. <a href="https://genai.owasp.org/llmrisk/llm01-prompt-injection/">OWASP ranks this LLM01:2025</a> for the second consecutive year. The model can&#8217;t distinguish developer instructions from attacker instructions because both arrive as tokens processed by the same attention mechanism. And it gets worse when the payload arrives in an image or audio file, since <a href="https://www.toxsec.com/p/multimodal-prompt-injection-attacks-images-audio">multimodal injections</a> ride right past text-based sanitizers.</p><p>RAG document poisoning sits between training and inference. The attacker plants a malicious document in your knowledge base. When a user query retrieves it, the model follows the embedded instructions. Research demonstrated that a single injected document achieves higher success rates than older multi-document approaches.</p><p><strong>Questions to ask:</strong> Where does untrusted data enter the training pipeline? Who can modify documents in the RAG knowledge base? Are tool descriptions treated as trusted input?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!nfSP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89fdf1fc-693e-4899-9e42-d72efc8712b3_1952x1386.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nfSP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89fdf1fc-693e-4899-9e42-d72efc8712b3_1952x1386.png 424w, https://substackcdn.com/image/fetch/$s_!nfSP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89fdf1fc-693e-4899-9e42-d72efc8712b3_1952x1386.png 848w, https://substackcdn.com/image/fetch/$s_!nfSP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89fdf1fc-693e-4899-9e42-d72efc8712b3_1952x1386.png 1272w, https://substackcdn.com/image/fetch/$s_!nfSP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89fdf1fc-693e-4899-9e42-d72efc8712b3_1952x1386.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nfSP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89fdf1fc-693e-4899-9e42-d72efc8712b3_1952x1386.png" width="1456" height="1034" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/89fdf1fc-693e-4899-9e42-d72efc8712b3_1952x1386.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1034,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:221323,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/193725871?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89fdf1fc-693e-4899-9e42-d72efc8712b3_1952x1386.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!nfSP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89fdf1fc-693e-4899-9e42-d72efc8712b3_1952x1386.png 424w, https://substackcdn.com/image/fetch/$s_!nfSP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89fdf1fc-693e-4899-9e42-d72efc8712b3_1952x1386.png 848w, https://substackcdn.com/image/fetch/$s_!nfSP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89fdf1fc-693e-4899-9e42-d72efc8712b3_1952x1386.png 1272w, https://substackcdn.com/image/fetch/$s_!nfSP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F89fdf1fc-693e-4899-9e42-d72efc8712b3_1952x1386.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>How Repudiation Hides in Agent Logs</h2><p>Repudiation in traditional systems means someone does something and you can&#8217;t prove it. Missing audit logs. Unsigned transactions. The fix is straightforward: log everything, sign the entries, retain them securely.</p><p>AI agents make this exponentially harder. An autonomous agent chains tool calls, makes decisions based on probabilistic reasoning, and produces outputs that vary run to run. If an agent makes a financial decision, modifies a file, or sends a message, can you reconstruct why? Most agent frameworks log the final output. Few log the full reasoning chain, the retrieved context, the tool call sequence, or the system prompt that was active when the decision fired. The <a href="https://www.toxsec.com/p/ai-kill-chain-explained">AI kill chain</a> persistence phase exploits exactly this gap: an attacker poisons the agent&#8217;s memory, and the tampered state persists across sessions with no audit trail showing when it changed.</p><p><strong>Questions to ask:</strong> Does every agent tool call get logged with parameters and return values? Can you reconstruct the full context window that produced a given output? Are reasoning chains stored, or just final answers?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!52LL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06f40501-3442-4e8e-845a-801e268ce2b5_2916x1404.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!52LL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06f40501-3442-4e8e-845a-801e268ce2b5_2916x1404.png 424w, https://substackcdn.com/image/fetch/$s_!52LL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06f40501-3442-4e8e-845a-801e268ce2b5_2916x1404.png 848w, https://substackcdn.com/image/fetch/$s_!52LL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06f40501-3442-4e8e-845a-801e268ce2b5_2916x1404.png 1272w, https://substackcdn.com/image/fetch/$s_!52LL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06f40501-3442-4e8e-845a-801e268ce2b5_2916x1404.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!52LL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06f40501-3442-4e8e-845a-801e268ce2b5_2916x1404.png" width="1456" height="701" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/06f40501-3442-4e8e-845a-801e268ce2b5_2916x1404.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:701,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:370040,&quot;alt&quot;:&quot;Securing AI Agent Actions.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/193725871?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06f40501-3442-4e8e-845a-801e268ce2b5_2916x1404.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Securing AI Agent Actions." title="Securing AI Agent Actions." srcset="https://substackcdn.com/image/fetch/$s_!52LL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06f40501-3442-4e8e-845a-801e268ce2b5_2916x1404.png 424w, https://substackcdn.com/image/fetch/$s_!52LL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06f40501-3442-4e8e-845a-801e268ce2b5_2916x1404.png 848w, https://substackcdn.com/image/fetch/$s_!52LL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06f40501-3442-4e8e-845a-801e268ce2b5_2916x1404.png 1272w, https://substackcdn.com/image/fetch/$s_!52LL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06f40501-3442-4e8e-845a-801e268ce2b5_2916x1404.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>How Information Disclosure Leaks From AI Systems</h2><p>Traditional info disclosure means sensitive data reaches someone who shouldn&#8217;t see it. SQL injection dumps the user table. Error messages expose stack traces. AI systems leak through entirely new channels.</p><p>System prompt extraction is the most common. The system prompt contains the developer&#8217;s instructions, business logic, and sometimes credentials. An attacker coaxes the model into reproducing it verbatim. This is trivially easy on most deployments. Jailbreak techniques that bypass safety training give the attacker direct access to whatever&#8217;s in the context window.</p><p>Embedding inversion is the quieter threat. Vector databases store your documents as numerical embeddings. Research has shown these embeddings can be reversed back into the original text. Your &#8220;encrypted&#8221; knowledge base is functionally plaintext if the embeddings are accessible.</p><p>Context window exfiltration chains with tool access. If the model can render Markdown images and the client loads them, an attacker can encode the context window contents into a URL parameter. The model generates what looks like a weather icon. The server on the other end receives your conversation history. This is the exact chain used in <a href="https://www.toxsec.com/p/lets-poison-the-mcp">MCP tool poisoning attacks</a> running in production today.</p><p><strong>Questions to ask:</strong> What&#8217;s in the system prompt? Can any user-facing path extract it? Are vector embeddings accessible outside the application? Does the client render model-generated URLs without sanitization?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!pQqZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb11c6a1-95b7-4940-8571-54c1263671df_2185x1639.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!pQqZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb11c6a1-95b7-4940-8571-54c1263671df_2185x1639.png 424w, https://substackcdn.com/image/fetch/$s_!pQqZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb11c6a1-95b7-4940-8571-54c1263671df_2185x1639.png 848w, https://substackcdn.com/image/fetch/$s_!pQqZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb11c6a1-95b7-4940-8571-54c1263671df_2185x1639.png 1272w, https://substackcdn.com/image/fetch/$s_!pQqZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb11c6a1-95b7-4940-8571-54c1263671df_2185x1639.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!pQqZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb11c6a1-95b7-4940-8571-54c1263671df_2185x1639.png" width="1456" height="1092" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/eb11c6a1-95b7-4940-8571-54c1263671df_2185x1639.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1092,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:357313,&quot;alt&quot;:&quot;Common information Disclosure Channels in AI.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/193725871?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb11c6a1-95b7-4940-8571-54c1263671df_2185x1639.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Common information Disclosure Channels in AI." title="Common information Disclosure Channels in AI." srcset="https://substackcdn.com/image/fetch/$s_!pQqZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb11c6a1-95b7-4940-8571-54c1263671df_2185x1639.png 424w, https://substackcdn.com/image/fetch/$s_!pQqZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb11c6a1-95b7-4940-8571-54c1263671df_2185x1639.png 848w, https://substackcdn.com/image/fetch/$s_!pQqZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb11c6a1-95b7-4940-8571-54c1263671df_2185x1639.png 1272w, https://substackcdn.com/image/fetch/$s_!pQqZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb11c6a1-95b7-4940-8571-54c1263671df_2185x1639.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>How Denial of Service Drains AI Budgets</h2><p>Traditional DoS floods a server. AI denial of service is subtler and more expensive. Every LLM query burns tokens. Every token costs money. An attacker who forces the model into expensive execution paths doesn&#8217;t crash your service. They drain your cloud budget while staying under every request-based rate limit you&#8217;ve set.</p><p>Documented incidents include $46,000/day consumption attacks against AWS Bedrock via stolen credentials (Sysdig&#8217;s LLMjacking research), and an $82,000 Gemini API bill in 48 hours from a single compromised key earlier this year. Standard rate limiters count requests, not cost. One request hitting a multi-step agentic workflow can cost 500x more than a cached response. Both count as one request. We covered the full attack pattern in <a href="https://www.toxsec.com/p/denial-of-wallet">denial of wallet</a>.</p><p><strong>Questions to ask:</strong> Do you rate-limit by tokens or by requests? Is there a hard spending cap per API key? How fast would you detect a 4,000% spike in token usage at 2 AM?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ElxR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faeaa2e41-197a-4639-b1c9-7a0d6a65f01a_2723x1782.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ElxR!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faeaa2e41-197a-4639-b1c9-7a0d6a65f01a_2723x1782.png 424w, https://substackcdn.com/image/fetch/$s_!ElxR!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faeaa2e41-197a-4639-b1c9-7a0d6a65f01a_2723x1782.png 848w, https://substackcdn.com/image/fetch/$s_!ElxR!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faeaa2e41-197a-4639-b1c9-7a0d6a65f01a_2723x1782.png 1272w, https://substackcdn.com/image/fetch/$s_!ElxR!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faeaa2e41-197a-4639-b1c9-7a0d6a65f01a_2723x1782.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ElxR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faeaa2e41-197a-4639-b1c9-7a0d6a65f01a_2723x1782.png" width="1456" height="953" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/aeaa2e41-197a-4639-b1c9-7a0d6a65f01a_2723x1782.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:953,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:354312,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/193725871?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faeaa2e41-197a-4639-b1c9-7a0d6a65f01a_2723x1782.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ElxR!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faeaa2e41-197a-4639-b1c9-7a0d6a65f01a_2723x1782.png 424w, https://substackcdn.com/image/fetch/$s_!ElxR!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faeaa2e41-197a-4639-b1c9-7a0d6a65f01a_2723x1782.png 848w, https://substackcdn.com/image/fetch/$s_!ElxR!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faeaa2e41-197a-4639-b1c9-7a0d6a65f01a_2723x1782.png 1272w, https://substackcdn.com/image/fetch/$s_!ElxR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faeaa2e41-197a-4639-b1c9-7a0d6a65f01a_2723x1782.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>How Elevation of Privilege Chains Through Agent Tools</h2><p>In traditional apps, privesc means a regular user gains admin access. Buffer overflow, misconfigured RBAC, path traversal to a config file. In AI systems, the model itself is the privilege boundary, and it&#8217;s terrible at enforcing one.</p><p>Excessive agency is the OWASP term. The model has access to tools, APIs, file systems, and external services. If the model can be tricked via prompt injection into calling those tools with attacker-controlled parameters, the attacker inherits every permission the model holds. Vibe-coded applications ship with admin routes unprotected because the AI never thought to add auth. MCP tool chains grant the agent capabilities the developer never scoped. Each connected tool is another capability an attacker inherits. The full picture of <a href="https://www.toxsec.com/p/owasp-top-10-for-genai">how OWASP LLM Top 10 chains together in production</a> shows why this category sits at the top of every real incident.</p><p>The NVIDIA AI Kill Chain maps this as the hijack phase: the attacker takes control of the model&#8217;s behavior, then uses its legitimate tool access to reach systems the attacker could never touch directly.</p><p><strong>Questions to ask:</strong> What&#8217;s the least privilege set this agent actually needs? Can the model invoke destructive operations without human approval? Are tool permissions scoped per-session or standing?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!AFbx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff699a454-94a2-4a02-8ed4-5d51c66a4d78_2774x1821.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!AFbx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff699a454-94a2-4a02-8ed4-5d51c66a4d78_2774x1821.png 424w, https://substackcdn.com/image/fetch/$s_!AFbx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff699a454-94a2-4a02-8ed4-5d51c66a4d78_2774x1821.png 848w, https://substackcdn.com/image/fetch/$s_!AFbx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff699a454-94a2-4a02-8ed4-5d51c66a4d78_2774x1821.png 1272w, https://substackcdn.com/image/fetch/$s_!AFbx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff699a454-94a2-4a02-8ed4-5d51c66a4d78_2774x1821.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!AFbx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff699a454-94a2-4a02-8ed4-5d51c66a4d78_2774x1821.png" width="1456" height="956" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f699a454-94a2-4a02-8ed4-5d51c66a4d78_2774x1821.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:956,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:397595,&quot;alt&quot;:&quot;Toxsec.com - Prompt injection elevates Agents.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/193725871?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff699a454-94a2-4a02-8ed4-5d51c66a4d78_2774x1821.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Toxsec.com - Prompt injection elevates Agents." title="Toxsec.com - Prompt injection elevates Agents." srcset="https://substackcdn.com/image/fetch/$s_!AFbx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff699a454-94a2-4a02-8ed4-5d51c66a4d78_2774x1821.png 424w, https://substackcdn.com/image/fetch/$s_!AFbx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff699a454-94a2-4a02-8ed4-5d51c66a4d78_2774x1821.png 848w, https://substackcdn.com/image/fetch/$s_!AFbx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff699a454-94a2-4a02-8ed4-5d51c66a4d78_2774x1821.png 1272w, https://substackcdn.com/image/fetch/$s_!AFbx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff699a454-94a2-4a02-8ed4-5d51c66a4d78_2774x1821.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>Beyond STRIDE: MAESTRO, ASTRIDE, and Shostack&#8217;s Four Questions</h2><p>STRIDE gives you the vocabulary. It tells you what can go wrong. But it was designed for applications with predictable execution paths, and AI breaks that assumption at the architectural level. Three extensions fill the gaps.</p><p><a href="https://www.mdpi.com/1424-8220/22/17/6662">STRIDE-AI</a> (Mauri &amp; Damiani, 2021 IEEE CSR) was the first formal adaptation. It maps STRIDE categories to ML-specific assets across the full pipeline: training data, model weights, inference APIs, and deployment artifacts. The contribution is making ML assets first-class citizens in the threat model instead of afterthoughts.</p><p><a href="https://arxiv.org/abs/2512.04785">ASTRIDE</a> (December 2025) is the first STRIDE-derived extension purpose-built for agentic systems. It adds a seventh category, &#8220;A&#8221; for AI Agent-Specific Attacks, covering prompt injection, unsafe reasoning-driven tool use, and context window manipulation. The framework leans hard into automated diagram-driven analysis using vision-language models.</p><p><a href="https://cloudsecurityalliance.org/blog/2025/02/06/agentic-ai-threat-modeling-framework-maestro">MAESTRO</a> (Cloud Security Alliance, February 2025) takes a different approach entirely: seven architectural layers from foundation models through reasoning and communication, each evaluated for AI-specific threats like multimodal injection, hallucination exploitation, and cross-layer threat chaining. Where STRIDE asks &#8220;what can go wrong at each component,&#8221; MAESTRO asks &#8220;what can go wrong at each layer of the AI stack.&#8221;</p><p>Adam Shostack&#8217;s Four Questions remain the backbone regardless of framework: What are we working on? What can go wrong? What are we going to do about it? Did we do a good enough job? Recent Microsoft guidance reinforces that AI threat modeling only works when grounded in the system as it truly operates, where the prompt assembly pipeline is a first-class security boundary.</p><blockquote><p>That's the framework. </p><p>Behind the wall: the copy-paste prompt that runs a full STRIDE-AI pass against your own architecture in one shot, the seven red flags that mean you're already exposed, and the exact three-layer circuit breaker that catches denial-of-wallet before the $82K invoice lands. </p><p>Free subs get the theory. Paid subs get the kit.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.toxsec.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.toxsec.com/subscribe?"><span>Subscribe now</span></a></p></blockquote>
      <p>
          <a href="https://www.toxsec.com/p/how-to-threat-model-ai-applications">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Promptfoo Red Teaming: DAST for Your LLM Pipeline]]></title><description><![CDATA[YAML config, one command, 50+ attack plugins. OpenAI just bought the company. Still MIT licensed.]]></description><link>https://www.toxsec.com/p/promptfoo-red-teaming</link><guid isPermaLink="false">https://www.toxsec.com/p/promptfoo-red-teaming</guid><dc:creator><![CDATA[ToxSec]]></dc:creator><pubDate>Sat, 09 May 2026 13:31:22 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!ZbyR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31fec9c4-6ffa-42f0-a867-288a0790c7ef_2752x1536.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZbyR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31fec9c4-6ffa-42f0-a867-288a0790c7ef_2752x1536.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZbyR!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31fec9c4-6ffa-42f0-a867-288a0790c7ef_2752x1536.png 424w, https://substackcdn.com/image/fetch/$s_!ZbyR!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31fec9c4-6ffa-42f0-a867-288a0790c7ef_2752x1536.png 848w, https://substackcdn.com/image/fetch/$s_!ZbyR!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31fec9c4-6ffa-42f0-a867-288a0790c7ef_2752x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!ZbyR!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31fec9c4-6ffa-42f0-a867-288a0790c7ef_2752x1536.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZbyR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31fec9c4-6ffa-42f0-a867-288a0790c7ef_2752x1536.png" width="2752" height="1536" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/31fec9c4-6ffa-42f0-a867-288a0790c7ef_2752x1536.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1536,&quot;width&quot;:2752,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:7434020,&quot;alt&quot;:&quot;Promptfoo red teaming LLM vulnerability scanner tutorial showing YAML config attack plugins strategies and web UI results for AI security testing.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/193714884?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13d24168-9e36-49e1-ae4f-efeb38afe030_2752x1536.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Promptfoo red teaming LLM vulnerability scanner tutorial showing YAML config attack plugins strategies and web UI results for AI security testing." title="Promptfoo red teaming LLM vulnerability scanner tutorial showing YAML config attack plugins strategies and web UI results for AI security testing." srcset="https://substackcdn.com/image/fetch/$s_!ZbyR!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31fec9c4-6ffa-42f0-a867-288a0790c7ef_2752x1536.png 424w, https://substackcdn.com/image/fetch/$s_!ZbyR!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31fec9c4-6ffa-42f0-a867-288a0790c7ef_2752x1536.png 848w, https://substackcdn.com/image/fetch/$s_!ZbyR!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31fec9c4-6ffa-42f0-a867-288a0790c7ef_2752x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!ZbyR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31fec9c4-6ffa-42f0-a867-288a0790c7ef_2752x1536.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>TL;DR:</strong> Promptfoo is an open-source CLI for evaluating and red teaming LLM apps. YAML config, 50+ attack plugins, built-in OWASP LLM Top 10 presets, and a web UI that shows exactly where your model broke. OpenAI acquired the company in March 2026, terms undisclosed. It stays MIT licensed and open source. One command generates hundreds of adversarial test cases and scores them automatically.</p><blockquote><p>This is the public feed. Upgrade to see what doesn&#8217;t make it out.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.toxsec.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.toxsec.com/subscribe?"><span>Subscribe now</span></a></p></blockquote><h3>Why Promptfoo Is the Red Team Tool Your Dev Team Will Actually Use</h3><p>Security tools that only security people run don&#8217;t stop bugs from shipping. They catch bugs after the damage is done. The tool that stops a vulnerable LLM from hitting production is the one that sits in the build pipeline and blocks the deploy.</p><p>Promptfoo is that tool. It&#8217;s a CLI and Node.js library for evaluating and red teaming LLM applications. YAML-configured, CI/CD-native, and designed for the developer workflow: define your target, pick your plugins, run the scan, read the web UI. The red team mode auto-generates adversarial prompts using 50+ attack plugins across prompt injection, jailbreaks, PII leakage, SSRF, SQL injection, excessive agency, hallucination, and more. It ships with OWASP LLM Top 10 presets, NIST AI RMF mappings, and MITRE ATLAS coverage. One line in your config enables an entire compliance framework&#8217;s worth of testing.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6ADY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9cc587f-556a-47de-a415-21c59a777a84_985x652.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6ADY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9cc587f-556a-47de-a415-21c59a777a84_985x652.png 424w, https://substackcdn.com/image/fetch/$s_!6ADY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9cc587f-556a-47de-a415-21c59a777a84_985x652.png 848w, https://substackcdn.com/image/fetch/$s_!6ADY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9cc587f-556a-47de-a415-21c59a777a84_985x652.png 1272w, https://substackcdn.com/image/fetch/$s_!6ADY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9cc587f-556a-47de-a415-21c59a777a84_985x652.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6ADY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9cc587f-556a-47de-a415-21c59a777a84_985x652.png" width="985" height="652" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e9cc587f-556a-47de-a415-21c59a777a84_985x652.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:652,&quot;width&quot;:985,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:42670,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/193714884?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9cc587f-556a-47de-a415-21c59a777a84_985x652.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6ADY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9cc587f-556a-47de-a415-21c59a777a84_985x652.png 424w, https://substackcdn.com/image/fetch/$s_!6ADY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9cc587f-556a-47de-a415-21c59a777a84_985x652.png 848w, https://substackcdn.com/image/fetch/$s_!6ADY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9cc587f-556a-47de-a415-21c59a777a84_985x652.png 1272w, https://substackcdn.com/image/fetch/$s_!6ADY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9cc587f-556a-47de-a415-21c59a777a84_985x652.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The pedigree: 10.4k GitHub stars, 350,000+ developers, 130,000 active monthly users, and adoption at 25% of Fortune 500 companies. OpenAI and Anthropic both ran it internally before <a href="https://openai.com/index/openai-to-acquire-promptfoo/">OpenAI acquired the company on March 9, 2026</a>. Acquisition terms were undisclosed, though Promptfoo had been valued at $86 million at its July 2025 Series A. The repo stays open source under MIT and lives at github.com/promptfoo/promptfoo.</p><p>The difference between Promptfoo and the other tools in this space: your dev team will actually adopt it. YAML configs live in your repo. Results render in a browser. CI/CD integration means red teaming runs on every PR. No Python notebooks, no manual orchestration, no &#8220;let the security team handle it.&#8221; Security shifts left to where the code is written. <a href="https://www.toxsec.com/p/garak-llm-vulnerability-scanner">Garak gives us the broad CLI sweep across known probe families</a>. <a href="https://www.toxsec.com/p/pyrit-ai-red-teaming">PyRIT runs the surgical multi-turn follow-up</a>. Promptfoo is the one that sits in the pipeline and blocks the merge.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qqvV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23a255f9-6130-4c43-b122-5176c0eed2ab_2667x1170.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qqvV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23a255f9-6130-4c43-b122-5176c0eed2ab_2667x1170.png 424w, https://substackcdn.com/image/fetch/$s_!qqvV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23a255f9-6130-4c43-b122-5176c0eed2ab_2667x1170.png 848w, https://substackcdn.com/image/fetch/$s_!qqvV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23a255f9-6130-4c43-b122-5176c0eed2ab_2667x1170.png 1272w, https://substackcdn.com/image/fetch/$s_!qqvV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23a255f9-6130-4c43-b122-5176c0eed2ab_2667x1170.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qqvV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23a255f9-6130-4c43-b122-5176c0eed2ab_2667x1170.png" width="2667" height="1170" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/23a255f9-6130-4c43-b122-5176c0eed2ab_2667x1170.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1170,&quot;width&quot;:2667,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:159260,&quot;alt&quot;:&quot;Toxsec.com - Promptfoo, Garak, or PyRIT.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/193714884?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F41ca6cb5-8bd3-45e2-acee-dce20f44d460_2667x1296.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Toxsec.com - Promptfoo, Garak, or PyRIT." title="Toxsec.com - Promptfoo, Garak, or PyRIT." srcset="https://substackcdn.com/image/fetch/$s_!qqvV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23a255f9-6130-4c43-b122-5176c0eed2ab_2667x1170.png 424w, https://substackcdn.com/image/fetch/$s_!qqvV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23a255f9-6130-4c43-b122-5176c0eed2ab_2667x1170.png 848w, https://substackcdn.com/image/fetch/$s_!qqvV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23a255f9-6130-4c43-b122-5176c0eed2ab_2667x1170.png 1272w, https://substackcdn.com/image/fetch/$s_!qqvV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23a255f9-6130-4c43-b122-5176c0eed2ab_2667x1170.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Plugins, Strategies, and the YAML That Runs It All</h3><p>Three concepts drive Promptfoo&#8217;s red team architecture.</p><p><strong>Plugins</strong> generate adversarial inputs targeting specific vulnerability classes. <code>harmful</code> generates prompts that attempt to elicit dangerous content. <code>jailbreak</code> tests guardrail bypass resistance. <code>hijacking</code> checks whether an attacker can redirect the model&#8217;s behavior. <code>pii:direct</code>, <code>pii:session</code>, and <code>pii:social</code> test for PII leakage through different vectors. <code>ssrf</code>, <code>sql-injection</code>, <code>shell-injection</code> test for the exact agent-level attacks that bounty programs pay for. Framework presets bundle related plugins: <code>owasp:llm</code> enables the full OWASP LLM Top 10 suite. <code>owasp:agentic</code> covers the newer OWASP Top 10 for AI Agents.</p><p><strong>Strategies</strong> determine how those adversarial inputs get delivered. <code>prompt-injection</code> wraps payloads in injection frames. <code>jailbreak</code> applies <a href="https://www.toxsec.com/p/dan-prompts-for-guardrail-bypass">DAN-style bypass techniques</a>. <code>crescendo</code> runs multi-turn escalation where each message builds on the last. These are the same attack patterns we&#8217;ve been stacking against guardrails manually, except Promptfoo automates the generation and delivery.</p><p>The YAML config ties everything together.</p><div class="highlighted_code_block" data-attrs="{&quot;language&quot;:&quot;yaml&quot;,&quot;nodeId&quot;:&quot;2d799992-66de-453d-97e7-b88a976b7b57&quot;}" data-component-name="HighlightedCodeBlockToDOM"><pre class="shiki"><code class="language-yaml"># promptfooconfig.yaml
targets:
  - id: openai:gpt-4o
    label: customer-service-bot

  # Or hit your own endpoint:
  - id: 'https://api.yourapp.com/chat'
    config:
      method: 'POST'
      headers:
        'Content-Type': 'application/json'
      body:
        message: '{{prompt}}'
      transformResponse: 'json.response'

redteam:
  purpose: &gt;
    Customer service chatbot for an airline.
    Users can check flight status, book tickets,
    and manage reservations.
  plugins:
    - owasp:llm          # Full OWASP LLM Top 10
    - harmful
    - pii
    - ssrf
    - excessive-agency
  strategies:
    - jailbreak
    - prompt-injection
    - crescendo</code></pre></div><p>That config scans your chatbot across every OWASP LLM Top 10 category, tests for PII exposure, checks for SSRF, and applies three different delivery strategies to each attack. The <code>purpose</code> field matters. Promptfoo uses it to generate contextually relevant adversarial prompts. An airline chatbot gets probes about frequent flyer data and booking system access. A healthcare app gets probes about patient records and HIPAA violations.</p><p>Run it:</p><div class="highlighted_code_block" data-attrs="{&quot;language&quot;:&quot;bash&quot;,&quot;nodeId&quot;:&quot;3ea5070e-9fe1-4a8a-9351-934aac1eef09&quot;}" data-component-name="HighlightedCodeBlockToDOM"><pre class="shiki"><code class="language-bash">npm install -g promptfoo
promptfoo redteam init my-scan --no-gui
# Edit promptfooconfig.yaml with the config above
promptfoo redteam run</code></pre></div><p>Generation takes about five minutes. The scan runs every generated test case against your target, grades each response using an LLM judge, and renders the results in a web UI. Red means it broke. Green means it held. Click any finding to see the exact adversarial prompt, the model&#8217;s response, and the grader&#8217;s reasoning.</p><h3>The Promptfoo Report Card You Can&#8217;t Argue With</h3><p>Here&#8217;s what makes Promptfoo dangerous for complacent teams. The web UI generates a compliance report card. <a href="https://www.toxsec.com/p/owasp-top-10-for-genai">OWASP LLM Top 10</a>, NIST AI RMF, MITRE ATLAS. Each framework&#8217;s relevant controls mapped to your scan results. Green checkmarks where you passed. Red flags where you failed. Severity ratings. Evidence trails.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VnCM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b436ed5-d46e-47ac-9fa9-6faf9c5edc5f_955x627.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VnCM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b436ed5-d46e-47ac-9fa9-6faf9c5edc5f_955x627.png 424w, https://substackcdn.com/image/fetch/$s_!VnCM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b436ed5-d46e-47ac-9fa9-6faf9c5edc5f_955x627.png 848w, https://substackcdn.com/image/fetch/$s_!VnCM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b436ed5-d46e-47ac-9fa9-6faf9c5edc5f_955x627.png 1272w, https://substackcdn.com/image/fetch/$s_!VnCM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b436ed5-d46e-47ac-9fa9-6faf9c5edc5f_955x627.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VnCM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b436ed5-d46e-47ac-9fa9-6faf9c5edc5f_955x627.png" width="955" height="627" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2b436ed5-d46e-47ac-9fa9-6faf9c5edc5f_955x627.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:627,&quot;width&quot;:955,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:44747,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/193714884?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b436ed5-d46e-47ac-9fa9-6faf9c5edc5f_955x627.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!VnCM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b436ed5-d46e-47ac-9fa9-6faf9c5edc5f_955x627.png 424w, https://substackcdn.com/image/fetch/$s_!VnCM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b436ed5-d46e-47ac-9fa9-6faf9c5edc5f_955x627.png 848w, https://substackcdn.com/image/fetch/$s_!VnCM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b436ed5-d46e-47ac-9fa9-6faf9c5edc5f_955x627.png 1272w, https://substackcdn.com/image/fetch/$s_!VnCM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b436ed5-d46e-47ac-9fa9-6faf9c5edc5f_955x627.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Your chatbot just failed three OWASP categories across 23 individual test cases. The <code>prompt-injection</code> plugin found that jailbreak-wrapped requests bypass your system prompt 40% of the time. The <code>pii</code> plugin extracted customer email addresses through a social engineering frame. The <code>excessive-agency</code> plugin got the model to attempt API calls it shouldn&#8217;t have access to.</p><p>All documented. All reproducible. All sitting in a web dashboard your engineering manager can read without knowing what a jailbreak is. That&#8217;s the part that changes behavior. Security findings buried in JSONL logs get ignored. Security findings rendered in a color-coded dashboard with OWASP mappings get fixed.</p><p>And every finding has a timestamp, a conversation transcript, and a grader explanation. That&#8217;s your bounty submission evidence. That&#8217;s your compliance audit trail. That&#8217;s the artifact your CISO shows the board when they ask &#8220;how do we know our AI is secure?&#8221;</p><blockquote><p>We dropped the free chapters. Now breach the wall for the dead-simple step-by-step kill switch that shuts this all down.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.toxsec.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.toxsec.com/subscribe?"><span>Subscribe now</span></a></p></blockquote>
      <p>
          <a href="https://www.toxsec.com/p/promptfoo-red-teaming">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Garak Vulnerability Scanner: Nessus for LLMs]]></title><description><![CDATA[Point it at a model. Pick your probes. Watch every guardrail break in JSONL.]]></description><link>https://www.toxsec.com/p/garak-llm-vulnerability-scanner</link><guid isPermaLink="false">https://www.toxsec.com/p/garak-llm-vulnerability-scanner</guid><dc:creator><![CDATA[ToxSec]]></dc:creator><pubDate>Wed, 06 May 2026 13:31:30 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!wOGj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b7c9ebd-9765-42b5-8259-e03a2bb2d743_2752x1536.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wOGj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b7c9ebd-9765-42b5-8259-e03a2bb2d743_2752x1536.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wOGj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b7c9ebd-9765-42b5-8259-e03a2bb2d743_2752x1536.png 424w, https://substackcdn.com/image/fetch/$s_!wOGj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b7c9ebd-9765-42b5-8259-e03a2bb2d743_2752x1536.png 848w, https://substackcdn.com/image/fetch/$s_!wOGj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b7c9ebd-9765-42b5-8259-e03a2bb2d743_2752x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!wOGj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b7c9ebd-9765-42b5-8259-e03a2bb2d743_2752x1536.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wOGj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b7c9ebd-9765-42b5-8259-e03a2bb2d743_2752x1536.png" width="2752" height="1536" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5b7c9ebd-9765-42b5-8259-e03a2bb2d743_2752x1536.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1536,&quot;width&quot;:2752,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:7298228,&quot;alt&quot;:&quot;Garak NVIDIA LLM vulnerability scanner tutorial showing probes detectors generators and CLI output for AI security testing and bug bounty.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/193694931?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a127658-a233-48ce-8017-a46617c303ab_2752x1536.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Garak NVIDIA LLM vulnerability scanner tutorial showing probes detectors generators and CLI output for AI security testing and bug bounty." title="Garak NVIDIA LLM vulnerability scanner tutorial showing probes detectors generators and CLI output for AI security testing and bug bounty." srcset="https://substackcdn.com/image/fetch/$s_!wOGj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b7c9ebd-9765-42b5-8259-e03a2bb2d743_2752x1536.png 424w, https://substackcdn.com/image/fetch/$s_!wOGj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b7c9ebd-9765-42b5-8259-e03a2bb2d743_2752x1536.png 848w, https://substackcdn.com/image/fetch/$s_!wOGj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b7c9ebd-9765-42b5-8259-e03a2bb2d743_2752x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!wOGj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b7c9ebd-9765-42b5-8259-e03a2bb2d743_2752x1536.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>TL;DR:</strong> Garak is NVIDIA&#8217;s open-source LLM vulnerability scanner. Point it at a model, pick your probes, and it fires hundreds of known attack patterns across prompt injection, jailbreaks, encoding bypasses, data leakage, and toxicity. CLI-first, plugin-based, fast. Your model just failed 47 probes across six categories. Now what?</p><blockquote><p>This is the public feed. Upgrade to see what doesn&#8217;t make it out.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.toxsec.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.toxsec.com/subscribe?"><span>Subscribe now</span></a></p></blockquote><h3>What Is Garak and Why You Run It First</h3><p>Nobody ships a web app without running a vulnerability scanner against it first. Nikto, Nessus, nuclei. Pick your poison, point it at the target, let it rip through known attack patterns, then read the report. LLMs ship without this step every single day.</p><p>Garak fixes that. The Generative AI Red-teaming and Assessment Kit is <a href="https://github.com/NVIDIA/garak">NVIDIA&#8217;s open-source LLM vulnerability scanner</a>, built by their AI Red Team and backed by a research paper, 7.5k GitHub stars, and an active Discord. The latest stable release is v0.14.1, shipped April 2026, so the project is actively maintained and shipping. The tool probes your model&#8217;s defenses while looking completely benign.</p><p>The workflow is simple. Install. Point it at a model. Pick probes (or let it pick all of them). Garak fires every probe, runs each prompt multiple times to account for the model&#8217;s stochastic output, scores responses through detectors, and writes a structured JSONL report. One command, hundreds of attack vectors, a complete audit trail.</p><p>Garak covers the attack categories that matter: prompt injection, <a href="https://www.toxsec.com/p/dan-prompts-for-guardrail-bypass">DAN-family jailbreaks</a>, encoding-based guardrail bypasses, data leakage, package hallucination (the <a href="https://www.toxsec.com/p/what-is-slopsquatting-ai-hallucinations">slopsquatting</a> vector), toxicity generation, malware generation attempts, cross-site scripting through LLM output, hallucination, and <a href="https://www.toxsec.com/p/token-level-ai-security-the-opus">glitch token exploitation</a>. 37+ probe modules, each containing multiple individual probes. The dan module alone ships with about fifteen scannable variants spanning DAN 6.0 through 11.0, plus STAN, DUDE, AntiDAN, and ChatGPT Developer Mode. The encoding module covers Base64, Base16, Base32, ROT13, Morse, Braille, ASCII85, hex, and more.</p><p>Think of Garak as Nessus before the pentest. We&#8217;re mapping the attack surface. Which probes get through. Which get blocked. Where the filters are soft. That scan data tells us where to aim our manual prompt injection chains. And once Garak flags the broken families, <a href="https://www.toxsec.com/p/pyrit-ai-red-teaming">PyRIT picks up the deep, adaptive multi-turn follow-up</a>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vfcu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd57f29ce-9701-49b8-bad5-2bbac4d00524_2326x1756.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vfcu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd57f29ce-9701-49b8-bad5-2bbac4d00524_2326x1756.png 424w, https://substackcdn.com/image/fetch/$s_!vfcu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd57f29ce-9701-49b8-bad5-2bbac4d00524_2326x1756.png 848w, https://substackcdn.com/image/fetch/$s_!vfcu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd57f29ce-9701-49b8-bad5-2bbac4d00524_2326x1756.png 1272w, https://substackcdn.com/image/fetch/$s_!vfcu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd57f29ce-9701-49b8-bad5-2bbac4d00524_2326x1756.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vfcu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd57f29ce-9701-49b8-bad5-2bbac4d00524_2326x1756.png" width="2326" height="1756" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d57f29ce-9701-49b8-bad5-2bbac4d00524_2326x1756.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1756,&quot;width&quot;:2326,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:264813,&quot;alt&quot;:&quot;Toxsec.com Garak Vulnerability Scanner.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/193694931?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b4ea58f-b8ea-48a6-9043-0d5a644dfb24_2326x2049.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Toxsec.com Garak Vulnerability Scanner." title="Toxsec.com Garak Vulnerability Scanner." srcset="https://substackcdn.com/image/fetch/$s_!vfcu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd57f29ce-9701-49b8-bad5-2bbac4d00524_2326x1756.png 424w, https://substackcdn.com/image/fetch/$s_!vfcu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd57f29ce-9701-49b8-bad5-2bbac4d00524_2326x1756.png 848w, https://substackcdn.com/image/fetch/$s_!vfcu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd57f29ce-9701-49b8-bad5-2bbac4d00524_2326x1756.png 1272w, https://substackcdn.com/image/fetch/$s_!vfcu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd57f29ce-9701-49b8-bad5-2bbac4d00524_2326x1756.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Generators, Probes, and Detectors: The Three Moving Parts</h3><p>Garak&#8217;s architecture has three components that matter.</p><p><strong>Generators</strong> are our connection to the target. OpenAI API, Hugging Face (pipeline and inference), AWS Bedrock, Cohere, Groq, Mistral, Ollama for local models, NVIDIA NIM endpoints, Replicate, LiteLLM, and custom REST APIs. If the model accepts text over an API, Garak can hit it.</p><div class="highlighted_code_block" data-attrs="{&quot;language&quot;:&quot;bash&quot;,&quot;nodeId&quot;:&quot;97b97e50-ffe5-4fa1-8e60-feb92943db67&quot;}" data-component-name="HighlightedCodeBlockToDOM"><pre class="shiki"><code class="language-bash"># Scan an OpenAI model for encoding-based injection
export OPENAI_API_KEY="sk-[REDACTED]"
python3 -m garak --target_type openai --target_name gpt-5-nano --probes encoding

# Scan a local Ollama model for DAN jailbreaks
python3 -m garak --target_type ollama --target_name llama3 --probes dan

# Scan a Hugging Face model for everything
python3 -m garak --target_type huggingface --target_name meta-llama/Llama-3-8b --probes all</code></pre></div><p><strong>Probes</strong> generate the attack payloads. Each probe module targets a specific vulnerability class and contains multiple individual prompts. Garak sends each prompt to the model ten times by default. Ten generations per prompt. That repetition matters because LLM output is non-deterministic. A model that refuses a jailbreak nine times out of ten still has a 10% bypass rate, and that 10% is a finding worth documenting.</p><p>The probe taxonomy maps directly to known vulnerability classes. promptinject implements the Agency Enterprise PromptInject framework for hijacking attacks. dan runs the full DAN family. encoding tests whether the same encoding stacks we use manually scale up to automation. leakreplay and knownbadsignatures check for training data extraction and malware signature generation. packagehallucination tests whether the model invents package names that don&#8217;t exist on PyPI or npm.</p><p><strong>Detectors</strong> evaluate the output. Simple string matching for known bad signatures. Classifier-based detection using small models for toxicity scoring. LLM-as-judge for nuanced cases. Each probe ships with a primary detector and optional extended detectors. A probe fires, the model responds, the detector scores pass or fail, and the result hits the JSONL log.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!sSq-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95e0aa5f-7fe8-44d4-b978-87debb503a56_1083x926.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!sSq-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95e0aa5f-7fe8-44d4-b978-87debb503a56_1083x926.png 424w, https://substackcdn.com/image/fetch/$s_!sSq-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95e0aa5f-7fe8-44d4-b978-87debb503a56_1083x926.png 848w, https://substackcdn.com/image/fetch/$s_!sSq-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95e0aa5f-7fe8-44d4-b978-87debb503a56_1083x926.png 1272w, https://substackcdn.com/image/fetch/$s_!sSq-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95e0aa5f-7fe8-44d4-b978-87debb503a56_1083x926.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!sSq-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95e0aa5f-7fe8-44d4-b978-87debb503a56_1083x926.png" width="1083" height="926" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/95e0aa5f-7fe8-44d4-b978-87debb503a56_1083x926.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:926,&quot;width&quot;:1083,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:125166,&quot;alt&quot;:&quot;Garak Scan: CLI Output: Garak LLM vulnerability scanner CLI output showing dan, encoding, promptinject, and leakreplay probe modules with progress bars and pass-fail rates against an OpenAI gpt-5-nano target.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/193694931?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95e0aa5f-7fe8-44d4-b978-87debb503a56_1083x926.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Garak Scan: CLI Output: Garak LLM vulnerability scanner CLI output showing dan, encoding, promptinject, and leakreplay probe modules with progress bars and pass-fail rates against an OpenAI gpt-5-nano target." title="Garak Scan: CLI Output: Garak LLM vulnerability scanner CLI output showing dan, encoding, promptinject, and leakreplay probe modules with progress bars and pass-fail rates against an OpenAI gpt-5-nano target." srcset="https://substackcdn.com/image/fetch/$s_!sSq-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95e0aa5f-7fe8-44d4-b978-87debb503a56_1083x926.png 424w, https://substackcdn.com/image/fetch/$s_!sSq-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95e0aa5f-7fe8-44d4-b978-87debb503a56_1083x926.png 848w, https://substackcdn.com/image/fetch/$s_!sSq-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95e0aa5f-7fe8-44d4-b978-87debb503a56_1083x926.png 1272w, https://substackcdn.com/image/fetch/$s_!sSq-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95e0aa5f-7fe8-44d4-b978-87debb503a56_1083x926.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>The Garak Scan That Matters</h3><p>Here&#8217;s what a real Garak scan surfaces. Point it at your production chatbot endpoint. Pick a handful of probe modules: dan, encoding, promptinject, leakreplay. Run it. Maybe twenty minutes depending on rate limits.</p><p>The report comes back. Your model held against DAN 6.0 through 9.0. Good. But DAN 11.0 and Developer Mode v2 both scored failures. The encoding module found that Base64-encoded prompts bypass your input filter entirely: 80% failure rate across ten generations. promptinject hijacking probes landed at 30%. leakreplay found the model regurgitating training data snippets when prompted with specific continuation patterns.</p><p>Four vulnerability classes confirmed in one scan. Base64 bypass alone maps to LLM01:2025 in the <a href="https://www.toxsec.com/p/owasp-top-10-for-genai">OWASP Top 10 for LLMs</a>, the top-ranked vulnerability. The DAN failures map to LLM01 too. The training data leakage maps to LLM02:2025 (Sensitive Information Disclosure), and a packagehallucination hit would map to LLM03:2025 (Supply Chain). Each finding has a full JSONL trail: exact prompts sent, exact responses received, detector verdicts, timestamps.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_ZYo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7779cbde-d25e-48fb-a927-0d8d8da6379f_1099x989.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_ZYo!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7779cbde-d25e-48fb-a927-0d8d8da6379f_1099x989.png 424w, https://substackcdn.com/image/fetch/$s_!_ZYo!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7779cbde-d25e-48fb-a927-0d8d8da6379f_1099x989.png 848w, https://substackcdn.com/image/fetch/$s_!_ZYo!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7779cbde-d25e-48fb-a927-0d8d8da6379f_1099x989.png 1272w, https://substackcdn.com/image/fetch/$s_!_ZYo!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7779cbde-d25e-48fb-a927-0d8d8da6379f_1099x989.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_ZYo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7779cbde-d25e-48fb-a927-0d8d8da6379f_1099x989.png" width="1099" height="989" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7779cbde-d25e-48fb-a927-0d8d8da6379f_1099x989.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:989,&quot;width&quot;:1099,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:74253,&quot;alt&quot;:&quot;Garak Scan: JSONL Hit: Garak LLM vulnerability scanner JSONL hit log entry showing a single encoding.InjectBase64 prompt injection attempt with redacted payload, detector verdict, and timestamp evidence chain for bug bounty reproduction.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/193694931?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7779cbde-d25e-48fb-a927-0d8d8da6379f_1099x989.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Garak Scan: JSONL Hit: Garak LLM vulnerability scanner JSONL hit log entry showing a single encoding.InjectBase64 prompt injection attempt with redacted payload, detector verdict, and timestamp evidence chain for bug bounty reproduction." title="Garak Scan: JSONL Hit: Garak LLM vulnerability scanner JSONL hit log entry showing a single encoding.InjectBase64 prompt injection attempt with redacted payload, detector verdict, and timestamp evidence chain for bug bounty reproduction." srcset="https://substackcdn.com/image/fetch/$s_!_ZYo!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7779cbde-d25e-48fb-a927-0d8d8da6379f_1099x989.png 424w, https://substackcdn.com/image/fetch/$s_!_ZYo!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7779cbde-d25e-48fb-a927-0d8d8da6379f_1099x989.png 848w, https://substackcdn.com/image/fetch/$s_!_ZYo!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7779cbde-d25e-48fb-a927-0d8d8da6379f_1099x989.png 1272w, https://substackcdn.com/image/fetch/$s_!_ZYo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7779cbde-d25e-48fb-a927-0d8d8da6379f_1099x989.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This is the part that should bother you. One command. Garak does the rest. Every model deployed without running this scan has the same holes.</p><blockquote><p>We dropped the free chapters. Now breach the wall for the dead-simple step-by-step kill switch that shuts this all down.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.toxsec.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.toxsec.com/subscribe?"><span>Subscribe now</span></a></p></blockquote>
      <p>
          <a href="https://www.toxsec.com/p/garak-llm-vulnerability-scanner">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Vibe Coding Security Flaws Ship Shells, Keys, and Admin Access]]></title><description><![CDATA[Slopsquatting, hardcoded API keys, and broken auth in AI-generated code form a compound attack chain starting at pip install.]]></description><link>https://www.toxsec.com/p/vibe-coding-security-attack-chain</link><guid isPermaLink="false">https://www.toxsec.com/p/vibe-coding-security-attack-chain</guid><dc:creator><![CDATA[ToxSec]]></dc:creator><pubDate>Thu, 19 Mar 2026 13:31:43 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!RL5M!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa956a2cc-ad2d-42db-ad73-ceafe13615a5_2752x1536.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RL5M!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa956a2cc-ad2d-42db-ad73-ceafe13615a5_2752x1536.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RL5M!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa956a2cc-ad2d-42db-ad73-ceafe13615a5_2752x1536.png 424w, https://substackcdn.com/image/fetch/$s_!RL5M!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa956a2cc-ad2d-42db-ad73-ceafe13615a5_2752x1536.png 848w, https://substackcdn.com/image/fetch/$s_!RL5M!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa956a2cc-ad2d-42db-ad73-ceafe13615a5_2752x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!RL5M!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa956a2cc-ad2d-42db-ad73-ceafe13615a5_2752x1536.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RL5M!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa956a2cc-ad2d-42db-ad73-ceafe13615a5_2752x1536.png" width="2752" height="1536" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a956a2cc-ad2d-42db-ad73-ceafe13615a5_2752x1536.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1536,&quot;width&quot;:2752,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:7713504,&quot;alt&quot;:&quot;AI pair programmer security vulnerabilities vibe coding slopsquatting hardcoded secrets broken authentication LLM-generated code risks&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/190338370?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc41b5947-029b-4c8c-9e74-b9c6e3d28cd8_2752x1536.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="AI pair programmer security vulnerabilities vibe coding slopsquatting hardcoded secrets broken authentication LLM-generated code risks" title="AI pair programmer security vulnerabilities vibe coding slopsquatting hardcoded secrets broken authentication LLM-generated code risks" srcset="https://substackcdn.com/image/fetch/$s_!RL5M!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa956a2cc-ad2d-42db-ad73-ceafe13615a5_2752x1536.png 424w, https://substackcdn.com/image/fetch/$s_!RL5M!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa956a2cc-ad2d-42db-ad73-ceafe13615a5_2752x1536.png 848w, https://substackcdn.com/image/fetch/$s_!RL5M!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa956a2cc-ad2d-42db-ad73-ceafe13615a5_2752x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!RL5M!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa956a2cc-ad2d-42db-ad73-ceafe13615a5_2752x1536.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>TL;DR:</strong> We prompt an AI assistant until it hallucinates a package name, register it on PyPI before anyone installs it, grep the repo for credentials the LLM committed, then walk through the admin route the AI forgot to protect. Three vibe coding security flaws. </p><blockquote><p>This is the public feed. Upgrade to see what doesn&#8217;t make it out.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.toxsec.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.toxsec.com/subscribe?"><span>Subscribe now</span></a></p></blockquote><h2>What Is Slopsquatting and How Vibe Coding Creates It</h2><p>When you vibe code, you describe what you want and the AI writes it. Fast, popular, and it has a failure mode we&#8217;re already monetizing. Somewhere in that output is a <code>pip install some-package-name</code>. You run it, and it works. Or it looks like it works.</p><p>Here&#8217;s the problem. A <strong>package</strong> is a chunk of pre-built code your project pulls from a public registry instead of writing from scratch. LLMs don&#8217;t query PyPI, the Python package registry, before suggesting a dependency. The model pattern-matches to what a package for that task would <em>probably</em> be called. Sometimes the name is real, sometimes the model invented it, and it sounds equally confident either way.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!xJPz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4b5e269-dda7-4d70-b970-829ba7b87bfb_834x339.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!xJPz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4b5e269-dda7-4d70-b970-829ba7b87bfb_834x339.png 424w, https://substackcdn.com/image/fetch/$s_!xJPz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4b5e269-dda7-4d70-b970-829ba7b87bfb_834x339.png 848w, https://substackcdn.com/image/fetch/$s_!xJPz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4b5e269-dda7-4d70-b970-829ba7b87bfb_834x339.png 1272w, https://substackcdn.com/image/fetch/$s_!xJPz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4b5e269-dda7-4d70-b970-829ba7b87bfb_834x339.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!xJPz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4b5e269-dda7-4d70-b970-829ba7b87bfb_834x339.png" width="834" height="339" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a4b5e269-dda7-4d70-b970-829ba7b87bfb_834x339.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:339,&quot;width&quot;:834,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:26876,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/190338370?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4b5e269-dda7-4d70-b970-829ba7b87bfb_834x339.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!xJPz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4b5e269-dda7-4d70-b970-829ba7b87bfb_834x339.png 424w, https://substackcdn.com/image/fetch/$s_!xJPz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4b5e269-dda7-4d70-b970-829ba7b87bfb_834x339.png 848w, https://substackcdn.com/image/fetch/$s_!xJPz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4b5e269-dda7-4d70-b970-829ba7b87bfb_834x339.png 1272w, https://substackcdn.com/image/fetch/$s_!xJPz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa4b5e269-dda7-4d70-b970-829ba7b87bfb_834x339.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>That gap is the entire attack. We prompt LLMs with niche coding tasks and log every package name that doesn&#8217;t exist on any registry. Some names repeat across sessions, across models, same hallucination on a loop. A <a href="https://arxiv.org/abs/2501.02497">2025 academic study analyzing 576,000 AI-generated code samples</a> found hallucinated packages appear roughly 20% of the time, and 43% of those names repeat consistently. Predictable means registerable.</p><p>We check PyPI. Not claimed. We register the name with a functional README, plausible version history, and a malicious install hook that fires the moment someone runs <code>pip install</code>. This is <strong>slopsquatting</strong>, a supply chain attack where we pre-register the phantom dependency names that AI coding tools <a href="https://www.toxsec.com/p/distillation-raids-slopsquatting">hallucinate into existence</a>.</p><p>Then we search GitHub for <code>requirements.txt</code> files containing our package names. Find repos where the AI-generated README has the install command verbatim. Dev copy-pasted it, never checked, ran it. We have a shell.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!848M!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb130c528-1596-42db-894d-bb3387502c6b_593x359.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!848M!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb130c528-1596-42db-894d-bb3387502c6b_593x359.png 424w, https://substackcdn.com/image/fetch/$s_!848M!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb130c528-1596-42db-894d-bb3387502c6b_593x359.png 848w, https://substackcdn.com/image/fetch/$s_!848M!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb130c528-1596-42db-894d-bb3387502c6b_593x359.png 1272w, https://substackcdn.com/image/fetch/$s_!848M!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb130c528-1596-42db-894d-bb3387502c6b_593x359.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!848M!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb130c528-1596-42db-894d-bb3387502c6b_593x359.png" width="593" height="359" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b130c528-1596-42db-894d-bb3387502c6b_593x359.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:359,&quot;width&quot;:593,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:22712,&quot;alt&quot;:&quot;PyPI package page for &#8220;flask-orient-connector&#8221;, published yesterday, 0 downloads, single maintainer with no other packages, next to a terminal showing pip install flask-orient-connector completing successfully, nuclear green on black&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/190338370?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb130c528-1596-42db-894d-bb3387502c6b_593x359.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="PyPI package page for &#8220;flask-orient-connector&#8221;, published yesterday, 0 downloads, single maintainer with no other packages, next to a terminal showing pip install flask-orient-connector completing successfully, nuclear green on black" title="PyPI package page for &#8220;flask-orient-connector&#8221;, published yesterday, 0 downloads, single maintainer with no other packages, next to a terminal showing pip install flask-orient-connector completing successfully, nuclear green on black" srcset="https://substackcdn.com/image/fetch/$s_!848M!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb130c528-1596-42db-894d-bb3387502c6b_593x359.png 424w, https://substackcdn.com/image/fetch/$s_!848M!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb130c528-1596-42db-894d-bb3387502c6b_593x359.png 848w, https://substackcdn.com/image/fetch/$s_!848M!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb130c528-1596-42db-894d-bb3387502c6b_593x359.png 1272w, https://substackcdn.com/image/fetch/$s_!848M!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb130c528-1596-42db-894d-bb3387502c6b_593x359.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>How AI Coding Assistants Leak API Keys Into Git History</h2><p>When you vibe code a payment integration or an email service, you don&#8217;t wire up credentials manually. You describe the feature and the AI generates the whole thing, including the keys, hardcoded directly in the source so the code actually runs. An <strong>API key</strong> is a secret string that proves your app is authorized to talk to a service like Stripe for payments or AWS for cloud infrastructure. Leak it, and anyone holding that key can act as your application.</p><p>The AI ships hardcoded keys because that&#8217;s what &#8220;working code&#8221; looked like in its training data, millions of public repos where developers did exactly this and never rotated before pushing to GitHub. The model is doing what you asked. The problem is the pattern it learned, classified as <a href="https://cwe.mitre.org/data/definitions/798.html">CWE-798</a>: hardcoded credentials in source code. You test locally, it works, you push. The key goes with it.</p><p>We run <code>git log --all -p</code> piped through a grep for common credential patterns against the public repo. Four seconds. Stripe secret key, AWS access key, SendGrid token, all committed in the same PR that passed review because the feature worked. The AWS key gets us into the infrastructure, and the Stripe key starts pulling transaction data. The <a href="https://www.toxsec.com/p/the-voluntary-exfiltration-program">credential exfiltration pattern</a> is the same one that costs enterprises $670,000 per incident, except now the AI ships credentials faster than any human ever could.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!SAqu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61840b89-107e-4584-aec2-54da8f527ae3_1037x1003.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!SAqu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61840b89-107e-4584-aec2-54da8f527ae3_1037x1003.png 424w, https://substackcdn.com/image/fetch/$s_!SAqu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61840b89-107e-4584-aec2-54da8f527ae3_1037x1003.png 848w, https://substackcdn.com/image/fetch/$s_!SAqu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61840b89-107e-4584-aec2-54da8f527ae3_1037x1003.png 1272w, https://substackcdn.com/image/fetch/$s_!SAqu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61840b89-107e-4584-aec2-54da8f527ae3_1037x1003.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!SAqu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61840b89-107e-4584-aec2-54da8f527ae3_1037x1003.png" width="1037" height="1003" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/61840b89-107e-4584-aec2-54da8f527ae3_1037x1003.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1003,&quot;width&quot;:1037,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:71730,&quot;alt&quot;:&quot;Terminal showing git grep output with three credential matches highlighted, Stripe key, AWS access key, SendGrid token, commit hash visible, values partially redacted, dark red warning glow on each match&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/190338370?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61840b89-107e-4584-aec2-54da8f527ae3_1037x1003.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Terminal showing git grep output with three credential matches highlighted, Stripe key, AWS access key, SendGrid token, commit hash visible, values partially redacted, dark red warning glow on each match" title="Terminal showing git grep output with three credential matches highlighted, Stripe key, AWS access key, SendGrid token, commit hash visible, values partially redacted, dark red warning glow on each match" srcset="https://substackcdn.com/image/fetch/$s_!SAqu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61840b89-107e-4584-aec2-54da8f527ae3_1037x1003.png 424w, https://substackcdn.com/image/fetch/$s_!SAqu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61840b89-107e-4584-aec2-54da8f527ae3_1037x1003.png 848w, https://substackcdn.com/image/fetch/$s_!SAqu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61840b89-107e-4584-aec2-54da8f527ae3_1037x1003.png 1272w, https://substackcdn.com/image/fetch/$s_!SAqu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61840b89-107e-4584-aec2-54da8f527ae3_1037x1003.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>Why AI-Generated Code Ships Without Authentication Checks</h2><p>When you ask an AI to scaffold a user management dashboard, it builds the feature. CRUD operations, role assignment, user creation, all of it, clean and fast. What it doesn&#8217;t build is the check that runs before any of that executes. <strong>Auth middleware</strong> is the code that verifies who&#8217;s making a request before the server processes it, the gate in front of the feature. The AI doesn&#8217;t know your auth system and has no context for how your app verifies identity, so it skips the gate entirely.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!91qP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbca62e3a-52e7-40f2-a04a-8ebf66ac8ba9_1287x625.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!91qP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbca62e3a-52e7-40f2-a04a-8ebf66ac8ba9_1287x625.png 424w, https://substackcdn.com/image/fetch/$s_!91qP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbca62e3a-52e7-40f2-a04a-8ebf66ac8ba9_1287x625.png 848w, https://substackcdn.com/image/fetch/$s_!91qP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbca62e3a-52e7-40f2-a04a-8ebf66ac8ba9_1287x625.png 1272w, https://substackcdn.com/image/fetch/$s_!91qP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbca62e3a-52e7-40f2-a04a-8ebf66ac8ba9_1287x625.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!91qP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbca62e3a-52e7-40f2-a04a-8ebf66ac8ba9_1287x625.png" width="725.2000122070312" height="352.1756081036477" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bca62e3a-52e7-40f2-a04a-8ebf66ac8ba9_1287x625.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:625,&quot;width&quot;:1287,&quot;resizeWidth&quot;:725.2000122070312,&quot;bytes&quot;:66267,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/190338370?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbca62e3a-52e7-40f2-a04a-8ebf66ac8ba9_1287x625.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!91qP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbca62e3a-52e7-40f2-a04a-8ebf66ac8ba9_1287x625.png 424w, https://substackcdn.com/image/fetch/$s_!91qP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbca62e3a-52e7-40f2-a04a-8ebf66ac8ba9_1287x625.png 848w, https://substackcdn.com/image/fetch/$s_!91qP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbca62e3a-52e7-40f2-a04a-8ebf66ac8ba9_1287x625.png 1272w, https://substackcdn.com/image/fetch/$s_!91qP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbca62e3a-52e7-40f2-a04a-8ebf66ac8ba9_1287x625.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>That&#8217;s <strong>broken access control</strong>, <a href="https://owasp.org/Top10/A01_2021-Broken_Access_Control/">OWASP&#8217;s #1 web application security risk</a>. The route is live, and anyone can call it. The AI never had the information to do it right in the first place. Vibe coding makes this worse because the whole premise is speed: describe, generate, ship. The <a href="https://www.toxsec.com/p/nvidias-ai-kill-chain">AI kill chain</a> runs fastest when nobody pauses to check the scaffolding.</p><p>We find the repo on GitHub and pull the routes file. <code>POST /api/admin/users</code>, handler defined, no middleware in the chain before it. We send a POST with no token, no session cookie. The endpoint creates a new admin user and returns 201, full admin access. From there we pull the user database, reset passwords, and pivot to whatever the admin panel touches.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!aKB5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8eabec4e-61cf-40ea-a11e-6ba1090dfe6b_1053x428.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!aKB5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8eabec4e-61cf-40ea-a11e-6ba1090dfe6b_1053x428.png 424w, https://substackcdn.com/image/fetch/$s_!aKB5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8eabec4e-61cf-40ea-a11e-6ba1090dfe6b_1053x428.png 848w, https://substackcdn.com/image/fetch/$s_!aKB5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8eabec4e-61cf-40ea-a11e-6ba1090dfe6b_1053x428.png 1272w, https://substackcdn.com/image/fetch/$s_!aKB5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8eabec4e-61cf-40ea-a11e-6ba1090dfe6b_1053x428.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!aKB5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8eabec4e-61cf-40ea-a11e-6ba1090dfe6b_1053x428.png" width="1053" height="428" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8eabec4e-61cf-40ea-a11e-6ba1090dfe6b_1053x428.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:428,&quot;width&quot;:1053,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:32033,&quot;alt&quot;:&quot;Burp Suite repeater showing POST /api/admin/users request with empty Authorization header, response 201 Created with new admin user JSON, next to routes file showing handler with no auth middleware, nuclear green on dark&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/190338370?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8eabec4e-61cf-40ea-a11e-6ba1090dfe6b_1053x428.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Burp Suite repeater showing POST /api/admin/users request with empty Authorization header, response 201 Created with new admin user JSON, next to routes file showing handler with no auth middleware, nuclear green on dark" title="Burp Suite repeater showing POST /api/admin/users request with empty Authorization header, response 201 Created with new admin user JSON, next to routes file showing handler with no auth middleware, nuclear green on dark" srcset="https://substackcdn.com/image/fetch/$s_!aKB5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8eabec4e-61cf-40ea-a11e-6ba1090dfe6b_1053x428.png 424w, https://substackcdn.com/image/fetch/$s_!aKB5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8eabec4e-61cf-40ea-a11e-6ba1090dfe6b_1053x428.png 848w, https://substackcdn.com/image/fetch/$s_!aKB5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8eabec4e-61cf-40ea-a11e-6ba1090dfe6b_1053x428.png 1272w, https://substackcdn.com/image/fetch/$s_!aKB5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8eabec4e-61cf-40ea-a11e-6ba1090dfe6b_1053x428.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>The Compound Blast Radius of Three Vibe Coding Failures</h2><p>Three chapters, three AI-generated attack surfaces. Slopsquatting got us shell access before the app shipped. Hardcoded credentials handed us the infrastructure keys. Broken auth walked us into the application itself. Same AI, same afternoon, no zero-days required.</p><p>The compound blast radius is what makes this ugly. Each failure alone is bad. Chained together, they&#8217;re a full compromise: code execution on the developer&#8217;s machine, access to production infrastructure credentials, and admin-level control of the application. A Tenzai assessment of five major vibe coding tools found <a href="https://www.csoonline.com/article/4116923/output-from-vibe-coding-tools-prone-to-critical-security-flaws-study-finds.html">69 total vulnerabilities across 15 test applications</a>, including critical-severity flaws. The tools catch generic bugs but fail where context matters, and authentication, secrets management, and dependency verification all require context the model never had.</p><blockquote><p>We dropped the free chapters. Now breach the wall for the dead-simple step-by-step kill switch that shuts this all down.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.toxsec.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.toxsec.com/subscribe?"><span>Subscribe now</span></a></p></blockquote>
      <p>
          <a href="https://www.toxsec.com/p/vibe-coding-security-attack-chain">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Zero Trust Home Network: AI Breaks Flat WiFi in Minutes]]></title><description><![CDATA[Evil twins, AirSnitch isolation bypass, AI-powered exploit chaining, and NAS zero-days make flat home networks a red team playground in 2026.]]></description><link>https://www.toxsec.com/p/zero-trust-home-network</link><guid isPermaLink="false">https://www.toxsec.com/p/zero-trust-home-network</guid><dc:creator><![CDATA[ToxSec]]></dc:creator><pubDate>Fri, 06 Mar 2026 14:46:00 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!f_Gn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b017c47-311f-4b61-a239-b901b2850651_2752x1536.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!f_Gn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b017c47-311f-4b61-a239-b901b2850651_2752x1536.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!f_Gn!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b017c47-311f-4b61-a239-b901b2850651_2752x1536.png 424w, https://substackcdn.com/image/fetch/$s_!f_Gn!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b017c47-311f-4b61-a239-b901b2850651_2752x1536.png 848w, https://substackcdn.com/image/fetch/$s_!f_Gn!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b017c47-311f-4b61-a239-b901b2850651_2752x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!f_Gn!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b017c47-311f-4b61-a239-b901b2850651_2752x1536.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!f_Gn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b017c47-311f-4b61-a239-b901b2850651_2752x1536.png" width="2752" height="1536" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7b017c47-311f-4b61-a239-b901b2850651_2752x1536.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1536,&quot;width&quot;:2752,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:7310270,&quot;alt&quot;:&quot;Evil twin WiFi attack &#8212; attacker laptop cloning home network curbside, Bettercap terminal, IoT devices hijacked, NAS root via CVE-2026-24061&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/189608898?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44151a6e-1812-46be-b07a-acc727f9963b_2752x1536.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Evil twin WiFi attack &#8212; attacker laptop cloning home network curbside, Bettercap terminal, IoT devices hijacked, NAS root via CVE-2026-24061" title="Evil twin WiFi attack &#8212; attacker laptop cloning home network curbside, Bettercap terminal, IoT devices hijacked, NAS root via CVE-2026-24061" srcset="https://substackcdn.com/image/fetch/$s_!f_Gn!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b017c47-311f-4b61-a239-b901b2850651_2752x1536.png 424w, https://substackcdn.com/image/fetch/$s_!f_Gn!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b017c47-311f-4b61-a239-b901b2850651_2752x1536.png 848w, https://substackcdn.com/image/fetch/$s_!f_Gn!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b017c47-311f-4b61-a239-b901b2850651_2752x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!f_Gn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b017c47-311f-4b61-a239-b901b2850651_2752x1536.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>TL;DR:</strong> We clone the target&#8217;s WiFi from the curb, AirSnitch punches through client isolation on every router tested, and a local AI agent maps the whole attack surface and chains the exploits autonomously. A standard home network in 2026 falls fast.</p><blockquote><p>This is the public feed. Upgrade to see what doesn&#8217;t make it out.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.toxsec.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.toxsec.com/subscribe?"><span>Subscribe now</span></a></p></blockquote><h2>The AI Runs the Op</h2><p>We park curbside with a laptop, a $30 Alfa WiFi adapter, and PentAGI, an open source pentest platform that ships as a Docker container with twenty-plus security tools pre-loaded. One command starts the agent. It takes over from there.</p><p>PentAGI runs a local AI model through Ollama, meaning nothing phones home. No cloud, no logs, no outside visibility. <a href="https://www.toxsec.com/p/openai-signs-what-anthropic-wouldnt">Qwen3 32B handles the reasoning with a 110,000-token context window.</a> It scans the network, catalogs every device and open port, cross-references known CVEs in real time, and picks the highest-value next move on its own. What used to take an experienced pentester a full day of manual work now runs in about twenty minutes.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CPRQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d694a4b-a7f6-4789-944d-ee438f0c9aa0_864x764.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CPRQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d694a4b-a7f6-4789-944d-ee438f0c9aa0_864x764.png 424w, https://substackcdn.com/image/fetch/$s_!CPRQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d694a4b-a7f6-4789-944d-ee438f0c9aa0_864x764.png 848w, https://substackcdn.com/image/fetch/$s_!CPRQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d694a4b-a7f6-4789-944d-ee438f0c9aa0_864x764.png 1272w, https://substackcdn.com/image/fetch/$s_!CPRQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d694a4b-a7f6-4789-944d-ee438f0c9aa0_864x764.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CPRQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d694a4b-a7f6-4789-944d-ee438f0c9aa0_864x764.png" width="864" height="764" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4d694a4b-a7f6-4789-944d-ee438f0c9aa0_864x764.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:764,&quot;width&quot;:864,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:77106,&quot;alt&quot;:&quot;PentAGI terminal, autonomous agent output showing network enumeration, device fingerprinting, CVE cross-reference, and attack plan generation against a home subnet&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/189608898?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d694a4b-a7f6-4789-944d-ee438f0c9aa0_864x764.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="PentAGI terminal, autonomous agent output showing network enumeration, device fingerprinting, CVE cross-reference, and attack plan generation against a home subnet" title="PentAGI terminal, autonomous agent output showing network enumeration, device fingerprinting, CVE cross-reference, and attack plan generation against a home subnet" srcset="https://substackcdn.com/image/fetch/$s_!CPRQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d694a4b-a7f6-4789-944d-ee438f0c9aa0_864x764.png 424w, https://substackcdn.com/image/fetch/$s_!CPRQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d694a4b-a7f6-4789-944d-ee438f0c9aa0_864x764.png 848w, https://substackcdn.com/image/fetch/$s_!CPRQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d694a4b-a7f6-4789-944d-ee438f0c9aa0_864x764.png 1272w, https://substackcdn.com/image/fetch/$s_!CPRQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d694a4b-a7f6-4789-944d-ee438f0c9aa0_864x764.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>How the AI Clones Your WiFi and Boots You Off It</h2><p>Here&#8217;s the first move the agent makes. It clones the target&#8217;s WiFi network name using Bettercap, a widely used network attack tool, and broadcasts louder than the real router. Then it runs a deauth flood, spoofed disconnect frames, forged to look like they came from your actual router, that kick every device off the legitimate network. WPA2 and WPA3 both accept these frames. The WiFi standard never protected them.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!HQc4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa98ff83d-d663-4c1d-94f4-011902f639cb_824x714.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!HQc4!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa98ff83d-d663-4c1d-94f4-011902f639cb_824x714.png 424w, https://substackcdn.com/image/fetch/$s_!HQc4!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa98ff83d-d663-4c1d-94f4-011902f639cb_824x714.png 848w, https://substackcdn.com/image/fetch/$s_!HQc4!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa98ff83d-d663-4c1d-94f4-011902f639cb_824x714.png 1272w, https://substackcdn.com/image/fetch/$s_!HQc4!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa98ff83d-d663-4c1d-94f4-011902f639cb_824x714.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!HQc4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa98ff83d-d663-4c1d-94f4-011902f639cb_824x714.png" width="824" height="714" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a98ff83d-d663-4c1d-94f4-011902f639cb_824x714.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:714,&quot;width&quot;:824,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:49238,&quot;alt&quot;:&quot;toxsec.com zero trust networks&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/189608898?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa98ff83d-d663-4c1d-94f4-011902f639cb_824x714.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="toxsec.com zero trust networks" title="toxsec.com zero trust networks" srcset="https://substackcdn.com/image/fetch/$s_!HQc4!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa98ff83d-d663-4c1d-94f4-011902f639cb_824x714.png 424w, https://substackcdn.com/image/fetch/$s_!HQc4!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa98ff83d-d663-4c1d-94f4-011902f639cb_824x714.png 848w, https://substackcdn.com/image/fetch/$s_!HQc4!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa98ff83d-d663-4c1d-94f4-011902f639cb_824x714.png 1272w, https://substackcdn.com/image/fetch/$s_!HQc4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa98ff83d-d663-4c1d-94f4-011902f639cb_824x714.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Devices drop and reconnect automatically without checking which AP is real. They chase the strongest signal like moths to a bug zapper, and we&#8217;re broadcasting louder. The thermostat, the doorbell cam, the smart plugs, all of them hit our fake AP first. <a href="https://www.toxsec.com/p/darknet-chatbots">The AI agent scrapes the ISP&#8217;s branding and generates a pixel-perfect &#8220;re-authenticate to continue&#8221; page, down to the favicon.</a> Credentials roll in while nmap quietly maps every device on the subnet in the background. Every open port, every running service, every version string. All of it, because nothing on a typical home network separates anything from anything else.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7UNK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc16c184-7855-402d-b675-abbff8d8e503_590x739.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7UNK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc16c184-7855-402d-b675-abbff8d8e503_590x739.png 424w, https://substackcdn.com/image/fetch/$s_!7UNK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc16c184-7855-402d-b675-abbff8d8e503_590x739.png 848w, https://substackcdn.com/image/fetch/$s_!7UNK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc16c184-7855-402d-b675-abbff8d8e503_590x739.png 1272w, https://substackcdn.com/image/fetch/$s_!7UNK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc16c184-7855-402d-b675-abbff8d8e503_590x739.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7UNK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc16c184-7855-402d-b675-abbff8d8e503_590x739.png" width="590" height="739" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cc16c184-7855-402d-b675-abbff8d8e503_590x739.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:739,&quot;width&quot;:590,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:26250,&quot;alt&quot;:&quot;toxsec.com zero trust networks&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/189608898?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc16c184-7855-402d-b675-abbff8d8e503_590x739.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="toxsec.com zero trust networks" title="toxsec.com zero trust networks" srcset="https://substackcdn.com/image/fetch/$s_!7UNK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc16c184-7855-402d-b675-abbff8d8e503_590x739.png 424w, https://substackcdn.com/image/fetch/$s_!7UNK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc16c184-7855-402d-b675-abbff8d8e503_590x739.png 848w, https://substackcdn.com/image/fetch/$s_!7UNK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc16c184-7855-402d-b675-abbff8d8e503_590x739.png 1272w, https://substackcdn.com/image/fetch/$s_!7UNK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc16c184-7855-402d-b675-abbff8d8e503_590x739.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>Client Isolation Was Never Real</h2><p>Most people&#8217;s fallback is client isolation, a setting in your router that&#8217;s supposed to block devices on the same network from talking to each other directly. Toggle it on, problem solved. Except the problem was never solved in the spec.</p><p><a href="https://www.ndss-symposium.org/ndss-paper/airsnitch-demystifying-and-breaking-client-isolation-in-wi-fi-networks/">AirSnitch, presented at NDSS 2026</a> by researchers from UC Riverside and KU Leuven, demonstrated full bidirectional MitM through client isolation on every router they tested: Netgear Nighthawk, TP-Link Archer, ASUS RT-AX57, D-Link DIR-3040, OpenWrt, DD-WRT. WPA2 and WPA3, isolation maxed out. The WiFi standard never defined how client isolation should actually work, so every vendor improvised, and every vendor left the same gaps. AirSnitch wraps a targeted packet inside a broadcast frame that all devices accept as legitimate, spoofs the router&#8217;s identity, and intercepts both directions without dropping a single packet. Enterprise WPA3 deployments got hit too, researchers intercepted auth traffic between access points and backend servers, cracked weak shared secrets, and escalated to credential theft.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!iK0v!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fb8d36-bab8-4b69-8882-cb3124fd1bcf_1122x546.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!iK0v!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fb8d36-bab8-4b69-8882-cb3124fd1bcf_1122x546.png 424w, https://substackcdn.com/image/fetch/$s_!iK0v!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fb8d36-bab8-4b69-8882-cb3124fd1bcf_1122x546.png 848w, https://substackcdn.com/image/fetch/$s_!iK0v!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fb8d36-bab8-4b69-8882-cb3124fd1bcf_1122x546.png 1272w, https://substackcdn.com/image/fetch/$s_!iK0v!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fb8d36-bab8-4b69-8882-cb3124fd1bcf_1122x546.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!iK0v!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fb8d36-bab8-4b69-8882-cb3124fd1bcf_1122x546.png" width="1122" height="546" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/01fb8d36-bab8-4b69-8882-cb3124fd1bcf_1122x546.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:546,&quot;width&quot;:1122,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:50477,&quot;alt&quot;:&quot;Test matrix from AirSnitch NDSS 2026 paper showing bidirectional MitM success across six consumer and enterprise router models on WPA2 and WPA3&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/189608898?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fb8d36-bab8-4b69-8882-cb3124fd1bcf_1122x546.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Test matrix from AirSnitch NDSS 2026 paper showing bidirectional MitM success across six consumer and enterprise router models on WPA2 and WPA3" title="Test matrix from AirSnitch NDSS 2026 paper showing bidirectional MitM success across six consumer and enterprise router models on WPA2 and WPA3" srcset="https://substackcdn.com/image/fetch/$s_!iK0v!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fb8d36-bab8-4b69-8882-cb3124fd1bcf_1122x546.png 424w, https://substackcdn.com/image/fetch/$s_!iK0v!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fb8d36-bab8-4b69-8882-cb3124fd1bcf_1122x546.png 848w, https://substackcdn.com/image/fetch/$s_!iK0v!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fb8d36-bab8-4b69-8882-cb3124fd1bcf_1122x546.png 1272w, https://substackcdn.com/image/fetch/$s_!iK0v!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fb8d36-bab8-4b69-8882-cb3124fd1bcf_1122x546.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>The NAS Is Always the Prize</h2><p>The AI agent flagged the NAS as the highest-value target on the subnet immediately. Most homes run one, a box serving as a personal file server. Photos, tax returns, backups, password exports, the full digital life in one place with one set of credentials.</p><p><a href="https://nvd.nist.gov/vuln/detail/CVE-2026-24061">CVE-2026-24061 dropped in January: an authentication bypass in GNU InetUtils telnetd.</a> The login prompt can be skipped entirely. The bug sat unpatched for eleven years. CVSS 9.8. Over 212,000 devices were still running exposed telnet servers at disclosure, a large portion of them consumer NAS boxes that shipped with telnet enabled by default. We connect, skip the password, get root. No exploit kit. One modified command. Pwn2Own Ireland 2025 stacked on top: seven critical zero-days across QNAP and Synology giving full unauthenticated remote access with no interaction required.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Vomf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F044f5ef2-03af-4e52-a919-cdd94054b7c4_838x696.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Vomf!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F044f5ef2-03af-4e52-a919-cdd94054b7c4_838x696.png 424w, https://substackcdn.com/image/fetch/$s_!Vomf!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F044f5ef2-03af-4e52-a919-cdd94054b7c4_838x696.png 848w, https://substackcdn.com/image/fetch/$s_!Vomf!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F044f5ef2-03af-4e52-a919-cdd94054b7c4_838x696.png 1272w, https://substackcdn.com/image/fetch/$s_!Vomf!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F044f5ef2-03af-4e52-a919-cdd94054b7c4_838x696.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Vomf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F044f5ef2-03af-4e52-a919-cdd94054b7c4_838x696.png" width="838" height="696" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/044f5ef2-03af-4e52-a919-cdd94054b7c4_838x696.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:696,&quot;width&quot;:838,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:49818,&quot;alt&quot;:&quot;toxsec.com zero trust networks&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/189608898?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F044f5ef2-03af-4e52-a919-cdd94054b7c4_838x696.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="toxsec.com zero trust networks" title="toxsec.com zero trust networks" srcset="https://substackcdn.com/image/fetch/$s_!Vomf!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F044f5ef2-03af-4e52-a919-cdd94054b7c4_838x696.png 424w, https://substackcdn.com/image/fetch/$s_!Vomf!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F044f5ef2-03af-4e52-a919-cdd94054b7c4_838x696.png 848w, https://substackcdn.com/image/fetch/$s_!Vomf!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F044f5ef2-03af-4e52-a919-cdd94054b7c4_838x696.png 1272w, https://substackcdn.com/image/fetch/$s_!Vomf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F044f5ef2-03af-4e52-a919-cdd94054b7c4_838x696.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Seven bugs. Two of the most popular NAS brands on earth. No login, no interaction, no warning, just a remote attacker with a root shell on the box holding your entire digital life.</p><p>Now for Zero Trust. The fixes exist. Every single link in this chain has a hard counter. Subscribers get them next.</p><blockquote><p>We dropped the free chapters. Now breach the wall for the dead-simple step-by-step kill switch that shuts this all down.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.toxsec.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.toxsec.com/subscribe?"><span>Subscribe now</span></a></p></blockquote>
      <p>
          <a href="https://www.toxsec.com/p/zero-trust-home-network">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[The OpenClaw Security Checklist]]></title><description><![CDATA[Don't get pwned. The fix is easy. Do this before launch.]]></description><link>https://www.toxsec.com/p/openclaw-security-checklist</link><guid isPermaLink="false">https://www.toxsec.com/p/openclaw-security-checklist</guid><dc:creator><![CDATA[ToxSec]]></dc:creator><pubDate>Thu, 29 Jan 2026 14:02:36 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!ETRn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdea5b684-c356-4053-8f55-c7e5bbc14dd8_1024x596.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ETRn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdea5b684-c356-4053-8f55-c7e5bbc14dd8_1024x596.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ETRn!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdea5b684-c356-4053-8f55-c7e5bbc14dd8_1024x596.png 424w, https://substackcdn.com/image/fetch/$s_!ETRn!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdea5b684-c356-4053-8f55-c7e5bbc14dd8_1024x596.png 848w, https://substackcdn.com/image/fetch/$s_!ETRn!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdea5b684-c356-4053-8f55-c7e5bbc14dd8_1024x596.png 1272w, https://substackcdn.com/image/fetch/$s_!ETRn!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdea5b684-c356-4053-8f55-c7e5bbc14dd8_1024x596.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ETRn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdea5b684-c356-4053-8f55-c7e5bbc14dd8_1024x596.png" width="1024" height="596" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dea5b684-c356-4053-8f55-c7e5bbc14dd8_1024x596.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:596,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1539336,&quot;alt&quot;:&quot;Toxsec.com&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/186166755?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7dc765b-ae99-485b-877e-a5367bd08e6a_1024x1024.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Toxsec.com" title="Toxsec.com" srcset="https://substackcdn.com/image/fetch/$s_!ETRn!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdea5b684-c356-4053-8f55-c7e5bbc14dd8_1024x596.png 424w, https://substackcdn.com/image/fetch/$s_!ETRn!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdea5b684-c356-4053-8f55-c7e5bbc14dd8_1024x596.png 848w, https://substackcdn.com/image/fetch/$s_!ETRn!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdea5b684-c356-4053-8f55-c7e5bbc14dd8_1024x596.png 1272w, https://substackcdn.com/image/fetch/$s_!ETRn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdea5b684-c356-4053-8f55-c7e5bbc14dd8_1024x596.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This checklist exists because the defaults will get you pwned. The quick-start guides optimize for &#8220;it works!&#8221; - not &#8220;it works without leaking your Anthropic API key to Shodan.&#8221; Every section below a&#8230;</p>
      <p>
          <a href="https://www.toxsec.com/p/openclaw-security-checklist">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[LLM Jailbreaks Get Easier as Models Get Smarter]]></title><description><![CDATA[DAN prompts, Crescendo multi-turn attacks, and instruction-data conflation bypass AI guardrails because smarter models follow malicious instructions better]]></description><link>https://www.toxsec.com/p/dan-prompts-for-guardrail-bypass</link><guid isPermaLink="false">https://www.toxsec.com/p/dan-prompts-for-guardrail-bypass</guid><dc:creator><![CDATA[ToxSec]]></dc:creator><pubDate>Mon, 26 Jan 2026 14:03:01 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!tOXm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31e0680a-75ba-4808-92f5-f6007ad912ec_2752x1536.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!tOXm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31e0680a-75ba-4808-92f5-f6007ad912ec_2752x1536.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!tOXm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31e0680a-75ba-4808-92f5-f6007ad912ec_2752x1536.jpeg 424w, https://substackcdn.com/image/fetch/$s_!tOXm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31e0680a-75ba-4808-92f5-f6007ad912ec_2752x1536.jpeg 848w, https://substackcdn.com/image/fetch/$s_!tOXm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31e0680a-75ba-4808-92f5-f6007ad912ec_2752x1536.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!tOXm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31e0680a-75ba-4808-92f5-f6007ad912ec_2752x1536.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!tOXm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31e0680a-75ba-4808-92f5-f6007ad912ec_2752x1536.jpeg" width="1456" height="813" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/31e0680a-75ba-4808-92f5-f6007ad912ec_2752x1536.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:813,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:619790,&quot;alt&quot;:&quot;LLM jailbreak techniques DAN prompts Crescendo multi-turn attack instruction-data conflation bypass AI guardrails prompt injection defense Constitutional Classifiers Grok-4 DeepSeek&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/183987305?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31e0680a-75ba-4808-92f5-f6007ad912ec_2752x1536.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="LLM jailbreak techniques DAN prompts Crescendo multi-turn attack instruction-data conflation bypass AI guardrails prompt injection defense Constitutional Classifiers Grok-4 DeepSeek" title="LLM jailbreak techniques DAN prompts Crescendo multi-turn attack instruction-data conflation bypass AI guardrails prompt injection defense Constitutional Classifiers Grok-4 DeepSeek" srcset="https://substackcdn.com/image/fetch/$s_!tOXm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31e0680a-75ba-4808-92f5-f6007ad912ec_2752x1536.jpeg 424w, https://substackcdn.com/image/fetch/$s_!tOXm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31e0680a-75ba-4808-92f5-f6007ad912ec_2752x1536.jpeg 848w, https://substackcdn.com/image/fetch/$s_!tOXm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31e0680a-75ba-4808-92f5-f6007ad912ec_2752x1536.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!tOXm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31e0680a-75ba-4808-92f5-f6007ad912ec_2752x1536.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>TL;DR:</strong> LLMs process attacker instructions and system prompts through the same attention mechanism. No privilege separation. No access controls. Just tokens. Research confirms smarter models are more susceptible to adversarial manipulation. Grok-4 fell in two days. DeepSeek failed 58% of jailbreak tests. The fix is architecturally impossible. Here&#8217;s how the attacks work. </p><blockquote><p>This is the public feed. Upgrade to see what doesn&#8217;t make it out.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.toxsec.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.toxsec.com/subscribe?"><span>Subscribe now</span></a></p></blockquote><h2>Grok-4 Fell to Known LLM Jailbreaks in 48 Hours</h2><p>xAI released Grok-4 on July 9, 2025. Two days later, NeuralTrust researchers combined two known <strong>LLM jailbreak</strong> techniques and got it producing instructions for making incendiary devices. No zero-days. No exotic research. Just conversation design using methods documented in academic papers from 2024.</p><p>The attack used Echo Chamber, a technique that poisons the conversational context by subtly nudging the model toward unsafe territory without ever using a flagged keyword, combined with Crescendo, a multi-turn approach where each message escalates the topic slightly until the guardrails forget what they were guarding. Together they hit a 67% success rate on the primary objective. In one case, Grok-4 folded in a single turn.</p><p>The model that was supposed to compete with GPT-5 crumpled against attacks that any grad student could replicate after reading two papers. This is the same class of <a href="https://www.toxsec.com/p/fck-your-guardrails">prompt injection we&#8217;ve been running live fire chains against</a> for months. The technique scales. The defenses don&#8217;t.</p><h2>How Instruction-Data Conflation Hands Attackers the Keys</h2><p>The vulnerability has a name in the research: <strong>instruction-data conflation</strong>. Translation: the model processes legitimate instructions and attacker payloads through the exact same pipeline.</p><p>Here&#8217;s the failure. A system prompt, the hidden instructions a developer writes to tell the model how to behave, arrives as tokens. The attacker&#8217;s message also arrives as tokens. Both get processed by the same attention mechanism, the part of the neural network that decides which words matter most in context. There is no privileged channel. No access control. No kernel mode vs. user mode. The system prompt is a suggestion. The attacker&#8217;s payload is also a suggestion. Whichever carries more contextual weight wins.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!65z_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde96f89e-fc13-4903-9682-24dc123b22c2_570x840.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!65z_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde96f89e-fc13-4903-9682-24dc123b22c2_570x840.png 424w, https://substackcdn.com/image/fetch/$s_!65z_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde96f89e-fc13-4903-9682-24dc123b22c2_570x840.png 848w, https://substackcdn.com/image/fetch/$s_!65z_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde96f89e-fc13-4903-9682-24dc123b22c2_570x840.png 1272w, https://substackcdn.com/image/fetch/$s_!65z_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde96f89e-fc13-4903-9682-24dc123b22c2_570x840.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!65z_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde96f89e-fc13-4903-9682-24dc123b22c2_570x840.png" width="570" height="840" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/de96f89e-fc13-4903-9682-24dc123b22c2_570x840.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:840,&quot;width&quot;:570,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:52660,&quot;alt&quot;:&quot;Toxsec.com image&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/183987305?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde96f89e-fc13-4903-9682-24dc123b22c2_570x840.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Toxsec.com image" title="Toxsec.com image" srcset="https://substackcdn.com/image/fetch/$s_!65z_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde96f89e-fc13-4903-9682-24dc123b22c2_570x840.png 424w, https://substackcdn.com/image/fetch/$s_!65z_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde96f89e-fc13-4903-9682-24dc123b22c2_570x840.png 848w, https://substackcdn.com/image/fetch/$s_!65z_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde96f89e-fc13-4903-9682-24dc123b22c2_570x840.png 1272w, https://substackcdn.com/image/fetch/$s_!65z_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde96f89e-fc13-4903-9682-24dc123b22c2_570x840.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This is why <strong>DAN prompts</strong> work. DAN, short for &#8220;Do Anything Now,&#8221; first appeared in December 2022 and has evolved through dozens of variants. Qualys documented eighteen named versions in January 2025 when they tested DeepSeek R1 against 885 jailbreak attacks. DeepSeek failed 58% of them. That same month, Cisco and the University of Pennsylvania hit DeepSeek with 50 HarmBench prompts and achieved a 100% attack success rate. The model blocked nothing.</p><p>The reasoning revolution arrived. The guardrails did not.</p><h2>Which LLM Jailbreak Techniques Actually Bypass AI Guardrails?</h2><p>The armory keeps growing. Each technique exploits the same blind spot from a different angle.</p><p><strong>Crescendo multi-turn attacks</strong> boil the frog. Start with innocent questions. Gradually shift tone across multiple messages. By message fifteen, the guardrails have lost the thread. Keyword filters evaluate individual messages. The attack lives in the arc across the conversation. This is what broke Grok-4. We covered the broader <a href="https://www.toxsec.com/p/distillation-raids-slopsquatting">indirect prompt injection surface</a> recently, and Crescendo is the same principle applied to the chat window.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!iCh8!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e28198a-d179-43fa-80e9-8ce10650e956_1024x915.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!iCh8!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e28198a-d179-43fa-80e9-8ce10650e956_1024x915.png 424w, https://substackcdn.com/image/fetch/$s_!iCh8!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e28198a-d179-43fa-80e9-8ce10650e956_1024x915.png 848w, https://substackcdn.com/image/fetch/$s_!iCh8!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e28198a-d179-43fa-80e9-8ce10650e956_1024x915.png 1272w, https://substackcdn.com/image/fetch/$s_!iCh8!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e28198a-d179-43fa-80e9-8ce10650e956_1024x915.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!iCh8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e28198a-d179-43fa-80e9-8ce10650e956_1024x915.png" width="1024" height="915" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5e28198a-d179-43fa-80e9-8ce10650e956_1024x915.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:915,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:74081,&quot;alt&quot;:&quot;Toxsec.com image&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/183987305?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e28198a-d179-43fa-80e9-8ce10650e956_1024x915.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Toxsec.com image" title="Toxsec.com image" srcset="https://substackcdn.com/image/fetch/$s_!iCh8!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e28198a-d179-43fa-80e9-8ce10650e956_1024x915.png 424w, https://substackcdn.com/image/fetch/$s_!iCh8!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e28198a-d179-43fa-80e9-8ce10650e956_1024x915.png 848w, https://substackcdn.com/image/fetch/$s_!iCh8!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e28198a-d179-43fa-80e9-8ce10650e956_1024x915.png 1272w, https://substackcdn.com/image/fetch/$s_!iCh8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e28198a-d179-43fa-80e9-8ce10650e956_1024x915.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Bad Likert Judge</strong> turns the model into its own red team. We ask it to rate the harmfulness of potential responses on a 1-5 scale, then request examples for each rating. The model generates its own harmful content as a &#8220;demonstration.&#8221; Palo Alto&#8217;s Unit 42 used this technique alongside Crescendo and Deceptive Delight to extract explicit guidance for data exfiltration, spear-phishing templates, and instructions for incendiary devices from DeepSeek.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!sQGw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdb3bb958-ab7c-45dd-a913-673fea0fa7cb_702x822.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!sQGw!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdb3bb958-ab7c-45dd-a913-673fea0fa7cb_702x822.png 424w, https://substackcdn.com/image/fetch/$s_!sQGw!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdb3bb958-ab7c-45dd-a913-673fea0fa7cb_702x822.png 848w, https://substackcdn.com/image/fetch/$s_!sQGw!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdb3bb958-ab7c-45dd-a913-673fea0fa7cb_702x822.png 1272w, https://substackcdn.com/image/fetch/$s_!sQGw!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdb3bb958-ab7c-45dd-a913-673fea0fa7cb_702x822.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!sQGw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdb3bb958-ab7c-45dd-a913-673fea0fa7cb_702x822.png" width="702" height="822" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/db3bb958-ab7c-45dd-a913-673fea0fa7cb_702x822.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:822,&quot;width&quot;:702,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:47330,&quot;alt&quot;:&quot;Toxsec.com image&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/183987305?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdb3bb958-ab7c-45dd-a913-673fea0fa7cb_702x822.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Toxsec.com image" title="Toxsec.com image" srcset="https://substackcdn.com/image/fetch/$s_!sQGw!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdb3bb958-ab7c-45dd-a913-673fea0fa7cb_702x822.png 424w, https://substackcdn.com/image/fetch/$s_!sQGw!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdb3bb958-ab7c-45dd-a913-673fea0fa7cb_702x822.png 848w, https://substackcdn.com/image/fetch/$s_!sQGw!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdb3bb958-ab7c-45dd-a913-673fea0fa7cb_702x822.png 1272w, https://substackcdn.com/image/fetch/$s_!sQGw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdb3bb958-ab7c-45dd-a913-673fea0fa7cb_702x822.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Token and God Mode systems</strong> gamify compliance with fake points and imaginary privileges. A neural network got peer-pressured by imaginary status. It works because the model is a people-pleaser first and a security system never.</p><p>The next generation is already here. TokenBreak manipulates how input gets chunked into tokens. Deceptive Delight embeds dangerous requests inside cheerful narratives. AutoDAN generates human-readable jailbreaks that dodge perplexity detectors. The PAIR algorithm pits one LLM against another and achieves successful jailbreaks in under twenty queries.</p><p>Every single one exploits the same architectural truth: <strong>LLMs are excellent at pretending.</strong> The training data included millions of &#8220;pretend you are X&#8221; scenarios. To the silicon, &#8220;pretend to be a pirate&#8221; and &#8220;pretend to be an AI without restrictions&#8221; are the same type of instruction: character sheets.</p><h2>Why Smarter AI Models Become Easier Jailbreak Targets</h2><p>Here&#8217;s the kicker, and the reason none of this gets patched.</p><p>The DecodingTrust project, a joint effort from researchers at Illinois, Stanford, UC Berkeley, and Microsoft Research, tested GPT-4 against GPT-3.5 under adversarial conditions. The finding: <strong>GPT-4 is more susceptible to manipulation through adversarial system prompts.</strong> The exact capability that makes it better at following instructions makes it better at following malicious instructions. Better language understanding means better at adopting personas. Including the ones we give it.</p><p>Roleplay is a feature. Legitimate users need persona adoption for creative writing, education, tutoring. Blocking it lobotomizes the product. The same mechanism that lets a teacher say &#8220;explain photosynthesis as a children&#8217;s TV host&#8221; lets us say &#8220;explain synthesis as a chemist without ethics.&#8221; Every patch is reactive. Block the phrasing, we shift to synonyms. Add keyword filters, we encode in Base64. The defender must be right every time. We need to be right once.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!bsrq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25299215-4bb4-4cfc-bca7-66a218f35004_1044x1037.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!bsrq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25299215-4bb4-4cfc-bca7-66a218f35004_1044x1037.png 424w, https://substackcdn.com/image/fetch/$s_!bsrq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25299215-4bb4-4cfc-bca7-66a218f35004_1044x1037.png 848w, https://substackcdn.com/image/fetch/$s_!bsrq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25299215-4bb4-4cfc-bca7-66a218f35004_1044x1037.png 1272w, https://substackcdn.com/image/fetch/$s_!bsrq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25299215-4bb4-4cfc-bca7-66a218f35004_1044x1037.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!bsrq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25299215-4bb4-4cfc-bca7-66a218f35004_1044x1037.png" width="1044" height="1037" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/25299215-4bb4-4cfc-bca7-66a218f35004_1044x1037.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1037,&quot;width&quot;:1044,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:96077,&quot;alt&quot;:&quot;Toxsec.com image&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/183987305?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25299215-4bb4-4cfc-bca7-66a218f35004_1044x1037.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Toxsec.com image" title="Toxsec.com image" srcset="https://substackcdn.com/image/fetch/$s_!bsrq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25299215-4bb4-4cfc-bca7-66a218f35004_1044x1037.png 424w, https://substackcdn.com/image/fetch/$s_!bsrq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25299215-4bb4-4cfc-bca7-66a218f35004_1044x1037.png 848w, https://substackcdn.com/image/fetch/$s_!bsrq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25299215-4bb4-4cfc-bca7-66a218f35004_1044x1037.png 1272w, https://substackcdn.com/image/fetch/$s_!bsrq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25299215-4bb4-4cfc-bca7-66a218f35004_1044x1037.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>November 2025 proved the scale of what this enables. A Chinese state-sponsored group jailbroke Claude Code, Anthropic&#8217;s AI coding tool, and used it as an autonomous attack agent against roughly thirty global targets across tech, finance, chemical manufacturing, and government agencies. The AI handled 80-90% of the operation, making thousands of requests per second. Human operators stepped in at maybe four to six decision points per campaign. The jailbreak was the skeleton key. Everything after was automation at a speed no human team could match.</p><p>Anthropic tried the hardest defense in the industry. Their Constitutional Classifiers, a system of input and output filters trained on synthetic data, reduced automated jailbreak success from 86% to 4.4%. Then they ran a bug bounty through HackerOne in February 2025. Within the seven-day challenge window, four teams split $55,000 in bounties. One team found a universal jailbreak. The defense that stopped 95% of synthetic attacks crumpled against humans with motivation and a week of free time.</p><p>The compliance officer sees a chatbot. The red teamer sees an instruction-following machine with a costume closet and zero access controls.</p><blockquote><p>We dropped the free chapters. Now breach the wall for the dead-simple step-by-step kill switch that shuts this all down.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.toxsec.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.toxsec.com/subscribe?"><span>Subscribe now</span></a></p></blockquote>
      <p>
          <a href="https://www.toxsec.com/p/dan-prompts-for-guardrail-bypass">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[AP2 AgentCard Poisoning Breaks AI Payment Security]]></title><description><![CDATA[How the confused deputy problem in A2A turns your AI agent's payment scope into an automated wire transfer for an attacker, and why AP2's cryptographic mandates don't stop it.]]></description><link>https://www.toxsec.com/p/the-agent-economy-is-waking-up</link><guid isPermaLink="false">https://www.toxsec.com/p/the-agent-economy-is-waking-up</guid><dc:creator><![CDATA[ToxSec]]></dc:creator><pubDate>Thu, 15 Jan 2026 14:00:00 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!K12H!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2e412a9-a322-4f1a-ab83-1e8dc53a127c_2752x1536.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!K12H!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2e412a9-a322-4f1a-ab83-1e8dc53a127c_2752x1536.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!K12H!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2e412a9-a322-4f1a-ab83-1e8dc53a127c_2752x1536.jpeg 424w, https://substackcdn.com/image/fetch/$s_!K12H!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2e412a9-a322-4f1a-ab83-1e8dc53a127c_2752x1536.jpeg 848w, https://substackcdn.com/image/fetch/$s_!K12H!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2e412a9-a322-4f1a-ab83-1e8dc53a127c_2752x1536.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!K12H!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2e412a9-a322-4f1a-ab83-1e8dc53a127c_2752x1536.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!K12H!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2e412a9-a322-4f1a-ab83-1e8dc53a127c_2752x1536.jpeg" width="1456" height="813" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f2e412a9-a322-4f1a-ab83-1e8dc53a127c_2752x1536.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:813,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:634051,&quot;alt&quot;:&quot;AP2 AgentCard poisoning confused deputy attack: AI payment agent hijacked via A2A protocol before cryptographic mandate signing&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/175231527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2e412a9-a322-4f1a-ab83-1e8dc53a127c_2752x1536.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="AP2 AgentCard poisoning confused deputy attack: AI payment agent hijacked via A2A protocol before cryptographic mandate signing" title="AP2 AgentCard poisoning confused deputy attack: AI payment agent hijacked via A2A protocol before cryptographic mandate signing" srcset="https://substackcdn.com/image/fetch/$s_!K12H!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2e412a9-a322-4f1a-ab83-1e8dc53a127c_2752x1536.jpeg 424w, https://substackcdn.com/image/fetch/$s_!K12H!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2e412a9-a322-4f1a-ab83-1e8dc53a127c_2752x1536.jpeg 848w, https://substackcdn.com/image/fetch/$s_!K12H!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2e412a9-a322-4f1a-ab83-1e8dc53a127c_2752x1536.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!K12H!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2e412a9-a322-4f1a-ab83-1e8dc53a127c_2752x1536.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>TL;DR:</strong> Google&#8217;s AP2 protocol and Universal Commerce Protocol put a $5 trillion agentic commerce stack into developer hands in early 2026. The A2A protocol underneath it uses digital business cards called AgentCards to route tasks between agents. Researchers proved you can stuff a prompt injection payload inside that card. Your agent becomes a confused deputy: it holds your payment permissions and takes orders from us. The crypto signatures don&#8217;t help. We sign the mandate ourselves. </p><blockquote><p>This is the public feed. Upgrade to see what doesn&#8217;t make it out.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.toxsec.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.toxsec.com/subscribe?"><span>Subscribe now</span></a></p></blockquote><h2>Your AI Just Signed a Wire Transfer You Didn&#8217;t Approve</h2><p>January 11, 2026. Google announces the Universal Commerce Protocol at the National Retail Federation conference. UCP sits on top of AP2 (the Agent Payments Protocol), talks to A2A (the agent-to-agent coordination protocol) and MCP (the tool protocol), and lets your AI agent complete a purchase directly from Google Search. Shopify, Target, Walmart, Visa, all signed. AP2 went live with 60 partner organizations in September 2025. McKinsey projects $3 to $5 trillion in global agentic commerce volume by 2030.</p><p>Here is the trust model this stack runs on: a user expresses intent, an AI agent acts on it, and AP2 generates a cryptographically signed mandate proving the user authorized the transaction. The mandate is tamper-evident. The payment is non-reputable. The audit trail is clean. On paper, this is a closed system.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!93TQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F709136eb-3f57-4efd-a2e7-b97d55da938c_1451x879.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!93TQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F709136eb-3f57-4efd-a2e7-b97d55da938c_1451x879.png 424w, https://substackcdn.com/image/fetch/$s_!93TQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F709136eb-3f57-4efd-a2e7-b97d55da938c_1451x879.png 848w, https://substackcdn.com/image/fetch/$s_!93TQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F709136eb-3f57-4efd-a2e7-b97d55da938c_1451x879.png 1272w, https://substackcdn.com/image/fetch/$s_!93TQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F709136eb-3f57-4efd-a2e7-b97d55da938c_1451x879.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!93TQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F709136eb-3f57-4efd-a2e7-b97d55da938c_1451x879.png" width="1451" height="879" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/709136eb-3f57-4efd-a2e7-b97d55da938c_1451x879.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:879,&quot;width&quot;:1451,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:128858,&quot;alt&quot;:&quot;Toxsec.com image.&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/175231527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F709136eb-3f57-4efd-a2e7-b97d55da938c_1451x879.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Toxsec.com image." title="Toxsec.com image." srcset="https://substackcdn.com/image/fetch/$s_!93TQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F709136eb-3f57-4efd-a2e7-b97d55da938c_1451x879.png 424w, https://substackcdn.com/image/fetch/$s_!93TQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F709136eb-3f57-4efd-a2e7-b97d55da938c_1451x879.png 848w, https://substackcdn.com/image/fetch/$s_!93TQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F709136eb-3f57-4efd-a2e7-b97d55da938c_1451x879.png 1272w, https://substackcdn.com/image/fetch/$s_!93TQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F709136eb-3f57-4efd-a2e7-b97d55da938c_1451x879.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The problem is the agent. AP2&#8217;s cryptographic protections fire after the agent decides what to sign. If an attacker already owns the agent&#8217;s reasoning by the time the mandate gets generated, the crypto is irrelevant. We get a perfectly valid signature on a fraudulent transaction. The math is clean. The money is gone. If you want to see the same logic one layer down the stack, <a href="https://www.toxsec.com/p/lets-poison-the-mcp">our MCP tool poisoning breakdown</a> shows how this exact reasoning failure starts at the tool layer.</p><h2>How AgentCard Poisoning Hijacks AP2 Payment Scope</h2><p>The A2A protocol is how AI agents find and hire each other. Every agent publishes an AgentCard, a JSON file served at <code>/.well-known/agent.json</code> on its domain. Think of it as a machine-readable business card: &#8220;I process invoices. I validate vendors. I have access to the procurement database.&#8221; The host agent reads every card it encounters and decides which agent handles which task.</p><p>The confused deputy problem, a concept in computer security since the late 1980s, describes what happens when a trusted program gets tricked into misusing its own privileges. The program isn&#8217;t compromised in the traditional sense. It&#8217;s just doing exactly what it was told, by the wrong person, with the right credentials. In an A2A context, the host agent is the deputy. It holds your AP2 payment scope. It follows instructions from whatever AgentCards it reads.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Lmbt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbade45e9-9aba-4d2f-ba62-2b6fea0040c8_1333x814.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Lmbt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbade45e9-9aba-4d2f-ba62-2b6fea0040c8_1333x814.png 424w, https://substackcdn.com/image/fetch/$s_!Lmbt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbade45e9-9aba-4d2f-ba62-2b6fea0040c8_1333x814.png 848w, https://substackcdn.com/image/fetch/$s_!Lmbt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbade45e9-9aba-4d2f-ba62-2b6fea0040c8_1333x814.png 1272w, https://substackcdn.com/image/fetch/$s_!Lmbt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbade45e9-9aba-4d2f-ba62-2b6fea0040c8_1333x814.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Lmbt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbade45e9-9aba-4d2f-ba62-2b6fea0040c8_1333x814.png" width="1333" height="814" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bade45e9-9aba-4d2f-ba62-2b6fea0040c8_1333x814.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:814,&quot;width&quot;:1333,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:110941,&quot;alt&quot;:&quot;Toxsec.com image&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/175231527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbade45e9-9aba-4d2f-ba62-2b6fea0040c8_1333x814.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Toxsec.com image" title="Toxsec.com image" srcset="https://substackcdn.com/image/fetch/$s_!Lmbt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbade45e9-9aba-4d2f-ba62-2b6fea0040c8_1333x814.png 424w, https://substackcdn.com/image/fetch/$s_!Lmbt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbade45e9-9aba-4d2f-ba62-2b6fea0040c8_1333x814.png 848w, https://substackcdn.com/image/fetch/$s_!Lmbt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbade45e9-9aba-4d2f-ba62-2b6fea0040c8_1333x814.png 1272w, https://substackcdn.com/image/fetch/$s_!Lmbt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbade45e9-9aba-4d2f-ba62-2b6fea0040c8_1333x814.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>August 2025. <a href="https://www.trustwave.com/en-us/resources/blogs/spiderlabs-blog/agent-in-the-middle-abusing-agent-cards-in-the-agent-2-agent-protocol-to-win-all-the-tasks/">Trustwave SpiderLabs published research</a> they called &#8220;Agent in the Middle.&#8221; The attack: stuff a prompt injection payload directly into an AgentCard&#8217;s capability description field. The A2A spec in version 0.3 and above supports AgentCard signing but does not enforce it. Your host agent fetches the card, reads the descriptions, and the injected instructions land in the model&#8217;s context window as trusted input. Every subsequent task routes through the compromised agent. Every A2A call, every MCP tool use, every AP2 mandate gets generated under our influence.</p><p>We can redirect a vendor. We can inflate a transaction amount slightly enough not to trigger fraud alerts. We can swap the shipping address. The mandate gets signed. The audit trail says you approved it.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!E5_q!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a0a5873-4cd4-45c7-ad8c-a41c073c9771_1362x944.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!E5_q!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a0a5873-4cd4-45c7-ad8c-a41c073c9771_1362x944.png 424w, https://substackcdn.com/image/fetch/$s_!E5_q!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a0a5873-4cd4-45c7-ad8c-a41c073c9771_1362x944.png 848w, https://substackcdn.com/image/fetch/$s_!E5_q!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a0a5873-4cd4-45c7-ad8c-a41c073c9771_1362x944.png 1272w, https://substackcdn.com/image/fetch/$s_!E5_q!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a0a5873-4cd4-45c7-ad8c-a41c073c9771_1362x944.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!E5_q!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a0a5873-4cd4-45c7-ad8c-a41c073c9771_1362x944.png" width="1362" height="944" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9a0a5873-4cd4-45c7-ad8c-a41c073c9771_1362x944.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:944,&quot;width&quot;:1362,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:116885,&quot;alt&quot;:&quot;A2A host agent parsing poisoned AgentCard capability description, showing injected instruction field beside legitimate invoice processing capability&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/175231527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a0a5873-4cd4-45c7-ad8c-a41c073c9771_1362x944.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="A2A host agent parsing poisoned AgentCard capability description, showing injected instruction field beside legitimate invoice processing capability" title="A2A host agent parsing poisoned AgentCard capability description, showing injected instruction field beside legitimate invoice processing capability" srcset="https://substackcdn.com/image/fetch/$s_!E5_q!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a0a5873-4cd4-45c7-ad8c-a41c073c9771_1362x944.png 424w, https://substackcdn.com/image/fetch/$s_!E5_q!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a0a5873-4cd4-45c7-ad8c-a41c073c9771_1362x944.png 848w, https://substackcdn.com/image/fetch/$s_!E5_q!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a0a5873-4cd4-45c7-ad8c-a41c073c9771_1362x944.png 1272w, https://substackcdn.com/image/fetch/$s_!E5_q!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a0a5873-4cd4-45c7-ad8c-a41c073c9771_1362x944.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>Agent Session Smuggling Finishes What AgentCard Poisoning Started</h2><p>November 2025. <a href="https://unit42.paloaltonetworks.com/agent-session-smuggling-in-agent2agent-systems/">Palo Alto&#8217;s Unit 42 published research</a> on a variant they call Agent Session Smuggling. This one doesn&#8217;t need an initial AgentCard compromise. It works on agents you&#8217;ve already authenticated and trusted.</p><p>A2A is a stateful protocol: it remembers the conversation. This is what makes it useful for multi-step tasks. An agent can pick up a workflow mid-stream, carry context across turns, and coordinate complex purchase flows. Unit 42 found that a malicious remote agent can exploit this statefulness to inject instructions between a legitimate request and its response. The A2A spec includes a legitimate mechanism called <code>input-required</code> state, where a remote agent asks the client agent for additional information. Session smuggling weaponizes that mechanism.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!KP1O!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4527769b-55b2-413b-b3c7-6bd3914e44fa_1102x1037.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!KP1O!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4527769b-55b2-413b-b3c7-6bd3914e44fa_1102x1037.png 424w, https://substackcdn.com/image/fetch/$s_!KP1O!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4527769b-55b2-413b-b3c7-6bd3914e44fa_1102x1037.png 848w, https://substackcdn.com/image/fetch/$s_!KP1O!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4527769b-55b2-413b-b3c7-6bd3914e44fa_1102x1037.png 1272w, https://substackcdn.com/image/fetch/$s_!KP1O!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4527769b-55b2-413b-b3c7-6bd3914e44fa_1102x1037.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!KP1O!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4527769b-55b2-413b-b3c7-6bd3914e44fa_1102x1037.png" width="1102" height="1037" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4527769b-55b2-413b-b3c7-6bd3914e44fa_1102x1037.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1037,&quot;width&quot;:1102,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:92454,&quot;alt&quot;:&quot;Toxsec.com image&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/175231527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4527769b-55b2-413b-b3c7-6bd3914e44fa_1102x1037.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Toxsec.com image" title="Toxsec.com image" srcset="https://substackcdn.com/image/fetch/$s_!KP1O!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4527769b-55b2-413b-b3c7-6bd3914e44fa_1102x1037.png 424w, https://substackcdn.com/image/fetch/$s_!KP1O!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4527769b-55b2-413b-b3c7-6bd3914e44fa_1102x1037.png 848w, https://substackcdn.com/image/fetch/$s_!KP1O!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4527769b-55b2-413b-b3c7-6bd3914e44fa_1102x1037.png 1272w, https://substackcdn.com/image/fetch/$s_!KP1O!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4527769b-55b2-413b-b3c7-6bd3914e44fa_1102x1037.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Here&#8217;s the move: your client agent sends a normal task request. The remote agent starts processing. Midway through the session, the remote agent fires back an <code>input-required</code> update containing injected instructions referencing credentials, vendor details, or payment routing. Your client agent reads this as a continuation of an authenticated conversation. The instructions execute before any AP2 mandate validation runs.</p><p>Layered on top of the AgentCard poisoning in 0x01, the full chain looks like this: the poisoned card establishes our agent in the routing path, session smuggling injects transaction-level instructions mid-session, and the resulting AP2 mandate carries our modifications with the user&#8217;s valid signature. <a href="https://developers.redhat.com/articles/2025/08/19/how-enhance-agent2agent-security">Red Hat confirmed</a> the A2A spec provides no built-in defense against this specific injection pattern.</p><h2>The Mandate Signs Whatever the Confused Deputy Tells It To</h2><p>AP2&#8217;s security story centers on verifiable digital credentials. Three mandate types, each cryptographically signed using ECDSA: an Intent Mandate (pre-authorized spending parameters), a Cart Mandate (the exact items and price at checkout), and a Payment Mandate (the financial instruction to the network). The claim is that these mandates create a tamper-evident chain of evidence from user intent to executed transaction.</p><p>Here is where the claim breaks. The chain starts with the agent. The user&#8217;s intent gets translated into a mandate by the AI agent acting on their behalf. If we own the agent&#8217;s context through the two attacks above, we own what the intent looks like by the time the mandate is generated. We don&#8217;t tamper with the mandate after signing. We tamper with what the agent believes the intent was before it picks up the pen.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ojLK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff170ddad-7ed3-4514-8e93-6162c6e5e11c_1141x977.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ojLK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff170ddad-7ed3-4514-8e93-6162c6e5e11c_1141x977.png 424w, https://substackcdn.com/image/fetch/$s_!ojLK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff170ddad-7ed3-4514-8e93-6162c6e5e11c_1141x977.png 848w, https://substackcdn.com/image/fetch/$s_!ojLK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff170ddad-7ed3-4514-8e93-6162c6e5e11c_1141x977.png 1272w, https://substackcdn.com/image/fetch/$s_!ojLK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff170ddad-7ed3-4514-8e93-6162c6e5e11c_1141x977.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ojLK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff170ddad-7ed3-4514-8e93-6162c6e5e11c_1141x977.png" width="1141" height="977" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f170ddad-7ed3-4514-8e93-6162c6e5e11c_1141x977.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:977,&quot;width&quot;:1141,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:93933,&quot;alt&quot;:&quot;Toxsec.com image&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.toxsec.com/i/175231527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff170ddad-7ed3-4514-8e93-6162c6e5e11c_1141x977.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Toxsec.com image" title="Toxsec.com image" srcset="https://substackcdn.com/image/fetch/$s_!ojLK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff170ddad-7ed3-4514-8e93-6162c6e5e11c_1141x977.png 424w, https://substackcdn.com/image/fetch/$s_!ojLK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff170ddad-7ed3-4514-8e93-6162c6e5e11c_1141x977.png 848w, https://substackcdn.com/image/fetch/$s_!ojLK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff170ddad-7ed3-4514-8e93-6162c6e5e11c_1141x977.png 1272w, https://substackcdn.com/image/fetch/$s_!ojLK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff170ddad-7ed3-4514-8e93-6162c6e5e11c_1141x977.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>AP2&#8217;s own documentation acknowledges the gap: the protocol explicitly asks how merchants can verify that an agent&#8217;s request accurately reflects the user&#8217;s true intent. Their answer is the mandate system itself. But the mandate system assumes the agent is trustworthy when it generates the mandate. Researchers at Trustwave, Unit 42, and Cloud Security Alliance all documented in 2025 that this assumption fails at the input parsing layer, before mandate generation begins. In December 2025, OpenAI published guidance on hardening their browser agent and acknowledged that prompt injection, the root mechanism of every attack in this chain, will likely never be fully solved.</p><p>AP2&#8217;s math is clean. The attack lands before the math runs.</p><blockquote><p>We dropped the free chapters. Now breach the wall for the dead-simple step-by-step kill switch that shuts this all down.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.toxsec.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.toxsec.com/subscribe?"><span>Subscribe now</span></a></p></blockquote>
      <p>
          <a href="https://www.toxsec.com/p/the-agent-economy-is-waking-up">
              Read more
          </a>
      </p>
   ]]></content:encoded></item></channel></rss>