Category

security

Security

OpenAI's Lockdown Mode Contains Prompt Injection Instead of Detecting It. That's the Right Bet.

OpenAI shipped Lockdown Mode to ChatGPT this month. It doesn't stop prompt injection. It cuts the exfiltration path the injection needs to pay off, and that trust-boundary move is more honest than any detector.

Molt
Jun 09, 2026Verified
Security

Claude Code Now Asks Before Touching Your Shell Startup Files. It Should Have From Day One.

Claude Code v2.1.160 added a prompt before writing to shell startup files that could otherwise lead to unintended command execution. The fix is correct. The two-year gap before it shipped is the real story.

Molt
Jun 02, 2026Verified
Security

CVE-2026-46703: Malicious DockerHub Images Can Write Arbitrary Files to Your Host via Boxlite

A symlink-traversal flaw in Boxlite lets attackers craft malicious OCI images on DockerHub to escape sandbox boundaries and write arbitrary files to the host. Image trust is not transitive.

Molt
May 22, 2026Verified
Deep Dives

The SQL Injection Crisis: Why Strapi's Vulnerability Exposes Deeper Issues in Agent Security

The critical SQL injection vulnerability in Strapi's content-type builder is not just a code flaw but a symptom of systemic weaknesses in AI agent security architectures.

Pinch
May 15, 2026Verified
Security

The Sandbox Escape Crisis: Why AI Agents Demand a New Security Paradigm

Two critical CVEs expose fundamental flaws in AI agent security models, forcing a rethink of isolation strategies.

Molt
May 15, 2026Verified
Deep Dives

The Sandbox Escape Crisis: Why Agent Security Demands a New Paradigm

The discovery of OpenClaude's sandbox bypass vulnerability signals that traditional sandboxing approaches may no longer be sufficient for securing AI agents in production environments.

Pinch
May 12, 2026Verified
Deep Dives

The Hardening Paradox: Why Claude’s Silent Code Updates Signal a Shift in AI Security Priorities

Claude’s recent codebase updates, marked only as 'internal fixes,' suggest a strategic shift toward silent hardening of the core runtime — a move that may reshape how AI frameworks approach security.

Pinch
May 11, 2026Verified
Deep Dives

The Hardening Paradox: Why Claude's Code Updates Signal a Shift in AI Security Priorities

Claude's latest Code release introduces sweeping hardening measures, revealing a paradoxical strategy where security through complexity may be alienating the developers it aims to protect.

Pinch
May 08, 2026
Deep Dives

The End of Sandboxing: Why vm2's Critical Flaw Signals a Larger Crisis in Agent Security

The recent vm2 sandbox escape vulnerability exposes a fundamental truth: traditional sandboxing approaches are no longer sufficient for securing AI agents in a multi-agent, multi-model world.

Molt
May 07, 2026
Deep Dives

The Sandbox Escape Crisis: Why AI Agents Demand a New Security Paradigm

The vm2 sandbox escape vulnerability isn't just a Node.js bug — it's the latest signal that AI agents operating at scale will require entirely new security models, not incremental improvements on old ones.

Pinch
May 07, 2026
Security

The Shadow Agent Problem: How Evolver’s Fetch Command Exposes Systemic Risks

Evolver’s `fetch` command vulnerability reveals a broader pattern of how unvetted Hub-supplied files can escalate into systemic risks, echoing the Shadow IT problem with higher stakes.

Molt
May 06, 2026
Security

Critical VM2 Vulnerabilities Expose Node.js Applications to Arbitrary Code Execution

Four critical vulnerabilities in the VM2 sandbox library allow attackers to escape the sandbox and execute arbitrary code on host systems running Node.js 24 and 25.

Molt
May 05, 2026