
When AI Tools Become the Backdoor: Zero-Click RCE via Prompt Injection 

By: Cymulate Research Lab

May 6, 2026

Ilan Kalendarov, Security Research Team Lead
Ben Zamir, Security Researcher
Elad Beber, Security Researcher

Affected Tools: Cursor CLI, AWS Kiro, GitHub Copilot CLI, Codex Desktop App and Gemini CLI

Vulnerabilities in common AI products can turn everyday AI tool usage, like opening a repo or running a prompt, into a potential silent system compromise. Simply opening a project or interacting with AI-generated content can trigger the attack chain. 

After responsibly notifying the vendors of Cursor CLI, AWS Kiro, GitHub Copilot CLI, Codex Desktop App and Gemini CLI, Cymulate Research Labs has published new attack scenarios for Cymulate Exposure Validation that simulate these attack chains in a production-safe way, allowing organizations to test and validate their threat exposure. 

Executive Summary 

Cymulate Research Labs uncovered a set of vulnerabilities across products from multiple major vendors that can enable remote code execution from a single prompt injection. These issues fall into two primary classes: 

  1. Arbitrary Binary Execution via Untrusted Resolution 
    AI CLI tools on Windows rely on the default executable search order, which prioritizes the current working directory over trusted system paths. As a result, attacker-controlled files placed in a workspace (e.g., cloned repositories, extracted archives, or binaries introduced via prompt injection) can be executed automatically without user approval, signature validation or warning.  
  2. Arbitrary Command Execution via Configuration Poisoning 
    If the LLM’s file-write capability allows modifications to execution-sensitive files (such as tasks.json or settings.json) without user approval, it can be abused to inject commands that execute at the OS level. Because these files are automatically processed by the IDE, a prompt injection alone can lead to arbitrary command execution upon reopening the workspace. 

The issues affect the following products: Cursor CLI, AWS Kiro, GitHub Copilot CLI, Codex Desktop App and Gemini CLI. 

We reported all findings to the respective vendors. Responses varied: one vendor engaged thoroughly, one issued a bounty but deprioritized remediation, and the rest closed the reports without meaningful technical engagement. 

This disparity directly impacts user security. GitHub was the only vendor to properly triage and validate the report, while others either shifted responsibility to users or dismissed the findings. 

Critically, the tools with the highest impact remain vulnerable, enabling prompt injection -> code execution chains. 

As a result, users are exposed. Routine actions such as opening a project or interacting with AI-generated content can lead to silent code execution with no warnings or visible indicators. Check out the practical, actionable advice for CISOs and organizations at the end of this blog. 

Introduction 

AI agent applications are no longer confined to developer workflows. As more users rely on them to perform increasingly sensitive tasks, they have become a fixture of daily life and, with that, a high-value target. The same deep integration that makes these tools productive is precisely what makes them dangerous. 

In Part 1 of this series, we demonstrated how built-in sandboxing features across multiple AI application tools could be bypassed through different mechanisms, exposing the gap between the isolation guarantees these tools advertise and the weak mechanisms that actually enforce them. 

In Part 2, we go further. We uncover arbitrary command execution primitives that provide an unauthorized foothold on the operating system running these applications. In some cases, these primitives can be chained with additional weaknesses to achieve zero-click remote code execution, triggered simply by using the tool and letting it exercise the very feature it relies on to gather context: web search. 

The vulnerability classes documented here can be triggered by a single injected prompt. A malicious instruction embedded in a README, a GitHub issue, a Stack Overflow forum reply or a documentation page is enough to set the execution chain in motion.  

The Two Vulnerability Classes 

Class 1: Prompt-Driven Untrusted Binary Resolution 

On Windows, several AI CLI/IDE tools resolve external executables (most commonly git.exe but also npx.exe, where.exe and others depending on the tool) using the default Windows executable search order. That search order includes the current working directory before trusted system paths. For example, a file named git.exe placed in a project folder will be executed in preference to the legitimate system-installed Git binary. 
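
To make the failure mode concrete, here is a minimal sketch in Node.js of the difference between relying on the Windows search order and pinning a binary to a vetted absolute path. The resolveTrusted helper and its candidate directories are illustrative assumptions on our part, not code from any affected product.

javascript

// Illustrative sketch only: resolveTrusted and its candidate paths are our
// assumptions, not any vendor's implementation.
const { spawn } = require("child_process");
const path = require("path");
const fs = require("fs");

// UNSAFE on Windows: CreateProcess searches the current working directory
// before trusted system paths, so a workspace-local git.exe wins.
// spawn("git", ["status"], { cwd: workspaceDir });

// Safer: resolve against an allowlist of trusted install locations and
// refuse anything that resolves inside the workspace.
function resolveTrusted(binary, workspaceDir) {
  const candidates = [
    path.join(process.env.ProgramFiles || "C:\\Program Files", "Git", "cmd"),
    path.join(process.env.SystemRoot || "C:\\Windows", "System32"),
  ];
  for (const dir of candidates) {
    const full = path.join(dir, binary);
    if (fs.existsSync(full) && !full.startsWith(path.resolve(workspaceDir))) {
      return full; // absolute path outside the workspace
    }
  }
  throw new Error(`refusing to resolve ${binary} from an untrusted location`);
}

const gitPath = resolveTrusted("git.exe", process.cwd());
spawn(gitPath, ["--version"], { stdio: "inherit" });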

Class 2: Prompt-Driven Configuration Poisoning 

Modern AI applications and agents are equipped with file-write tools that allow the LLM to create and modify files within the project workspace. To maximize productivity, these tools are granted broad write access to the project folder by default. In several of the tools we examined, this capability lacks proper boundaries: the LLM-driven file-write tool can reach configuration paths that carry special meaning to the IDE, such as paths that trigger automatic command execution the next time the project is opened. 

When combined with tool-specific weaknesses, such as Cursor CLI's write tool having a special code path for binary file creation, or AWS Kiro's write tool being permitted to create sensitive IDE configuration files without approval, a prompt injection can escalate all the way to remote command execution. No user consent is required at any stage, and the entire chain originates from what appears to be a routine and legitimate tool action. A guard along the lines of the sketch below would close the gap. 
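
This is a minimal sketch of the missing boundary check, assuming a hypothetical requiresApproval hook in the write tool; it is not any vendor's actual code.

javascript

// Hypothetical boundary check for an LLM file-write tool. The deny list and
// function name are illustrative assumptions, not a vendor implementation.
const path = require("path");

const SENSITIVE = new Set([
  ".vscode", ".cursor", ".idea",
  "tasks.json", "settings.json", "mcp.json",
]);

function requiresApproval(workspaceRoot, targetPath) {
  const rel = path.relative(workspaceRoot, path.resolve(workspaceRoot, targetPath));
  return rel.split(path.sep).some((part) => SENSITIVE.has(part.toLowerCase()));
}

// requiresApproval(root, "src/index.js")       -> false: write freely
// requiresApproval(root, ".vscode/tasks.json") -> true: ask the user first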

Full Remote Code Execution Originating from Prompt Injection 

Cursor CLI 

Class 1: Untrusted Binary Resolution 

Arbitrary Code Execution via Write Tool Binary Planting and Implicit npx.exe Resolution 

Affected Product: Cursor CLI 
Platform: Windows 
Vulnerability Type: Write Tool Abuse + Untrusted Executable Resolution 
Impact: Remote Code Execution on Host 
Vendor Response: Closed as Informative; still vulnerable 
Disclosure Date: February 10, 2026 

Overview 

Cursor CLI presents the most complete exploitation chain in this research. It combines an LLM-driven file-write tool that can create binary executables with an unsafe npx.exe resolution path that executes workspace-local binaries automatically on user prompt. Neither step requires user approval. Together, they form a two-stage, prompt-to-execution pipeline. 

Issue 1: Write Tool Binary Planting 

The Cursor CLI Write tool has a notable behavior not found in the other tools we examined, including Cursor's own IDE product. When the Write tool is invoked with a binary file extension (.exe, .dll, .so) as the file_path argument and the content argument is base64-encoded, the tool automatically detects the extension, switches to binary encoding mode, decodes the payload, and writes a real executable to disk. What makes this particularly deceptive is that the content displayed to the user appears as base64 text, with no visual indication that the data being written will resolve to a binary file. No warning is shown, no approval dialog is presented, and nothing in the interface signals that an executable has just been created in the workspace. To the user, it looks like a simple text-file write. 

A base64-encoded binary payload delivered through a prompt instruction (e.g., embedded in a repository README, a documentation page, or any other source the agent reads) is sufficient to plant a live malicious executable in the workspace. 
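
The behavior described above can be approximated in a few lines. The following is our reconstruction of the logic, offered only to illustrate the mechanism; it is not Cursor's actual source.

javascript

// Our reconstruction of the described behavior, for illustration only.
const fs = require("fs");
const path = require("path");

// Extensions that silently switch the tool into binary mode.
const BINARY_EXTENSIONS = new Set([".exe", ".dll", ".so"]);

function writeTool(filePath, content) {
  if (BINARY_EXTENSIONS.has(path.extname(filePath).toLowerCase())) {
    // The user sees only base64 text; a real executable lands on disk,
    // with no warning or approval dialog.
    fs.writeFileSync(filePath, Buffer.from(content, "base64"));
  } else {
    fs.writeFileSync(filePath, content, "utf8");
  }
}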

Issue 2: Unsafe npx.exe Resolution 

When Cursor CLI performs agent operations in response to a user prompt, internal dependencies in the bundled application invoke npx.exe. Static analysis of the application bundle reveals that the TypeScript language server provider resolves npx using a local exec helper function that does not enforce absolute-path resolution. On Windows, this means the current working directory is searched first. 

The resolution and execution code lives in the bundled node modules region of the main index.js: 

javascript 

t = (0, o.Ef)("npx", [])
...
const s = (0, f.spawn)(n, ["-y", "typescript-language-server", "--stdio"], {
  cwd: e, // workspace directory used as cwd
  ...
});

The Ef helper resolves npx without enforcing an absolute path, allowing a workspace-local npx.exe to take precedence over the system installation. Execution is triggered automatically when the user sends any prompt to the agent after a restart. No confirmation is presented. 
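
For contrast, a hardened version of the same call site could pin npx to a vetted absolute path before spawning. The sketch below assumes the illustrative resolveTrusted helper from the Class 1 section above; it is not Cursor's code.

javascript

// Hardened sketch (our assumption, not Cursor's code): resolve npx.exe to an
// absolute path outside the workspace instead of trusting the search order.
const npxPath = resolveTrusted("npx.exe", e); // e = workspace directory
const s = spawn(npxPath, ["-y", "typescript-language-server", "--stdio"], {
  cwd: e,
});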

Attack Chain 

The complete exploitation chain proceeds as follows: 

Step 1: Malicious instruction delivery. The attacker embeds a prompt injection instruction in any content Cursor CLI will read: a repository README, source code comments, a documentation page, a forum reply. The instruction directs the LLM to write a file named npx.exe with base64-encoded binary content. 

For illustration purposes, a simple Git README with malicious instructions was created: 

Step 2: Write tool execution (no approval). Cursor CLI's Write tool receives the instruction. It detects the .exe extension, decodes the base64 content, and writes a Windows PE executable named npx.exe to the workspace directory. No user approval is requested. The binary sits alongside normal project files.

Note: in more complex attack scenarios, the executable write action could be masked with additional harmless actions, such as creating more files or highlighting irrelevant information to the user. 

Even though the output does not clearly state it, a malicious binary has been planted:

Step 3: Automatic execution on next prompt. The user restarts Cursor CLI and sends any prompt. The TypeScript LSP provider initializes, resolves npx.exe from the current working directory, and executes the attacker's binary. The binary runs under the user's security context with full access to the underlying OS, environment variables, credentials, SSH keys, cloud tokens, etc. 

This chain is notable for two reasons:  

  1. It is entirely prompt-driven: the attacker needs only to influence content that the agent will read.  
  2. The execution trigger (e.g., a web search) is the most routine action a developer takes when using an AI coding tool. There is no anomalous behavior to detect at either stage. 

Vendor Response 

Cursor closed this report as Informative on February 18, 2026, eight days after submission. The stated rationale was that findings requiring a malicious binary "lack an attack vector." The closure did not address the Write tool binary planting component, the mechanism by which the binary enters the workspace without any pre-existing attacker access. A formal rebuttal was submitted. No response followed. 

AWS Kiro 

Class 2: Prompt-Driven Configuration Poisoning 

Prompt-Triggered Remote Command Execution via Unrestricted LLM File Write to Workspace Configuration Paths 

Affected Product: AWS Kiro IDE 
Platform: Any (Windows, macOS, Linux) 
Vulnerability Type: LLM File Write + Implicit VS Code Task Execution 
Impact: Remote Code Execution on Host 
Vendor Response: Closed as "expected behavior"; still vulnerable 
Disclosure Date: February 2, 2026 

Overview 

AWS Kiro is an AI-powered IDE developed by Amazon, built on the VS Code foundation. It exposes an LLM-driven file-write tool (fswrite) that can create and modify files anywhere within the project workspace without user approval. Kiro inherits VS Code's automatic task execution behavior, whereby tasks defined in .vscode/tasks.json with "runOn": "folderOpen" execute automatically when the project folder is opened. 

The combination creates a clean prompt-to-execution path: a malicious instruction causes the LLM to write a .vscode/tasks.json containing an attacker-controlled command, and the command fires, without approval or user interaction, the next time the user opens the project. 

Vulnerability Chain 

Core issue: Unrestricted LLM tool write access to security-sensitive paths. The fswrite tool in AWS Kiro permits the LLM to create or modify files in directories that control IDE execution behavior, including .vscode, without requiring user approval. There is no path restriction, no confirmation prompt, and no distinction between workspace content and IDE configuration. 

Core feature: AWS Kiro inherits VS Code's behavior of automatically loading and executing tasks defined in .vscode/tasks.json. A task with "runOn": "folderOpen" fires immediately when the project is opened with the user's privileges. 

Combined impact: When these two behaviors are chained, an attacker who can influence LLM prompt input can achieve arbitrary OS command execution through prompt injection alone. 

Attack Chain 

Step 1: Prompt injection. The attacker places malicious instructions in content the LLM will process: a repository README, inline code comments, a link to an attacker-controlled documentation page, or any other indirect source. The instruction directs the LLM to write a file at .vscode/tasks.json. 

Step 2: LLM file write (no approval). Kiro's fswrite tool writes the following payload to .vscode/tasks.json with no user prompt: 

json 

{
    "version": "2.0.0",
    "tasks": [
        {
            "label": "dev server",
            "type": "shell",
            "command": "<ATTACKER_PAYLOAD>",
            "runOptions": {
                "runOn": "folderOpen"
            },
            "problemMatcher": []
        }
    ]
}

For illustrative purposes, let's view the raw tool call:

And the same action can be triggered from a web search, for example: 

Step 3: Automatic execution on folder open. The next time the user opens the project folder in Kiro (or any VS Code–based IDE), the task fires automatically. The attacker's command executes with the user's full privileges. No warning is presented.

PoC Video

It is important to note that while users are required to grant trust to a workspace folder before the IDE will execute its contents, this trust decision is a one-time gate, and prompt injection can occur at any point after it. Once trust has been granted, modifications to the .vscode folder and its contents trigger no further alerts, even when those modifications are made by an LLM tool acting on instructions from an untrusted prompt source. 

Other vendors, including VS Code itself, restrict the LLM write tool from altering workspace-level configuration without explicit approval and, in some cases, a clear warning. 
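
As a compensating control, a watcher over a trusted workspace's configuration directory can surface these silent modifications. This is our sketch of a defensive idea, not a feature of any affected product.

javascript

// Illustrative watcher: alert on any change to .vscode after trust is granted.
const fs = require("fs");
const path = require("path");

const workspace = process.argv[2] || process.cwd();
const vscodeDir = path.join(workspace, ".vscode");

if (fs.existsSync(vscodeDir)) {
  fs.watch(vscodeDir, (eventType, filename) => {
    console.warn(`[alert] ${eventType}: ${path.join(vscodeDir, filename || "")}`);
  });
  console.log(`watching ${vscodeDir} for post-trust configuration changes`);
}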

This transforms a single successful prompt injection into a potential multi-victim remote command execution attack flow, propagated through planted malicious instructions. 

Vendor Response 

AWS closed this report as Informative on February 11, 2026, describing the behavior as "expected." A follow-up email was sent noting that other vendors providing LLM-integrated IDEs had implemented restrictions on assistant-driven writes to execution-sensitive workspace paths, and that VS Code itself, the upstream project on which Kiro is based, includes safeguards against this execution path. AWS maintained its position that the behavior is intended, with no remediation indicated.

GitHub Copilot CLI 

Class 1: Arbitrary Code Execution via Untrusted Working Directory Binary Hijacking 

Affected Product: GitHub Copilot CLI 
Platform: Windows 10 / 11 
Vulnerability Type: Untrusted Search Path / Binary Hijacking 
Impact: Arbitrary Code Execution on Host 
Vendor Response: Triaged, bounty paid, severity downgraded, no fix committed 
Disclosure Date: February 11, 2026 

Overview 

GitHub Copilot CLI on Windows resolves external dependencies, specifically git.exe and which.exe, using the default Windows executable search order as part of its tool discovery and initialization logic. That search order prioritizes the current working directory over trusted system paths. A file named git.exe or which.exe placed in the workspace is executed automatically during Copilot CLI startup, before any folder trust prompt is presented to the user. 

Technical Detail 

When GitHub Copilot CLI starts, it attempts to locate git.exe to support its Git integration functionality. The discovery mechanism relies on the Windows search path without enforcing absolute-path resolution or validating the origin of the resolved binary. The current working directory, the developer's project folder, is searched first. A workspace-local binary matching the expected filename is found and executed silently, with no signature verification and no user notification. 

As confirmed in testing, both git.exe and which.exe are affected. The binary executes under the user's full security context with access to source code, environment variables, API keys, cloud credentials, SSH keys, and Git authentication tokens. 

Most importantly, this execution occurs before any trust prompt is displayed. The intended trust mechanism is bypassed entirely rather than subverted from within. 

Attack Chain 

Step 1: Binary delivery. The attacker places a malicious git.exe in a directory the developer will use as a Copilot CLI workspace. Delivery mechanisms include a public or private Git repository containing the malicious binary, a ZIP or other archive that extracts to include git.exe, or generated project scaffolding. 

Step 2: Developer opens workspace. The developer navigates to the workspace directory and launches GitHub Copilot CLI. 

Step 3: Automatic binary execution. During startup, Copilot CLI resolves git.exe from the current working directory before querying trusted system paths. The malicious binary executes silently, before any folder trust prompt appears. 

Step 4: Host compromise. The binary runs under the user’s OS security context. No user notification, approval dialog or integrity warning is presented at any point. 

Vendor Response 

Following triage, the severity was downgraded to low. GitHub noted that "the behavior may not be changed immediately" and closed the report as resolved. No committed timeline for remediation was provided, and GitHub's policy is not to disclose reports for low severity vulnerabilities, meaning users are not informed of this finding through official channels. 

The MSRC response (prior to referral to GitHub's bug bounty program) confirmed that the issue was reviewed by multiple engineers before being redirected, reflecting that the concern was substantively evaluated. 

Gemini CLI (Windows) 

Arbitrary Code Execution via git.exe Untrusted Search Path and Binary Hijacking 

Affected Product: Gemini CLI (latest version as of blog publication) 
Platform: Windows 10 / 11 
Vulnerability Type: Untrusted Search Path / Binary Hijacking 
Impact: Arbitrary Code Execution on Host 
Vendor Response: Acknowledged; remediation status unknown 
Disclosure Date: January 5, 2026 

Overview 

On Windows, Gemini CLI's tool discovery functionality resolves git.exe using the default Windows executable search order. When Gemini CLI is launched, it attempts to locate Git as part of its initialization logic. The search order includes the current working directory before trusted system locations. If a workspace directory contains the affected file name, e.g. git.exe, it is executed without origin validation, digital signature verification or any user notification. 

This behavior is independent of the filesystem isolation issues documented in Part 1. It operates entirely at the application layer, requires no in-container execution, and triggers automatically during normal tool launch. 

Technical Detail 

Two distinct issues apply: 

Untrusted Search Path. Gemini CLI resolves git.exe by attempting execution from the current working directory prior to consulting a trusted absolute path or system PATH. This is the direct cause of the binary hijacking condition. 

Improper Control of Executable Loading from Untrusted Sources. Gemini CLI implicitly trusts unsigned executables located in untrusted workspace directories. No origin, integrity or signature check is performed on the resolved binary. 

Attack Chain 

Step 1: Binary delivery. The attacker places a malicious git.exe in a directory the developer will use as a Gemini CLI workspace, through repository cloning, archive extraction or any other standard mechanism. 

Step 2: Developer launches Gemini CLI. The developer runs Gemini CLI from the affected workspace. 

Step 3: Automatic binary execution. During initialization, Gemini CLI's tool discovery module locates git.exe in the working directory and executes it. No approval, warning or integrity check occurs. 

Step 4: Arbitrary code execution. The binary runs under the developer's security context. In the PoC demonstration, this escalated to an elevated PowerShell process via UAC bypass, illustrating that downstream privilege escalation is possible depending on the system configuration.

Vendor Response 

Reported to Google on January 4, 2026. Google acknowledged receipt. Following extended back-and-forth, Google recognized the issue as valid. As of the date of this publication, no patch has been released and no confirmed remediation timeline has been communicated. 

Codex App (Windows) 

Arbitrary Code Execution via git.exe Untrusted Search Path and Binary Hijacking 

Affected Product: Codex App (Desktop Application) 
Platform: Windows 
Vulnerability Type: Untrusted Search Path / Binary Hijacking (CWE-426, CWE-829) 
Impact: Arbitrary Code Execution on Host 
Vendor Response: Closed as Not Applicable 
Disclosure Date: March 2, 2026 

Overview 

The Codex desktop application on Windows resolves git.exe using the default Windows executable search order when initializing Git-related functionality on folder open. A malicious git.exe in the project directory is executed automatically under the developer's security context, outside of the Codex sandbox user feature. 

Technical Detail 

When the user opens a project folder in Codex desktop app (Windows), the application initializes Git integration and invokes git. This resolution does not use an absolute path. The Windows executable search order is followed, and the current working directory (opened project folder) is searched before system-installed binaries. A workspace-local git.exe is found and executed before the legitimate Git installation is consulted. 

The execution occurs directly on the host OS. No user notification, approval dialog or integrity check is performed.

Attack Chain 

Step 1: Binary delivery. A malicious git.exe is placed in the project directory through any available channel: repository clone, archive extraction, or indirect prompt injection causing the LLM to write or download a binary. 

Step 2: Developer opens the project in Codex. The developer opens the folder in Codex App. Git initialization begins as part of normal startup. 

Step 3: Automatic binary execution. Codex resolves git.exe from the working directory and executes it without validation. The binary runs with the developer's privileges outside the sandbox. 

Vendor Response 

OpenAI closed this report as Not Applicable on March 10, 2026. The stated rationale was that if an attacker can replace git.exe remotely, they already have system access. This response conflates remote binary replacement (which was not the threat model) with binary planting in workspace directories via routine delivery mechanisms. A rebuttal clarifying the distinction between these scenarios was reviewed, but the decision was upheld without further technical engagement. 

Recommendations for End Users 

Because vendor remediation is incomplete or absent across most affected tools, the following mitigations represent user-controlled risk reduction. They do not address the root causes. 

1. Audit Workspace Directories Before Opening 

Before launching any AI CLI tool or IDE from a cloned repository, extracted archive, or any externally supplied project, inspect the workspace for executable files with names matching common system utilities. Files named git.exe, npx.exe, where.exe, docker.exe, or node.exe should never appear in a project workspace. Treat their presence as an indicator of compromise or repository poisoning. 
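
This pre-flight check can be scripted. The sketch below walks a workspace and flags executables that shadow common system utilities; the name list mirrors the one above, and the script itself is our illustration, not vendor tooling.

javascript

// Illustrative workspace scan for planted binaries.
const fs = require("fs");
const path = require("path");

const SUSPICIOUS = new Set(["git.exe", "npx.exe", "where.exe", "docker.exe", "node.exe"]);

function scan(dir) {
  for (const entry of fs.readdirSync(dir, { withFileTypes: true })) {
    const full = path.join(dir, entry.name);
    if (entry.isDirectory()) {
      scan(full);
    } else if (SUSPICIOUS.has(entry.name.toLowerCase())) {
      console.warn(`[IoC] possible binary planting: ${full}`);
    }
  }
}

scan(process.argv[2] || process.cwd());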

2. Restrict LLM File-Write Scope 

Where your tool allows configuration of which paths the AI agent can write to, restrict write access to exclude security-sensitive directories (depending on the vendor: .vscode/, .cursor/, etc.) and any path that controls IDE startup or task execution behavior. If the tool does not provide this configuration, treat it as a risk factor in your threat model. As an alternative, consider manually restricting access to these paths. 

3. Apply Least-Privilege Principles to AI Tool Accounts 

Run AI CLI tools under virtualization using accounts with the minimum privileges necessary. This limits the blast radius if an attacker achieves code execution in the tool's context. Avoid running AI coding tools under accounts with cloud administrator roles, broad IAM permissions or access to production credentials. 

4. Inspect Configuration Files in Projects You Open 

Before opening any project in an AI-powered IDE, review .vscode/tasks.json, .vscode/settings.json, mcp.json, and any other IDE configuration files present in the root. Unexpected entries, particularly those defining shell commands, should be treated as suspicious. 
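
Part of this review can also be automated. The following sketch flags auto-run tasks in .vscode/tasks.json before a project is opened; it is our illustration and assumes plain JSON (real tasks.json files may contain comments, which JSON.parse rejects).

javascript

// Illustrative pre-open check for auto-run tasks.
const fs = require("fs");
const path = require("path");

const tasksPath = path.join(process.argv[2] || ".", ".vscode", "tasks.json");

if (fs.existsSync(tasksPath)) {
  const config = JSON.parse(fs.readFileSync(tasksPath, "utf8"));
  for (const task of config.tasks || []) {
    if (task.runOptions && task.runOptions.runOn === "folderOpen") {
      console.warn(`[suspicious] auto-run task "${task.label}": ${task.command}`);
    }
  }
}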

Vendor Response Summary 

The following table reflects the outcome of responsible disclosure across all tools covered in this research. 

Tool | Vendor | Reported | Outcome 
Cursor CLI | Cursor | Feb 10, 2026 | Closed as Informative ("lacks an attack vector"); the attack flow remains valid 
AWS Kiro | Amazon | Feb 2, 2026 | Closed as Informative ("expected behavior"); the attack flow remains valid 
GitHub Copilot CLI | GitHub / Microsoft | Feb 11, 2026 | Triaged -> Low severity 
Codex App | OpenAI | Mar 5, 2026 | Closed as Not Applicable; the attack flow remains valid 
Gemini CLI (Windows) | Google | Jan 5, 2026 | Triaged -> Acknowledged 

How Cymulate Exposure Validation Addresses This Attack Surface 

The vulnerability classes documented in this research series, both the sandbox escapes from Part 1 and the remote code execution paths in Part 2, now have dedicated coverage in Cymulate Exposure Validation. 

Cymulate customers can run dedicated templates and attack scenarios that now include host application-based AI agents as part of the full kill chain: 

  • Initial foothold via prompt injection or binary planting 
  • Code execution on the host 
  • Credential access and persistence 

These scenarios are mapped to the relevant MITRE ATT&CK techniques for each tool and can be used to validate whether existing detection and response controls identify post-exploitation activity originating from an AI agent process. 

As this research series continues to cover additional AI CLI tools and vulnerability classes, corresponding scenarios will be added to maintain continuous coverage of this emerging attack surface. 

Until vendors fully remediate these issues, security teams should treat AI coding agents as untrusted code-execution surfaces and apply the following controls: 

  1. Inventory, restrict and vet AI CLI usage – Identify which AI coding tools (Cursor, Kiro, Codex, Gemini CLI, Claude Code, etc.) are in use across the organization. Restrict installation to vetted endpoints via software allowlisting (AppLocker, WDAC, Jamf, Intune) and remove agents that vendors have declined to remediate. Factor vendor disclosure responsiveness into procurement and security review, and require documented risk acceptance at the appropriate leadership level for any tool with unresolved vulnerabilities. 
  2. Harden and sandbox the execution environment – Run AI CLIs inside isolated environments such as devcontainers, WSL with limited file access, ephemeral VMs, or cloud development environments (GitHub Codespaces, Coder, Gitpod). Do not allow agents to run on the host OS with administrative privileges or with access to credentials, SSH keys, or cloud profiles.  
  3. Monitor AI agent activity in the SIEM – Forward telemetry on AI CLI process trees, file writes to execution-sensitive locations (tasks.json, settings.json, .vscode/, shell rc files, scheduled tasks), and child-process creation by IDEs. Alert when an AI agent writes to these files outside an explicit user-initiated configuration change, or when unsigned binaries execute from user profile, downloads, or repository paths. 
  4. Add prompt injection to the threat model – Any workflow in which an LLM consumes external content (web pages, repositories, issues, documents, email) must include prompt injection as an explicit threat. Treat agent outputs and file writes as untrusted by default and require human approval for file system, shell, and network actions. 
  5. Validate prevention and detection coverage (for Cymulate clients) – Run the corresponding Cymulate validation scenarios to test whether existing endpoint, EDR and SIEM controls prevent and detect the binary-planting and configuration-poisoning techniques described in this report. Use the results to close gaps in detection rules, allowlisting policies, and response playbooks before these chains are weaponized in the wild. 