A Look Into Browser Extension Security
Notes from a presentation in a reading group
Introduction
I’m in a paper reading group for class. We present in small teams, and each round focuses on a different security domain with an older paper paired to a newer one that builds on it. This was our second presentation.
This time, we picked browser extensions. We were supposed to find a neat seminal-to-follow-on pairing, but we went with these two because extension security keeps showing up in the news and it had never come up in our group before:
- Protecting Browsers from Extension Vulnerabilities
- Extending a Hand to Attackers: Browser Privilege Escalation Attacks via Extensions
The older one helped us set the baseline model, and the newer one shows how privilege escalation still slips through the cracks. I also wanted to include The Dangers of Human Touch: Fingerprinting Browser Extensions through User Actions, but we were time-bound and it doesn’t line up tightly with the first paper we planned to present.
For context, our first presentation was on onion-routing security. I started with Anonymous Connections and Onion Routing as a personal warm-up, then we treated Tor: The Second-Generation Onion Router as the seminal anchor and RAPTOR: Routing Attacks on Privacy in Tor as the newer follow-up.
Browser Extensions in Recent News
Trends in Academia in Recent Times
Protecting Browsers from Extension Vulnerabilities
Back in 2010, browser extension security wasn’t treated as its own serious research topic. Firefox extensions ran with the browser’s full privileges, and security failures mostly looked like “bad extensions” rather than an architectural problem. A year earlier, DEFCON 17 had a talk titled “Abusing Firefox Extensions” by Liverani and Freeman that demonstrated practical attacks against popular extensions, which helped underline how serious the problem was. This paper flips the framing: the Berkeley + Google authors study real Firefox extensions, show that most of them are over-privileged, and use that evidence to motivate a new extension architecture that later shaped Chrome.
The proposal is simple in spirit but strong in effect: least privilege by default, privilege separation by construction, and hard isolation between components. Extensions are split into content scripts (exposed to web pages), a core (where most privileges live), and an optional native binary (powerful but kept far from web input). The idea is to make an exploit chain harder, not just make any one bug less likely.
What they measured (and what fell out)
Their first move was empirical. They manually inspected 25 popular Firefox extensions to understand the behaviors those extensions actually needed, then compared those needs to the power of the interfaces they used. The gap is the story:
- Only 3 out of 25 required “critical” privileges, yet all 25 effectively had them.
- 19 used critical-rated interfaces despite not needing critical privileges.
- 76% of the extensions used interfaces that were more powerful than their behavior required.
Threat model (who the attacker is)
The paper assumes “benign-but-buggy” extensions: developers are well-meaning, but not necessarily security experts. The attacker doesn’t trick users into installing native executables. Instead, they exploit vulnerabilities in extension code and inherit the extension’s privileges. The model includes two attacker types:
- A web attacker controlling a site the user visits.
- An active network attacker who can tamper with HTTP traffic.
The browser itself is treated as non-vulnerable so the focus stays on extension risk.
One subtlety the paper calls out: extensions are not the same as plug-ins. Plug-ins are typically loaded by sites via specific MIME types, while extensions act on pages without explicit site requests. The paper stays scoped to extensions only, not plug-ins. In practice, classic browser plug-ins are basically gone today (NPAPI/Flash/Java/Silverlight are all retired), so the paper’s focus on extensions also lines up with how browsers actually work now.
Vulnerability classes they highlight
They focus on how web content can reach powerful extension privileges:
- Extension XSS (e.g., unsafe `eval` or `document.write`) and how optional sandboxing isn’t a full fix.
- Mixed-content injection (loading scripts over HTTP or injecting HTTP into HTTPS pages).
- Replacing native DOM APIs with attacker-controlled lookalikes (the XPCNativeWrapper story).
- JavaScript capability leaks (exposing privileged objects to web pages).
Each class is less about a specific bug and more about how tightly Firefox bound untrusted content to powerful extension APIs.
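The extension-XSS class is the easiest one to make concrete. Here is a minimal sketch of the vulnerable pattern (splicing page-controlled text into markup the extension will render with its privileges) and the escaping that blunts it. `escapeHtml` and the sample strings are illustrative helpers, not anything from the paper:

```javascript
// Sketch: why building markup from untrusted page strings is the classic
// extension-XSS pattern. escapeHtml is a hypothetical helper.
function escapeHtml(untrusted) {
  // Neutralize the characters that let text break out of HTML context.
  return untrusted
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;")
    .replace(/'/g, "&#39;");
}

// What a vulnerable extension effectively does: attacker-controlled page
// content flows straight into markup parsed with extension privileges.
const pageTitle = '<img src=x onerror="stealCookies()">';
const unsafeMarkup = `<div class="preview">${pageTitle}</div>`;
const safeMarkup = `<div class="preview">${escapeHtml(pageTitle)}</div>`;

console.log(unsafeMarkup.includes("onerror")); // attacker markup survives
console.log(safeMarkup.includes("<img"));      // false: reduced to inert text
```

The point of the sketch is the paper's framing: the bug class exists because Firefox let this privileged string-to-DOM path exist at all, not because any one developer forgot one escape call.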
How they manually analyzed Firefox extensions and APIs
The survey was hands-on. They picked two extensions from each of 13 “recommended” Firefox categories (25 total), ran them, and used the UI to understand what each extension actually needed to do. They then searched for API interfaces in the source code and manually matched behavior to interfaces. That let them assign a security severity level to behaviors and interfaces using Mozilla’s five-level scale: critical, high, medium, low, none.
This manual mapping is why the “gap” is convincing: it’s not just static analysis of code; it’s behavioral intent vs. the actual capabilities the code grabbed.
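The bookkeeping behind that gap is simple to sketch. Using Mozilla's real five-level scale but invented extension entries (the actual 25 are in the paper), an extension counts as over-privileged when the interfaces it touches are rated strictly above what its observed behavior needs:

```javascript
// Mozilla's five-level severity scale, ordered for comparison.
const SEVERITY = { none: 0, low: 1, medium: 2, high: 3, critical: 4 };

// Illustrative entries only; not the paper's surveyed extensions.
const surveyed = [
  { name: "toolbar-a",  behaviorNeeds: "low",      interfacesUse: "critical" },
  { name: "dictionary", behaviorNeeds: "none",     interfacesUse: "high" },
  { name: "downloader", behaviorNeeds: "critical", interfacesUse: "critical" },
  { name: "theme-swap", behaviorNeeds: "low",      interfacesUse: "low" },
];

// Over-privileged: interface severity strictly exceeds behavioral need.
const overPrivileged = surveyed.filter(
  (e) => SEVERITY[e.interfacesUse] > SEVERITY[e.behaviorNeeds]
);

const gapRate = overPrivileged.length / surveyed.length;
console.log(overPrivileged.map((e) => e.name)); // ["toolbar-a", "dictionary"]
console.log(gapRate); // 0.5
```

In the paper's real sample the same computation yields the headline 76%: 19 of 25 extensions sat above the line their behavior justified.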
The deductive system for escalation points
Beyond the survey, the paper tries to answer a deeper question: if an extension requests a low-privilege interface, can it still reach a high-privilege one through the API surface? To answer that, they model the Firefox extension API (XPCOM) as a security lattice and compute reachability through interfaces.
Key parts of their setup:
- Interfaces are defined in an IDL (think CORBA-style types). The browser enforces the declared parameter and return types.
- They manually labeled 613 interfaces (out of 1582 total) with security severity.
- They built a Datalog-backed analyzer that deduces what interfaces become reachable when you have access to a given interface.
The inference rules are the interesting bit. The notation is: $\rho \leadsto_{\eta} \alpha$ means principal $\rho$ has a reference to interface $\alpha$ implemented by principal $\eta$. Here’s the rule set side by side with the intuition:
| Rule | Intuition | KaTeX rule |
|---|---|---|
| Subtyping | If an interface is a subtype of another, reachability carries over. | $\frac{\rho \leadsto_{\eta} \alpha \quad \alpha \le \beta}{\rho \leadsto_{\eta} \beta}$ |
| Method | If you can call a method that returns $\beta$, you can reach $\beta$. | $\frac{\rho \leadsto_{\eta} \alpha \quad \alpha.\text{method}(\beta)}{\rho \leadsto_{\eta} \beta}$ |
| Getter | Getters are methods that return a value. | $\frac{\rho \leadsto_{\eta} \alpha \quad \alpha.\text{method}(1 \to \beta)}{\rho \leadsto_{\eta} \beta}$ |
| Setter | Setters are methods that take a value. | $\frac{\rho \leadsto_{\eta} \alpha \quad \alpha.\text{method}(\beta \to 1)}{\rho \leadsto_{\eta} \beta}$ |
| Type forgery | Any principal can synthesize an object that claims to implement an interface. | $\frac{}{ \rho \leadsto_{\rho} \alpha }$ |
| Return | If $\rho$ can call a method $\alpha \to \beta$ implemented by $\eta$, and can supply an $\alpha$, then the return gives $\rho$ a $\beta$ (implemented by $\delta$). | $\frac{\rho \leadsto_{\eta} \alpha \to \beta \quad \rho \leadsto_{\gamma} \alpha \quad \eta \leadsto_{\delta} \beta}{\rho \leadsto_{\delta} \beta}$ |
| Parameter | The callee can also gain access to the argument it is handed. | $\frac{\rho \leadsto_{\eta} \alpha \to \beta \quad \rho \leadsto_{\gamma} \alpha}{\eta \leadsto_{\gamma} \alpha}$ |
Because they don’t analyze concrete implementations, this is an over-approximation. That’s a feature here: the goal is to surface potential escalation paths and identify “escalation points” where a narrow interface still leaks broader power. Those points are exactly what their new Chrome-like extension design tries to eliminate.
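The core of the analyzer is just a reachability closure over interface metadata. Here is a minimal sketch of that idea, restricted to the subtyping and method-return rules; the interface names and their edges are invented (the real analysis labels 613 of 1,582 XPCOM interfaces):

```javascript
// Toy interface graph: subtyping edges plus method return types.
// Names are illustrative, not real XPCOM interfaces.
const interfaces = {
  nsIToy:      { subtypeOf: ["nsISupports"], returns: ["nsIFile"] },
  nsISupports: { subtypeOf: [],              returns: [] },
  nsIFile:     { subtypeOf: ["nsISupports"], returns: ["nsIProcess"] },
  nsIProcess:  { subtypeOf: ["nsISupports"], returns: [] }, // "critical"
};

// Compute everything reachable from one starting interface via a worklist.
function reachable(start) {
  const seen = new Set([start]);
  const work = [start];
  while (work.length > 0) {
    const meta = interfaces[work.pop()];
    if (!meta) continue;
    // Subtyping rule: a reference to α also gives every supertype β.
    // Method/getter rules: a method returning β yields a β reference.
    for (const next of [...meta.subtypeOf, ...meta.returns]) {
      if (!seen.has(next)) {
        seen.add(next);
        work.push(next);
      }
    }
  }
  return seen;
}

// A harmless-looking starting interface still reaches the process-spawning
// one: an "escalation point" in the paper's terms.
console.log(reachable("nsIToy").has("nsIProcess")); // true
```

The real system expresses these rules in Datalog rather than a hand-rolled worklist, but the fixpoint it computes is the same flavor: which interfaces become reachable given a starting reference.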
Two extra details from the text matter for how these rules behave in practice. First, because the analysis ignores concrete implementations, the parameter-flow rule can fire even when no real implementation ever calls a particular method on its input; that over-approximation is the price of surfacing escalation paths a strictly implementation-based analysis would miss. Second, the type-forgery rule is not hypothetical: extensions can manufacture objects that claim to implement XPCOM interfaces via queryInterface, which lets them reach methods they couldn’t otherwise call.
Proposed model: Chrome extension system
The proposed architecture is the Chrome extension model built around least privilege, privilege separation, and isolation. The idea is to make the default extension shape safer, not to bolt on after-the-fact checks.
Least privilege via the manifest. Every extension declares what it wants up front. Privileges fall into three buckets:
- Execute arbitrary code by listing a native binary in the manifest (NPAPI in the 2010 era).
- Web site access by origin patterns, so an extension can target `*.google.com` without ever seeing `bank.com`.
- API access via named groups like `tabs`, only granted if explicitly listed.
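The origin-pattern idea can be sketched as a tiny matcher. This handles only the `*.example.com` host form, a simplified subset of Chrome's real match-pattern syntax; `hostMatches` and the sample hosts are illustrative:

```javascript
// Simplified origin-pattern matcher: "*." prefix means the bare host or
// any subdomain of it. Not the full Chrome match-pattern grammar.
function hostMatches(pattern, host) {
  if (pattern === "<all_urls>") return true;
  if (pattern.startsWith("*.")) {
    const suffix = pattern.slice(2); // e.g. "google.com"
    return host === suffix || host.endsWith("." + suffix);
  }
  return host === pattern;
}

// A manifest-style declaration: the extension asks only for Google hosts.
const requestedHosts = ["*.google.com"];

function extensionMaySee(host) {
  return requestedHosts.some((p) => hostMatches(p, host));
}

console.log(extensionMaySee("mail.google.com")); // true
console.log(extensionMaySee("bank.com"));        // false
```

The security payoff is on the browser side: requests to hosts outside the declared patterns never reach the extension at all, so a compromised extension inherits only the origins it listed up front.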
The paper also cares about incentives: the Chrome gallery tightens the install UX for high‑privilege extensions and blocks arbitrary‑code extensions unless the developer signs a contract. Installing from outside the gallery triggers the same scary flow as downloading a native executable, so a site that tricks a user into installing a malicious extension gains little beyond what a plain binary would already give it.
Privilege separation by design. Extensions are forced into three components:
- Content scripts live inside web pages and can only touch the DOM plus message the core.
- Extension core runs with the extension APIs and can reach the network, but only for origins listed in the manifest.
- Native binary is optional and the only place with arbitrary code / file access.
The key property is that the most exposed component (content scripts) never directly talks to the most privileged component (native binary). That makes multi‑step exploitation the expected path, not the exception.
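Privilege separation only pays off if the core treats content-script messages as untrusted input, since the content script is the component an attacker compromises first. Here is a hedged sketch of that discipline; the message shape and action allowlist are invented for illustration, not an API from the paper:

```javascript
// The core validates structure before acting with its privileges.
// Message shape and allowlist are illustrative only.
const ALLOWED_ACTIONS = new Set(["lookupWord", "saveBookmark"]);

function handleContentScriptMessage(msg) {
  if (typeof msg !== "object" || msg === null) return { error: "bad message" };
  if (!ALLOWED_ACTIONS.has(msg.action)) return { error: "unknown action" };
  if (typeof msg.payload !== "string" || msg.payload.length > 1024) {
    return { error: "bad payload" };
  }
  // Only past these checks would the core touch privileged APIs.
  return { ok: true, action: msg.action };
}

console.log(handleContentScriptMessage({ action: "lookupWord", payload: "xpcom" }));
// A compromised content script asking for more than the protocol allows:
console.log(handleContentScriptMessage({ action: "spawnProcess", payload: "" }));
```

This is the multi-step exploitation the architecture aims for: even after owning the content script, the attacker gets only whatever the core's message protocol exposes, not the core's full privileges.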
Isolation mechanisms. The model adds three layers:
- Origin isolation by embedding the extension’s public key in its URL (`chrome-extension://<public key>/`), which avoids a central naming authority and lets the same‑origin policy do real work.
- Process isolation for the core and native binaries, so a renderer compromise doesn’t immediately grant extension APIs.
- Isolated worlds for content scripts: the page and the content script see the same DOM, but they do not share JS objects or pointers, which is meant to reduce capability leaks and DOM‑level “rootkits.”
Evaluation and overhead. When they survey 25 popular Chrome extensions, privilege requests are already narrower than Firefox. Only one extension (Cooliris) asks for more than it needs, and overall the privilege gap shrinks. The cost is real but manageable: message round‑trips across components average ~0.8ms, and isolated worlds add about a third to raw DOM micro‑benchmarks, but the paper argues those costs are small in real user flows.
At the adoption level, the contrast is sharp: the 2010 model directly influenced Chrome’s extension architecture, while the 2023 FISTBUMP design still looks like a research prototype with no clear mainstream deployment.
What Chromium Looks Like Now
The 2010 paper still gets the high-level idea right: keep the page-facing part weak, keep the privileged part separate, and make privilege requests explicit. But the current Chromium model is no longer just “content script + core + optional native binary.” Today the real story is Manifest V3, and Manifest V2 is fully disabled as of July 24, 2025. The privileged “core” is now an extension service worker, not a persistent background page, so it wakes up on events and has no DOM access. If an extension needs DOM access outside the page, Chromium pushes that into an offscreen document. The page-facing piece is still the content script, and it still runs in an isolated world, which remains one of the original Chrome model’s most durable ideas. The old native-binary idea also survived, but in a much narrower form via native messaging, where a separately installed host must explicitly allow specific extension origins.
Privilege is also tighter than in the paper-era Chrome model. Host access is split across host_permissions / optional_host_permissions and narrower grants like activeTab. That matters because Chromium’s docs frame permissions as damage control: if the extension is compromised, the attacker gets only what the extension actually asked for. Manifest V3 also tightened the extension boundary itself. Extension pages have a minimum CSP that cannot be relaxed to allow remote script execution, and web-accessible resources are blocked by default because exposing them makes extensions easier to fingerprint and attack. On the network side, Chromium has been steering extensions away from blocking webRequest listeners and toward declarativeNetRequest, where the browser applies packaged rules without handing every request to extension JavaScript. That is both a performance decision and a security/privacy decision.
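The declarativeNetRequest shift is easy to illustrate: the browser matches packaged rules itself instead of handing each request to extension JavaScript. This toy evaluator mimics DNR's rule shape (`id`, `priority`, `action`, `condition`) but is a simplified sketch, not the real matching algorithm:

```javascript
// Toy declarativeNetRequest-style rules: packaged data, no extension code
// runs per-request. Rule contents are illustrative.
const rules = [
  { id: 1, priority: 1, action: { type: "block" },
    condition: { urlFilter: "tracker.example" } },
  { id: 2, priority: 2, action: { type: "allow" },
    condition: { urlFilter: "tracker.example/ok" } },
];

function decide(url) {
  // Highest-priority matching rule wins; the default is to allow.
  const matches = rules.filter((r) => url.includes(r.condition.urlFilter));
  if (matches.length === 0) return "allow";
  matches.sort((a, b) => b.priority - a.priority);
  return matches[0].action.type;
}

console.log(decide("https://tracker.example/pixel.gif")); // "block"
console.log(decide("https://tracker.example/ok/asset"));  // "allow"
console.log(decide("https://news.example/"));             // "allow"
```

Because the rules are static data, review can see exactly what the extension will ever do to traffic, and no request bytes flow through extension script at all.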
Does Chromium treat supply chain as part of the threat model?
Yes, much more than the 2010 paper did, but mostly through platform and store policy rather than through the runtime alone.
The clearest signal is Manifest V3’s ban on remotely hosted code, which Chromium explicitly justifies as a way to stop extensions from executing unreviewed logic. The Chrome Web Store’s Manifest V3 requirements go further and say the full functionality of an extension must be discernible from the submitted package. Remote data is still allowed, and remote configuration is still allowed for things like A/B tests, but the logic must stay inside the reviewed bundle. That is a supply-chain control, not just a sandboxing control.
The same is true in the store pipeline. The Chrome Web Store Program Policies require the narrowest permissions necessary, ban code obfuscation and concealed functionality, and allow removal when an extension exposes exploitable security bugs. The review process explicitly targets scams, data harvesting, malware, and malicious actors, and published items can be warned, taken down, or auto-disabled. On the account side, 2-Step Verification is required, and as of May 2025 Chrome Web Store also added verified uploads, so developers can require future uploads to be signed by their own trusted key. That feature exists specifically to help even if the dashboard account or publishing workflow is compromised.
But this is not the same as saying supply chain is solved. Chromium still treats the reviewed package, the publisher account, and the store update path as trust anchors. In other words, Chromium now clearly recognizes supply-chain abuse as part of extension security, but it handles that risk with code-reviewability rules, account hardening, signing, and store enforcement. The 2010 paper was mostly about over-privileged extension code. The 2026 landscape is about that plus malicious updates, compromised publisher accounts, opaque remote behavior, and post-publication enforcement. Architecture still matters, but the ecosystem has moved a lot of the fight into the supply chain.
Browser Extension Incidents Through Time
The papers explain the model. The docs explain the platform. But neither really shows how browser extension abuse looked over time in the wild.
So I put together a raw incident dataset from security news, advisories, and vendor writeups, then plotted the reports by year. The chart below keeps the raw yearly counts and stacks them by incident bucket, so it shows both how much reporting there was and what kind of cases were showing up.
This is not a perfect count of every bad extension ever shipped. It is a reporting-based dataset, so the older years are thinner and the newer years are much easier to see. Also, some reports fit more than one label, so for the chart I assign each report to one primary display bucket to keep the stacked totals readable. Still, it is enough to show how the shape of the problem changed. If you want to inspect the raw file yourself, here is the exact dataset behind this chart: browser_extension_incidents_raw.json.
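The single-primary-bucket rule mentioned above is the only transformation between the raw file and the chart, so here is a sketch of it. The bucket names and sample records are illustrative, not the real dataset's schema:

```javascript
// Each report keeps all its labels in the raw data, but gets exactly one
// display bucket for the chart: first matching label in this order wins.
// Bucket names and sample reports are made up for illustration.
const BUCKET_ORDER = ["malware", "data-harvesting", "supply-chain", "vulnerability"];

const reports = [
  { year: 2018, labels: ["data-harvesting"] },
  { year: 2020, labels: ["supply-chain", "malware"] },
  { year: 2020, labels: ["vulnerability"] },
  { year: 2023, labels: ["malware", "data-harvesting"] },
];

function primaryBucket(labels) {
  return BUCKET_ORDER.find((b) => labels.includes(b)) ?? "other";
}

// Stack counts per (year, bucket) pair.
const counts = {};
for (const r of reports) {
  const key = `${r.year}:${primaryBucket(r.labels)}`;
  counts[key] = (counts[key] ?? 0) + 1;
}

console.log(counts);
```

The cost of this choice is visible in the sketch: a report tagged both supply-chain and malware shows up only under malware, which keeps stacked totals equal to report counts at the price of undercounting secondary labels.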
(Figure: yearly counts of browser-extension incident reports, stacked by incident bucket.)