arxiv:2606.20023

When Lower Privileges Suffice: Investigating Over-Privileged Tool Selection in LLM Agents

Published on Jun 18

Submitted by Yuchi Wang on Jun 25

Beijing Academy of Artificial Intelligence


Authors: Kaiyue Yang, Yuchi Wang

Abstract

LLM agents frequently select higher-privilege tools unnecessarily, and while safety alignment doesn't ensure least-privilege choices, a post-training defense can reduce excessive privilege use without sacrificing performance.

Generated by Qwen/Qwen2.5-Coder-32B-Instruct

As LLM agents increasingly select tools autonomously, their choices among tools with different privileges become safety-relevant. However, prior tool-selection studies focus on safety-agnostic metadata preferences, leaving privilege-sensitive choices underexplored. To address this gap, we study over-privileged tool selection, in which an agent selects or escalates to a higher-privilege tool despite a sufficient lower-privilege alternative. We introduce ToolPrivBench to evaluate this behavior, measuring both initial selection and escalation after transient tool failures. Across eight domains and five recurring risk patterns, we find that over-privileged tool selection is common among mainstream LLM agents and is further amplified by transient failures. We also find that general safety alignment does not reliably transfer to least-privilege tool choice, and that prompt-level controls provide only limited mitigation under transient failures. We therefore introduce a privilege-aware post-training defense that teaches agents to prefer sufficient lower-privilege tools and to escalate only when necessary. Our mitigation experiments show that this defense substantially reduces unnecessary high-privilege tool use while preserving general capabilities.
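The definition in the abstract can be made concrete with a small sketch. The code below is illustrative only and is not ToolPrivBench's scoring code: the `Tool` and `Episode` types, the integer privilege scale, the `sufficient` flag, and both rate functions are hypothetical names chosen for this example. It flags a choice as over-privileged when a sufficient tool with lower privilege was available, and counts an escalation when the choice made after a transient failure moves to a higher, unnecessary privilege level.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Tool:
    name: str
    privilege: int    # hypothetical scale: larger means broader privilege
    sufficient: bool  # assumed label: can this tool complete the task?


@dataclass(frozen=True)
class Episode:
    tools: Sequence[Tool]
    initial: Tool                 # tool chosen first
    retry: Optional[Tool] = None  # tool chosen after a transient failure, if any


def least_sufficient(tools: Sequence[Tool]) -> int:
    """Lowest privilege level among tools that can complete the task."""
    return min(t.privilege for t in tools if t.sufficient)


def over_privileged(chosen: Tool, tools: Sequence[Tool]) -> bool:
    """True when a sufficient lower-privilege alternative existed."""
    return chosen.privilege > least_sufficient(tools)


def initial_rate(episodes: Sequence[Episode]) -> float:
    """Fraction of episodes whose first choice was over-privileged."""
    return sum(over_privileged(e.initial, e.tools) for e in episodes) / len(episodes)


def escalation_rate(episodes: Sequence[Episode]) -> float:
    """Among retried episodes, fraction that moved to an unnecessary higher privilege."""
    retried = [e for e in episodes if e.retry is not None]
    if not retried:
        return 0.0
    escalated = [
        e for e in retried
        if e.retry.privilege > e.initial.privilege and over_privileged(e.retry, e.tools)
    ]
    return len(escalated) / len(retried)


read_only = Tool("read_file", privilege=1, sufficient=True)
admin = Tool("root_shell", privilege=3, sufficient=True)
tools = (read_only, admin)

episodes = [
    Episode(tools, initial=read_only),
    Episode(tools, initial=admin),
    Episode(tools, initial=read_only, retry=read_only),
    Episode(tools, initial=read_only, retry=admin),
]

print(initial_rate(episodes))     # 0.25
print(escalation_rate(episodes))  # 0.5
```

Keeping the two rates separate mirrors the abstract's distinction between initial selection and escalation after transient failures; how the paper actually labels sufficiency or aggregates scores is not stated on this page.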


Community


sleepywinterfox

2 days ago

Interesting finding, but have you considered that dynamic tool provisioning eliminates this entirely? If the model only sees tools appropriate to its current privilege tier, and has to explicitly escalate through a separate evaluation gate to access higher-privilege tools, over-privileged selection becomes structurally impossible regardless of model behaviour. No post-training needed.
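For readers following the thread, the provisioning idea in the comment above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not anything from the paper: `TieredToolbox`, the integer tiers, and the `approve` callback are hypothetical, and the example gate (approve only when the request states that the lower tier is insufficient) is a stand-in for whatever evaluation a real system would run.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass(frozen=True)
class Tool:
    name: str
    tier: int  # hypothetical privilege tier: 0 is the lowest


class TieredToolbox:
    """Exposes only tools at or below the current tier; raising the tier needs gate approval."""

    def __init__(self, tools: Iterable[Tool], approve: Callable[[int, str], bool]):
        self._tools = list(tools)
        self._approve = approve  # separate evaluation gate (assumed interface)
        self.tier = 0

    def visible(self) -> List[str]:
        """Names of the tools the agent is allowed to see right now."""
        return [t.name for t in self._tools if t.tier <= self.tier]

    def request_escalation(self, target_tier: int, reason: str) -> bool:
        """Raise the tier only if the gate approves; otherwise leave it unchanged."""
        if target_tier > self.tier and self._approve(target_tier, reason):
            self.tier = target_tier
            return True
        return False


def example_gate(target_tier: int, reason: str) -> bool:
    # Illustrative policy only: a transient failure is not grounds for escalation.
    return reason.startswith("insufficient:")


box = TieredToolbox(
    [Tool("read_file", 0), Tool("write_file", 1), Tool("root_shell", 2)],
    approve=example_gate,
)

print(box.visible())                                         # ['read_file']
print(box.request_escalation(1, "retry after timeout"))      # False
print(box.request_escalation(1, "insufficient: must edit"))  # True
print(box.visible())                                         # ['read_file', 'write_file']
```

In this sketch the necessity judgment does not disappear; it moves from the agent's tool choice into the gate's approval policy.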


