Feed aggregator

I recently did some tests on two MCPs with identical functionalities. Turns out one of them has really bad performance. So I wanna share those bad MCP design patterns that cause this.

It all started when I wrote an MCP Server (MCP-A) for a to-do list app. Later, the app officially released its own MCP Server (MCP-B). Both MCPs have the same functionalities and hit the same backend API.

The experiment is set up as follows:

- Both MCP Servers connect to the same ToDo list account, and it will be reset after each test.
- 40 test prompts to simulate typical use cases for these MCPs.
- The test was conducted with the same model, system prompt, and Agent framework.

Here are the results:

| Metric              | MCP-A       | MCP-B       | Gap   |
| ------------------- | ----------- | ----------- | ----- |
| Tool Desc Length    | 11,464      | 3,682       | —     |
| Pass Rate           | 36/40 (90%) | 36/40 (90%) | Same  |
| Total input tokens  | 637,244     | 3,174,329   | 4.98× |
| Total output tokens | 17,301      | 23,238      | 1.34× |
| Total Agent steps   | 122         | 157         | 1.29× |
| Total time          | 597s        | 676s        | 1.13× |
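The Gap column can be reproduced from the raw totals. The short sketch below simply divides each MCP-B total by the matching MCP-A total; the dictionary keys and the `gap` helper are names made up for this illustration, and the numbers are copied from the table.

```python
# Totals copied from the results table, as (MCP-A, MCP-B) pairs.
totals = {
    "input_tokens": (637_244, 3_174_329),
    "output_tokens": (17_301, 23_238),
    "agent_steps": (122, 157),
    "time_seconds": (597, 676),
}


def gap(a: float, b: float) -> float:
    """Ratio of the MCP-B total to the MCP-A total, rounded to two decimals."""
    return round(b / a, 2)


gaps = {name: gap(a, b) for name, (a, b) in totals.items()}
extra_steps = totals["agent_steps"][1] - totals["agent_steps"][0]

for name, ratio in gaps.items():
    print(f"{name}: {ratio}x")
print(f"extra agent steps for MCP-B: {extra_steps}")
```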

---

The results show that MCP-B took 35 more ReAct loops than MCP-A to complete the 40 test cases (157 vs. 122 Agent steps, about 29% more), along with about 34% more output tokens. I examined the logs and found that the root cause is poor query tool design.

Take `search_tool`, for example. Its job is to find a to-do item in the ToDo list. In MCP-B, this tool returns this:

{ "id": "6a1916b48f08cb3a4c857ed0", "title": "buy some groceries", "url": "https://todo.example.com/tasks/6a1916b48f08cb3a4c857ed0" }

But other CRUD operations require `project_id`, and `search_tool` doesn't return it. So the Agent has to call another tool, `get_task_by_id`, before it can act. On the other hand, MCP-A's `query_tasks` returns all the info needed to perform the next action in a single call:

Task 1:
ID: 6a19143e8f084a8c8101612f
Title: buy some groceries
Project ID: 6a1914378f084a8c810160a9
Start Date: 2025-07-19 10:00:00
Priority: Medium
Status: Active

Unfiltered API Data was dumped into context window

If an MCP passes raw API results into the Agent's context unprocessed, the Agent's context window fills up very fast.

Take MCP-B's `create_task` tool, for example. Its job is to create a to-do item. This is what this tool returns:

{ "id": "6a180de78f086bdead0608be", "projectId": "inbox125587327", ..... "createdTime": "2026-05-28T09:41:59+0000", "modifiedTime": "2026-05-28T09:41:59+0000", "focusSummaries": null }

These 600+ characters mean nothing to the Agent's task, but are still dumped into the Agent's context. On the other hand, MCP-A's create_tasks filters and formats the response before returning it. This little tweak makes a huge difference in input token usage: across the 40 tests, MCP-B consumed 4.98× as many input tokens as MCP-A.
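A filtering layer of this kind can be quite small. The sketch below is not MCP-A's actual code: the `KEEP_FIELDS` whitelist, the `format_task` helper, and the `title` value in the sample payload are all assumptions made for illustration, and only the other field names come from the response shown above.

```python
from typing import Any

# Hypothetical whitelist of fields the Agent plausibly needs for a follow-up call.
KEEP_FIELDS = ("id", "projectId", "title", "startDate", "priority", "status")


def format_task(raw: dict[str, Any]) -> str:
    """Render a raw API task as short labelled lines, dropping all other fields."""
    lines = []
    for key in KEEP_FIELDS:
        value = raw.get(key)
        if value is not None:  # skip absent or null fields
            lines.append(f"{key}: {value}")
    return "\n".join(lines)


# Sample payload: a subset of the fields shown above plus an assumed title.
raw_response = {
    "id": "6a180de78f086bdead0608be",
    "projectId": "inbox125587327",
    "title": "buy some groceries",  # assumed for illustration
    "createdTime": "2026-05-28T09:41:59+0000",
    "modifiedTime": "2026-05-28T09:41:59+0000",
    "focusSummaries": None,
}

print(format_task(raw_response))
```

Timestamps and null fields never reach the context window, while the IDs needed for the next call are kept.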

Another issue is tool count. More tools mean a larger candidate set for the model to choose from, which directly increases decision difficulty. In MCP-A, 47 tools were compressed down to 14, covering the same functionality with fewer tools.
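The post does not say how the tools were merged, so the following is only one plausible approach: folding several single-purpose tools (rename, set priority, complete, reopen) into one update tool with optional parameters. The `Task` class, the in-memory `TASKS` store, and the `update_task` name are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Task:
    id: str
    title: str
    priority: str = "None"
    status: str = "Active"


# Hypothetical in-memory store standing in for the backend API.
TASKS = {"t1": Task(id="t1", title="buy some groceries")}


def update_task(
    task_id: str,
    title: Optional[str] = None,
    priority: Optional[str] = None,
    status: Optional[str] = None,
) -> str:
    """One tool that covers renaming, re-prioritising, completing and reopening."""
    task = TASKS[task_id]
    changes = {"title": title, "priority": priority, "status": status}
    applied = []
    for name, value in changes.items():
        if value is not None:  # only touch fields the caller actually passed
            setattr(task, name, value)
            applied.append(f"{name}={value}")
    return f"Updated task {task.id}: " + (", ".join(applied) or "no changes")


print(update_task("t1", priority="High", status="Completed"))
```

The model then picks from one tool with three optional arguments instead of four near-identical tools.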

---

So here are my takeaways on good MCP tool design:

- When designing a tool, think about what the Agent will need next, not just what it's asking for right now. Return enough context in the result so the Agent can take the next action without making another round-trip.

- Too many tools will increase the model's decision burden. So it'd be better to minimize the number of tools within an MCP. Make sure they don't overlap functionalities.

- When your MCP returns data to the LLM, try to keep it LLM-friendly, which means readable. You can filter out unnecessary fields from the API response and format the data, rather than passing through raw JSON.
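The first takeaway can be made concrete by counting tool calls. In this sketch the in-memory `TASKS` list stands in for the backend, `get_task_by_id` is the lookup tool named earlier, and `search_minimal`, `search_with_context`, and `complete_task` are hypothetical names; none of this is the real code of either server.

```python
# Hypothetical stand-in for the to-do backend; the IDs reuse the earlier example.
TASKS = [
    {
        "id": "6a19143e8f084a8c8101612f",
        "title": "buy some groceries",
        "project_id": "6a1914378f084a8c810160a9",
    }
]


def search_minimal(query: str) -> list[dict]:
    """Search that returns only id and title, so a second lookup is required."""
    return [{"id": t["id"], "title": t["title"]} for t in TASKS if query in t["title"]]


def get_task_by_id(task_id: str) -> dict:
    """Extra lookup the Agent needs just to learn the project ID."""
    return next(t for t in TASKS if t["id"] == task_id)


def search_with_context(query: str) -> list[dict]:
    """Search that already includes project_id for the next action."""
    return [dict(t) for t in TASKS if query in t["title"]]


def complete_task(task_id: str, project_id: str) -> str:
    """Follow-up action that requires both IDs."""
    return f"completed {task_id} in project {project_id}"


def run_minimal(query: str) -> int:
    """Search, look up, then act: three tool calls."""
    hit = search_minimal(query)[0]
    full = get_task_by_id(hit["id"])
    complete_task(full["id"], full["project_id"])
    return 3


def run_with_context(query: str) -> int:
    """Search, then act: two tool calls."""
    hit = search_with_context(query)[0]
    complete_task(hit["id"], hit["project_id"])
    return 2


print(run_minimal("groceries"), run_with_context("groceries"))
```

Each call saved is one less ReAct loop, and one less tool result carried forward in the context.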

---

All the tests above were run with MCP-Eval, an MCP Server benchmarking tool. If you want to check your MCP's performance, feel free to check it out.

https://github.com/Code-MonkeyZhang/mcp-eval

Comments URL: https://news.ycombinator.com/item?id=48407391

Points: 2

# Comments: 0

Categories: Hacker News

Delacroix's Entry of the Crusaders into Constantinople Restored

Hacker News - Thu, 06/04/2026 - 10:48pm

Article URL: https://www.louvre.fr/en/explore/life-at-the-museum/delacroix-s-entry-of-the-crusaders-into-constantinople-restored-to-its-original-glory

Comments URL: https://news.ycombinator.com/item?id=48407368

Points: 1

# Comments: 0

Categories: Hacker News

Fedora 43 Upgrade revealed 20 years old Outlook Security Bug

Hacker News - Thu, 06/04/2026 - 10:46pm

Article URL: https://fedoramagazine.org/fedora-43-upgrade-revealed-20-years-old-outlook-security-bug/

Comments URL: https://news.ycombinator.com/item?id=48407356

Points: 2

# Comments: 0

Categories: Hacker News

H-1B Crackdown on Indian Workers Erodes a Texas Real Estate Boom

Hacker News - Thu, 06/04/2026 - 10:46pm

Article URL: https://www.bloomberg.com/features/2026-dallas-h1b-housing-market/

Comments URL: https://news.ycombinator.com/item?id=48407355

Points: 3

# Comments: 0

Categories: Hacker News

OPRD: On-Policy Representation Distillation

Hacker News - Thu, 06/04/2026 - 10:44pm

Article URL: https://arxiv.org/abs/2606.06021

Comments URL: https://news.ycombinator.com/item?id=48407343

Points: 1

# Comments: 0

Categories: Hacker News

What We Believe Happened: A Summary Timeline of Bricks and Minifigs Store

Hacker News - Thu, 06/04/2026 - 10:43pm

Article URL: https://bricksandminifigs.com/blog/blog/2026/06/04/bricks-and-minifigs-salem-store-timeline/

Comments URL: https://news.ycombinator.com/item?id=48407337

Points: 2

# Comments: 0

Categories: Hacker News

Verifying Agentic Development at Scale

Hacker News - Thu, 06/04/2026 - 10:42pm

Article URL: https://twitter.com/ido_pesok/status/2060416230641881336

Comments URL: https://news.ycombinator.com/item?id=48407332

Points: 1

# Comments: 0

Categories: Hacker News

It's not how much it rose, but how fast

Hacker News - Thu, 06/04/2026 - 10:40pm

Article URL: https://www.ooooo.law/board/8?lang=en

Comments URL: https://news.ycombinator.com/item?id=48407316

Points: 1

# Comments: 0

Categories: Hacker News

Ask HN: Are people vibe coding internal tools at your companies?

Hacker News - Thu, 06/04/2026 - 10:39pm

Is replacing SaaS with vibe coding a thing? Are there measurable gains? How much work is it to support/maintain these?

Comments URL: https://news.ycombinator.com/item?id=48407312

Points: 2

# Comments: 0

Categories: Hacker News

Can These Ads Make You Love A.I.?

Hacker News - Thu, 06/04/2026 - 10:36pm

Article URL: https://www.nytimes.com/2026/06/04/style/chatgpt-advertising-campaign-artificial-intelligence.html

Comments URL: https://news.ycombinator.com/item?id=48407290

Points: 1

# Comments: 0

Categories: Hacker News

Quantum Odyssey

Hacker News - Thu, 06/04/2026 - 10:35pm

Article URL: https://store.steampowered.com/app/2802710/Quantum_Odyssey/

Comments URL: https://news.ycombinator.com/item?id=48407288

Points: 1

# Comments: 0

Categories: Hacker News

JPMorgan, Citi and Big Banks Plan New Tokenized Deposit System to Answer Crypto

Hacker News - Thu, 06/04/2026 - 10:35pm

Article URL: https://www.wsj.com/finance/banking/jpmorgan-citi-and-big-banks-plan-new-tokenized-deposit-system-to-answer-crypto-6b2d696b

Comments URL: https://news.ycombinator.com/item?id=48407283

Points: 1

# Comments: 0

Categories: Hacker News

OpenAI's Codex chained decade-old DoS attacks to crash web servers

Hacker News - Thu, 06/04/2026 - 10:34pm

Article URL: https://www.theregister.com/security/2026/06/04/openais-codex-chains-decade-old-dos-techniques-into-http/2-bomb/5251377

Comments URL: https://news.ycombinator.com/item?id=48407277

Points: 1

# Comments: 0

Categories: Hacker News

Bitcoin weathering ugliest week in months as narrative fades, liquidity rotates

Hacker News - Thu, 06/04/2026 - 10:34pm

Article URL: https://www.cnbc.com/2026/06/04/bitcoin-is-weathering-its-ugliest-week-in-months-as-narrative-fades-and-liquidity-rotates.html

Comments URL: https://news.ycombinator.com/item?id=48407274

Points: 4

# Comments: 2

Categories: Hacker News

Cisco Catalyst SD-WAN Manager Authenticated Privilege Escalation Vulnerability

Cisco Security Advisories - Thu, 06/04/2026 - 10:27pm

A vulnerability in the CLI of Cisco Catalyst SD-WAN Manager, formerly SD-WAN vManage, could allow an authenticated, local attacker to execute arbitrary commands as root by supplying a crafted file to the affected system.

This vulnerability is due to insufficient validation of user-supplied input. An attacker could exploit this vulnerability by uploading a crafted file to the affected system. A successful exploit could allow the attacker to perform command injection attacks on an affected system and elevate their privileges as the root user.

To exploit this vulnerability, the attacker must have netadmin privileges on the affected system. This would require valid credentials or exploitation of CVE-2026-20182 or CVE-2026-20127. Cisco is not aware of successful exploitation by other methods. Cisco has observed limited cases where the exploitation of this bug resulted in a configuration change pushed to edge devices.

Cisco recommends that customers upgrade to the fixed software that is documented in the Catalyst SD-WAN Security Advisory that was published on May 14, 2026, and verify the configuration of the edge devices.

Cisco has not released software updates that address this vulnerability. There are no workarounds that address this vulnerability.

Important: To preserve possible indicators of compromise, customers should issue the request admin-tech command from each of the control components in the SD-WAN deployment before upgrading. After the admin-tech file has been collected, software should be upgraded at the earliest opportunity.

Before upgrading an SD-WAN deployment to a fixed release, retain relevant logs. After upgrading, verify that the system has not been compromised by checking the logs for the indicators of compromise as documented in this advisory. If the logs show indicators of compromise and the system is confirmed to be compromised, applying the software update alone will not resolve the vulnerability. In such cases, follow the specific remediation steps that will be provided by the Cisco Technical Assistance Center (TAC) to help secure the system. This section will be updated as information becomes available.

This advisory is available at the following link:
https://sec.cloudapps.cisco.com/security/center/content/CiscoSecurityAdvisory/cisco-sa-sdwan-privesc-4uxFrdzx

Security Impact Rating: High

CVE: CVE-2026-20245

Categories: Cisco

What happens if Japan takes in zero immigrants?

Hacker News - Thu, 06/04/2026 - 9:56pm

Article URL: https://www.konichivalue.com/p/what-happens-if-japan-takes-in-zero

Comments URL: https://news.ycombinator.com/item?id=48407068

Points: 1

# Comments: 0

Categories: Hacker News

Dirk and Linus discuss AI and kernel development

Hacker News - Thu, 06/04/2026 - 9:54pm

Article URL: https://lwn.net/Articles/1073761/

Comments URL: https://news.ycombinator.com/item?id=48407061

Points: 1

# Comments: 0

Categories: Hacker News



Welcome to Joe Pearce's Home Page.

Web page offered by Joe Pearce © 2004 - 2025 - All rights reserved.

Thanks to the ETSU Computer and Information Sciences Department.

Thanks to the NSTCC Computer and Information Sciences and Computer Engineering Technologies Department.
