28

Show HN: Comet MCP – Give Claude Code a browser that can click

Hey HN,

Claude Code is pretty agentic now. It writes scripts, calls APIs, uses CLIs. But when something requires actually clicking through a website, it stops and asks me to do it.

Problem is, I'm often unfamiliar with these platforms myself. "Go to App Store Connect and generate a P8 key" okay but where? I end up spending 10 minutes navigating menus I've never seen before.

I started delegating these tasks to Perplexity's Comet browser. It handles the clicking, returns what I need. But copy-pasting between Claude and Comet got old fast.

So I built this MCP server to connect them directly. Now when Claude needs to interact with a website that has no API, it can just ask Comet to handle it.

  Examples:
  - Grab my app ID from RevenueCat dashboard
  - Generate a P8 key in App Store Connect
  - Navigate admin panels behind login walls
I tried Playwright MCP but having Claude do the clicking itself overwhelms the context window. Comet's agentic browsing just works better in my experience.

Comet doesn't have an API, so this uses CDP to communicate with it directly.

I didn’t realize AI could interact with browsers like this already (guess I’m naive). Isn’t this setting up for the scenario where the AI is duped into logging into your bank account and transferring your money away? Not sure I have enough trust to allow an AI to touch a browser.

a day agoDougN7

People are already going full Leroy Jenkins with this stuff, and OpenAI, other labs are snarfing up their usage data. Hopefully with their brave sacrifice, they can figure out all the security pitfalls before it becomes common enough that someone with a clever jailbreak ends up pulling of a billion dollar heist, or orders pizza for half the country.

It's 100% absolutely not safe yet. You can effectively copy and paste Pliny prompts and pwn any of the frontier lab models. Anyone with a little time and creativity can tailor a unique one and set hidden text traps for AI browsers or agents, and depending on what access you've given the software it could be very dangerous.

a day agoobservationist

Great time to be an offensive security researcher specialising in researching LLM adversarial attacks.

a day agorvz

Yeah - the red team folks probably have one of the most fun jobs in the world right now.

a day agoobservationist

Depends on your definition of "fun"

a day agoluckman212

There are folks on X running vibe-coded Polymarket arbitrage bots playing with hundreds of thousands of dollars. Some people have pretty wild risk tolerances!

a day agojoenot443

That's a valid concern. I took a more constrained approach for web searches for exactly this reason. Instead of giving the LLM full browser control, I built a Firefox extension that only handles web search client-side.

When my local LLM (llama.cpp) needs to search, it opens DuckDuckGo in a new window, loads the result pages in tabs, extracts content with Readability.js, and feeds it back. You stay in the loop - can see what's loading, solve captchas if needed. Less autonomous than Comet/Playwright, with a narrower use-case, but also less risk.

Its still a prototype though: https://github.com/tbocek/llm-local-web-search

a day agotbocek

It's totally setting up for exactly that scenario. You need to ensure the browser that it uses is totally unprivileged if you're going to do this, or at the very least that it can only access a small set of trusted destinations.

a day agowat10000

"claude --chrome" does this out of the box and works pretty well.

a day agoEMM_386

Is it included in "--yolo"?

3 hours agolostmsu

I was going to ask what makes this better than just using Playwright and this largely answers that question. I will have to try it out and see how it compares.

I haven't really had luck with MCP in general for quite a while though. I have just been using Google Antigravity for most of my vibe coding needs.

3 days agorobertheadley

I've used chrome devtools mcp successfully to do all kinds of advanced in browser tasks, agents like claude code can write js and inject it into the context in a live browser and do all kinds of neat tricks. I've used this extensively in gemini-cli.

a day agoprettyblocks

a brittle MCP that connects a brittle (unless using Opus 4.5) CLI to a brittle browser? (see: Scamlexity, an actual vulnerability name)

I trust Claude in Chrome a lot more, and I trust my own hands and eyes most.

a day agobpavuk

I was just thinking to myself this morning, I wonder if I can make Claude Code and Comet work together... Now I have my answer!

a day agogiancarlostoro

Anyone know of any good articles around having claude code build playwright test suites for a given website and parameters?

a day agonickpeterson

They literally already have Chrome integration… sorry you wasted your time

a day agodbbk

I just used puppeteer for this until it came natively

21 hours agoramon156

nice, literally had claude tell me it couldnt do browser today.

a day agothedudeabides5

I noticed that /plugins

Doesn't lost plugins anymore. I'm sure I installed playwright using that menu, but now it lists no plugins (and the plugin can't be found locally)

However, claude add mcp and /mcp still works.

a day agonurettin

Any notions of how this differs from Vibium?

a day agojMyles

I tried it. My Perplexity premium expired, maybe that is it, but it barely did anything.

When I put prompt you suggested, it did open Perplexity in Comet and then I guess didn't get response even though Perplexity did research, so it used regular search mcp to get results...

It is cool idea, this is what I would like to have, something to automate boring stuff. Find all LinkedIn connections that are not active and remove them from my network for example.

I don't think it is your mcp or code, as tech is just not there yet. It is much easier to accomplish this through other automations.

a day agodesireco42

Claude in Chrome is excellent - as is Claude in Excel. I was shocked at how useful the latter is.

a day agomrcwinn
[deleted]
5 days ago

Another day, another MCP server. Wake me up when we stop needing a new protocol for every AI tool to talk to every other AI tool.

a day agonubskr