> There is currently no option to change this behavior, no startup flag, nothing. You do not have the option to serve the web app locally, using `opencode web` just automatically opens the browser with the proxied web app, not a true locally served UI.
That is the address of their hosted WebUI, which connects to an OpenCode server on your localhost. It would be nice if there were an option to self-host it, but it is nowhere near as bad as "proxying all requests".
For some anecdata, I set up Qwen3.5 on an RX 7900 XTX last weekend. It runs fine: I tried some simple coding prompts and got responses in 15-30 seconds. It's my first foray into running models locally just to see what's possible, and I guess I'm happily surprised so far.
Also, the entire setup was done through Codex. I asked Codex to figure out how to run models locally given my architecture (Ubuntu, AMD GPU). It told me which steps to apply and I hit zero snags.
They may, but note that this isn't an official Newgrounds project. This is just a user ("Bill") posting on his own Newgrounds blog that he has made this (it's not Newgrounds' official blog).
Yep, the email they sent out is terribly worded so it looks like the age requirement is for Zed itself.
Their actual blog ( https://zed.dev/blog/terms-update ) says the age requirement is only for their AI service (still not the best wording but a little clearer):
> Age requirement. You must be 18 or older to use Zed’s AI-enabled software-as-a-service offering (the “Service”).
> I really hope more people realize that local LLMs are where it's at
No worries, the AI companies thought ahead: by sending GPU, RAM, and now even hard drive prices through the roof, they've made sure you won't have a computer to run a local model.
3-bit is a bit ridiculous. From that page, it is unclear to me whether the current model is 3-bit or 4-bit.
If it’s 4-bit… well, NVIDIA showed that a well-organized model can perform almost as well as 8-bit.
Local models are quite capable. Obviously a 4B model isn't going to do the job of a trillion-parameter SOTA model, but there are many local models that are both fast and very usable for these agentic flows.
Qwen 30B and GLM Flash (also around 30B) are both very good for example and I use them regularly.
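For a rough sense of why quantization width matters for models of this size, here is a back-of-the-envelope sketch. It assumes weight memory is simply parameters × bits per weight ÷ 8, and ignores the KV cache, activations, and the per-block scale factors that real quantization formats add. The function name `weight_memory_gib` is made up for illustration.

```python
def weight_memory_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GiB.

    Assumption: every weight is stored at exactly `bits_per_weight` bits,
    with no overhead for quantization scales or runtime buffers.
    """
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


# Compare a ~30B-parameter model at several quantization widths.
for bits in (3, 4, 8, 16):
    print(f"30B @ {bits}-bit: ~{weight_memory_gib(30, bits):.1f} GiB")
```

By this estimate a 30B model needs roughly 14 GiB for weights at 4-bit and roughly 28 GiB at 8-bit, so actual usage will be somewhat higher once context and overhead are included.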
Playdate's SDK is free.