Hacker Newsnew | past | comments | ask | show | jobs | submit | nprateem's commentslogin

OMG, some of those are legit good. That said the AI seems minimally guidable. It seems to ignore three majority of instructions in https://suno.com/song/25b16ab7-bfea-451d-abb3-8b52cdd783d0?s... so I guess like most tools, it's fine if you want to get what you're given but not really control it.

Yep agreed. You can guide it only so much and then you're at the mercy of running it a few times to get the closest match

Shit in the pool then sell nets

But they are shit. Over the last 2 days I've got bored of the predictable cycle of it first getting excited about a new idea then back peddling once I shoot it to pieces.

They can't write and think critically at the same time. Then subsequent messages are tainted by their earlier nonsensical statements.

Opus 3.7 BTW, not some toy open source model.


God knows why you think this is possible. If I don't even know what might be relevant to the conversation in several turns, there's no way an agent could either.

One of us is confusing prediction with retrieval. The embedding model doesn't predict what is going to be relevant in several turns, just on the turn at hand. Each turn gets a fresh semantic search against the full body of memory/agent comms. If the conversation or prompt changes the next query surfaces different context automatically.

As you build up a "body of work" it gets better at handling massive, disparate tasks in my admittedly short experience. Been running this for two weeks. Trying to improve it.


So the embedding model is a fixed-size view on a arbitrarily sized work history (tool calls, natural language messages)? The model is like a summarizer, but in latent space? And not aimed to summarize, but trained to hold whatever is needed for the agent to be autonomous for longer runs?

Pretty much. It's a fixed-size vector per chunk-- 1024 dims in the case of Voyager Nano. The autonomy part is entirely in how you build the vectorDB and query it, not in the model's training. That's the part I've been focusing on lately. Trying different methods and seeing what gives the best results.

At the moment I wouldn't emphasize "autonomous-ness", there's still a fair bit of human hand holding. But once I get a model on the right path it can switch back to to an old project, autonomously locate and debug 2-week old commits and the context around their development, and apply that knowledge to the task at hand.

It's only been a day but I seeing an improvement from nomite (768dims) to Voayager.


It definitely overcompensates to the point of defensiveness. They have all done so for years.

"Why did you do that?" (Me, just wanting to understand)

"You're right I should have done the opposite" (starts implementing the opposite without seeking approval, etc.

But if you agree with it it won't do that, so it isn't simply a case of randomly rerunning prompts.


So you're happy with an untrustworthy lazy moron prone to stupid mistakes and guesswork?

Surely you can see the first lab that solves this gains a massive advantage?


I wonder if this one will be able to stop putting my fucking python imports inline LIKE I'VE TOLD IT A THOUSAND TIMES.

You could pitch it as your "digital nagging housewife", or a "micromanager in a box". How about "your time wasting interrupt-otron" or just "flow-breaker"?

Seriously why would you think AI could read my mind and tell me what to do next without knowing my goals?

This sounds like the irritating tangential follow-on questions they ask on steroids. Generally irrelevant and take the conversation in a direction you don't want to go.


I recommend the book The Proof is in the Plants for a seemingly unbiased review of the literature.

Dude cites study after study pretty much all with the same conclusion: Eating animal products is bad for your health.


People need purpose. They may want money, etc but you see it all the time. People get bored and want something to channel their energy into.

So even universal income won't solve everything, not that it's ever likely.


Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: