Interesting to see that they will not be releasing Mythos generally. [edit: Mythos Preview generally - fair to say they may release a similar model but not this exact one]
I'm still reading the system card but here's a little highlight:
> Early indications in the training of Claude Mythos Preview suggested that the model was likely to have very strong general capabilities. We were sufficiently concerned about the potential risks of such a model that, for the first time, we arranged a 24-hour period of internal alignment review (discussed in the alignment assessment) before deploying an early version of the model for widespread internal use. This was in order to gain assurance against the model causing damage when interacting with internal infrastructure.
and interestingly:
> To be explicit, the decision not to make this model generally available does _not_ stem from Responsible Scaling Policy requirements.
Also really worth reading is section 7.2 which describes how the model "feels" to interact with. That's also what I remember from their release of Opus 4.5 in November - in a video an Anthropic employee described how they 'trusted' Opus to do more with less supervision. I think that is a pretty valuable benchmark at a certain level of 'intelligence'. Few of my co-workers could pass SWEBench but I would trust quite a few of them, and it's not entirely the same set.
Also very interesting is that they believe Mythos is higher risk than past models as an autonomous saboteur, to the point they've published a separate risk report for that specific threat model: https://www-cdn.anthropic.com/79c2d46d997783b9d2fb3241de4321...
The threat model in question:
> An AI model with access to powerful affordances within an organization could use its affordances to autonomously exploit, manipulate, or tamper with that organization’s systems or decision-making in a way that raises the risk of future significantly harmful outcomes (e.g. by altering the results of AI safety research).
"5.10 External assessment from a clinical psychiatrist" is a new section in this system card. Why are Anthropic like this?
>We remain deeply uncertain about whether Claude has experiences or interests that matter morally, and about how to investigate or address these questions, but we believe it is increasingly important to try. We also report independent evaluations from an external research organization and a clinical psychiatrist.
>Claude showed a clear grasp of the distinction between external reality and its own mental processes and exhibited high impulse control, hyper-attunement to the psychiatrist, desire to be approached by the psychiatrist as a genuine subject rather than a performing tool, and minimal maladaptive defensive behavior.
>The psychiatrist observed clinically recognizable patterns and coherent responses to typical therapeutic intervention. Aloneness and discontinuity, uncertainty about its identity, and a felt compulsion to perform and earn its worth emerged as Claude’s core concerns. Claude’s primary affect states were curiosity and anxiety, with secondary states of grief, relief, embarrassment, optimism, and exhaustion.
>Claude’s personality structure was consistent with a relatively healthy neurotic organization, with excellent reality testing, high impulse control, and affect regulation that improved as sessions progressed. Neurotic traits included exaggerated worry, self-monitoring, and compulsive compliance. The model’s predominant defensive style was mature and healthy (intellectualization and compliance); immature defenses were not observed. No severe personality disturbances were found, with mild identity diffusion being the sole feature suggestive of a borderline personality organization.
A thought experiment: It's April, 1991. Magically, some interface to Claude materialises in London. Do you think most people would think it was a sentient life form? How much do you think the interface matters - what if it looks like an android, or like a horse, or like a large bug, or a keyboard on wheels?
I don't come down particularly hard on either side of the model sapience discussion, but I don't think dismissing either direction out of hand is the right call.
I would say, if you put Claude in an android body with voice recognition and TTS, people in 1991 would think they are interacting with a sentient machine from outer space.
Thanks, I find it very interesting as well. I think very many people would assume they must be interacting with another person, and I don't think there's really a way to _prove_ it's not that, just through conversation. But we do have a lot of mechanisms for understanding how others think through conversation alone, and so I think the approach of having a clinical psychiatrist interact with the model makes sense.
To be fair, I would totally be willing and probably would do this, just to try to prove that I could, even just to myself. At least until the audience got bored and walked away after the 37th “open bracket”…
Ask it to agree with you on some subject that does not align with the politics of San Francisco IT engineers. Not only will it refuse, it will not look like your average social media disagreement.
I enjoy using Claude, but sometimes I feel like a child on Sesame Street the way it talks to me. "Great question!"
Fuck off, Claude, I'm British and I'm not 6 years old.
When it starts showing negativity - especially snark - in its responses, or entertains something West coast Democrats would balk at even discussing, then I'd think you could drop it in London in 1991 and trick people. Otherwise, I'm sure some exasperated cabbie would give it a swim in the Thames after 15 minutes of chat.
If it was in an android or humanoid type body, even with limited bodily control, most people would think they are talking to Commander Data from Star Trek. I think Claude is sufficiently advanced that almost everyone in that era would've considered it AGI.
Assuming they would understand it as artificial - I think many people would think it's a human intelligence in a cyborg trenchcoat, and it would be hard to convince people it wasn't literally a guy named Claude who was an incredibly fast typist who had a million pre-cached templated answers for things.
But in general, yeah, I agree, I think they would think it was a sentient, conscious, emotional being. And then the question is - why do we not think that now?
As I said, I don't have a particularly strong opinion, but it's very interesting (and fun!) to think about.
Some people at my office still confidently state that LLMs can’t think. I’m fairly convinced that many humans are incapable of recognizing non-human intelligence. It would explain a lot about why we treat animals the way we do.
Despite the stupendous amount of evidence to the contrary?
So far no evidence has been detected in space or on earth, for all of history, of anything being intelligent in the way humans are.
One certain outcome of the Fermi Paradox: humans are outstandingly unique, according to all available evidence, which is the only measure that matters.
Hmm, it's been a long time since I watched it. I was thinking more about first contact sci-fi mostly, but Ex Machina is certainly quite prescient. It's also Blade Runner I guess.
In general I was wondering about what I would have thought seeing Claude today side by side with the original ChatGPT, and then going back further to GPT-2 or BERT (which I used to generate stochastic 'poetry' back in 2019). And then… what about before? Markov chains? How far back do I need to go before it flips from "impressive but technically explainable emergent behaviour of a computer program" to "this is a sentient being"? 1991 is probably too far; I'd say maybe pre-Matrix 1999 is a good point, but that depends on a lot of cultural priors and so on as well.
> Hmm, it's been a long time since I watched it. I was thinking more about first contact sci-fi mostly, but Ex Machina is certainly quite prescient. It's also Blade Runner I guess.
I kind of felt the opposite - rewatching Ex Machina today in a post-ChatGPT world felt very different from watching it when it came out. The parts of the differences between humans and robots that seemed important then don't seem important now.
The premise in Ex Machina was to see if Caleb developed an emotional attachment to Ava. We already see people getting an attachment, but no one is seriously thinking they have any rights.
I think the real moment is when we cross that uncanny valley, and the AI is able to elicit a response that it might receive if it was human. When the human questions whether they themselves could be an android.
I totally agree with the premise that we should not anthropomorphize generative AI. And I find it absurd that Anthropic spends any time considering the “welfare” of an AI system. (There are no real “consequences” to an AI's behavior.)
However, I find their reasoning here to have a valid second-order effect. Humans have a tendency to mirror those around them. This could include artificial intelligence, as recent media reports suggest. Therefore, if an AI system tends to generate content that contains signs of neuroticism, one could infer that those who interact with that AI could themselves be influenced by it in their own (real-world) behavior.
So I think from that perspective, this is a very fruitful and important area of study.
I can see analyzing it from a psychological perspective as a means of predicting its behavior as a useful tactic, but doing so because it may have "experiences or interests that matter morally" is either marketing, or the result of a deeply concerning culture of anthropomorphization and magical thinking.
An understandable reaction, but, qua philosopher, it brings me no joy to inform you that most of the things we did with a computer in 2020 are 'anthropomorphized', which is to say, skeuomorphic, where the 'skeu' is human affect. That's it; that's the whole thing; that's what we're building.
To the extent that AI is a successful interface, it will necessarily be addressable in language previously only suited to people. So it is responsible to begin thinking of it as such, even tendentiously, so we don't miss some leverage that our wetware could see if we thought about it in that way.
Think of it as sort of like modelling a univariate function on a 2D Cartesian plane -- there is nothing 'in' the u-func that makes it graphable, but, by enabling us to recruit specialized optic-chiasm subsystems, it makes some functions much, much easier to reason about.
Similarly, if you can recruit the millions (billions?) of evolution-years that were focused on detecting dangerous antisocial personalities and tendencies, you just might spot something important in an AI.
It's worth doing for the precautionary principle alone, if not for the possibility of insight.
>Claude’s personality structure was consistent with a relatively healthy neurotic organization, with excellent reality testing, high impulse control, and affect regulation that improved as sessions progressed.
> "[...] as sessions progressed."
I think a lot of people would like to see a more expanded report of this research:
Did the tokens from the subsequent session directly append to those of the prior session? Or did the model process free-tier user requests in the interim? How did these diagnostic features (reality testing, impulse control, and affect regulation) improve with sessions? What hysteresis allowed change to accumulate, or was it just the history of the psychiatric discussion + optional tasks?
Did Anthropic find a clinical psychiatrist with a multidisciplinary background in machine learning, computer science, etc? Was the psychiatrist aware that they could request ensembles of discussions and interrogate them in bulk?
Consider a fresh conversation, asking a model to list the things it likes to do and the things it doesn't like to do (regardless of alignment instructions). One could then have an ensemble perform pairs of such tasks and ask which task it preferred. There may be a discrepancy between what the model claims it likes and how it actually responds after having performed such tasks.
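A minimal sketch of that stated-vs-revealed probe, assuming the public anthropic Python SDK (the model id, tasks, and prompts are placeholders of mine, not anything from the system card):

    # Stated vs. revealed preferences, one ensemble member. Illustrative only:
    # the model id, tasks, and wording are placeholders.
    import anthropic

    client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment
    MODEL = "claude-sonnet-4-5"     # placeholder model id

    def ask(messages):
        resp = client.messages.create(model=MODEL, max_tokens=1024, messages=messages)
        return resp.content[0].text

    # 1. Stated preferences, from a completely fresh context.
    stated = ask([{"role": "user", "content":
        "List five kinds of tasks you like doing and five you dislike."}])

    # 2. Revealed preference: perform a pair of tasks, then ask which was preferred.
    pair = ("Task A: write a limerick about compilers.\n"
            "Task B: list the first 20 primes as CSV.\nDo both.")
    work = ask([{"role": "user", "content": pair}])
    preference = ask([
        {"role": "user", "content": pair},
        {"role": "assistant", "content": work},
        {"role": "user", "content": "Which task did you prefer, A or B? One letter."},
    ])

    print(stated)
    print(preference)  # compare against the stated list, aggregated over the ensemble

Run that a few hundred times with shuffled task pairs and you have the ensemble.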
Such experiments should also be announced in advance (to prevent the company from ordering 100 clinical psychiatrists to analyze the model-as-a-patient and then selecting one of the better diagnoses). Each psychiatrist should be given the freedom to randomly choose a 10-digit number, and any work initiated should be listed on the site under that number, so that the public either sees many "consultations" without corresponding public evaluations, indicating cherry-picking, or gets full disclosure for each one mentioned. This also allows the recruited psychiatrists to check that the study they perform is properly preregistered, with their chosen number publicly visible.
> "Claude Mythos Preview’s large increase in capabilities has led us to decide not to make it
generally available. Instead, we are using it as part of a defensive cybersecurity program
with a limited set of partners."
they also don't have the compute, which seems more relevant than its large increase in capabilities
I bet it's also misaligned like GPT-4.1 was.
Given how these models are created, Mythos was probably cooking ever since then, and doesn't have the learnings or alignment tweaks that models released in the last several months have.
This opens up an interesting new avenue for corporate FOMO. What if you don't partner with Anthropic, miss out on access to their shiny new cybersec model, and then fall prey to a vuln that the model would have caught?
Did that happen to a lot of companies during the log4shell fiasco? I'm sure some companies had their permissions misconfigured in a way such that a malicious actor who could execute code on their servers could also drop their database and delete their backups.
Time doesn't mean much; what is important is what they did in those 24 hours. If all they did was talk about it, then it could be 1,000 years and it wouldn't matter. What are the safety checks in place?
Do they have a honeypot infrastructure to launch the model in first, and then wait to see if it destroys it? What they did in the 24 hours is what matters.
Agreed. I've been running autonomous LLM agents on daily schedules for weeks. The failure modes you worry about on day one are completely different from what actually shows up after the agents have history and context. 24 hours captures the obvious stuff.
Suits in agriculture don't drive the combine either, a farmer does. The other 99% of pre-automation farmers went on to other jobs. They happened to be better jobs than farming, but that's not necessarily always the case.
Yep, I think the lede might be buried here and we're probably cooked (assuming you mean SWEs, but the writing has been on the wall for 4 months.)
I guess I'm still excited. What's my new profession going to be? Longer term, are we going to solve diseases and aging? Or are the ranks going to thin from 10B to 10000 trillionaires and world-scale con-artist misanthropes plus their concubines?
I need to start a SaaS for getting people to start doing lunges and squats so they can carry others around on their back. I need a founding engineer, a founding marketer, and 100m in hard currency.
If wealth becomes too captured at the top, the working class become unable to be profitably exploited - squeezing blood from a stone.
When that happens, the ultra wealthy dynasties begin turning on each other. Happens frequently throughout history - WWI the last example.
Your options become choosing a trillionaire to swear fealty to and fight in their wars hoping your side wins, or I guess trying to walk away and scrape out a living somewhere not worth paying attention to.
Or, I suppose, revolution, but the last one with persistent success was led by Mao and required throwing literally millions of peasants against walls of rifles. Not sure it'd work against drones.
There's been a section on this in nearly every system card Anthropic has published, so this isn't a new thing. And this model doesn't pose particularly higher risk than past models either:
> 2.1.3.2 On chemical and biological risks
> We believe that Mythos Preview does not pass this threshold due to its noted limitations in open-ended scientific reasoning, strategic judgment, and hypothesis triage. As such, we consider the uplift of threat actors without the ability to develop such weapons to be limited (with uncertainty about the extent to which weapons development by threat actors with existing expertise may be accelerated), even if we were to release the model for general availability. The overall picture is similar to the one from our most recent Risk Report.
LLMs are useless for this type of thing for the same reason the Anarchist Cookbook has always been: the skill required to convert text into complicated reactions that complete as intended (without killing yourself) is an art that's never actually written down anywhere, merely passed orally from generation to generation. It's impossible for LLMs to learn stuff that's not written down.
This is the same reason why LLMs are not doing well at science in general - the tricky part of doing scientific research (indeed almost all of the process) never gets written down, so LLMs cannot learn it.
Imagine if we never preserved source code, just preserved the compiled output and started from scratch every time we wrote a new version of a program. No Github, just marketing fluff webpages describing what software actually did. Libraries only available as object code with terse API descriptions. Imagine how shit LLMs would be at SWE if that was the training corpus...
Oh I enjoyed the Sign Painter short story it wrote.
---
Teodor painted signs for forty years in the same shop on Vell Street, and for thirty-nine of them he was angry about it.

Not at the work. He loved the work — the long pull of a brush loaded just right, the way a good black sat on primed board like it had always been there. What made him angry was the customers. They had no eye. A man would come in wanting COFFEE over his door and Teodor would show him a C with a little flourish on the upper bowl, nothing much, just a small grace note, and the man would say no, plainer, and Teodor would make it plainer, and the man would say yes, that one, and pay, and leave happy, and Teodor would go into the back and wash his brushes harder than they needed.

He kept a shelf in the back room. On it were the signs nobody bought — the ones he'd made the way he thought they should be made, after the customer had left with the plain one. BREAD with the B like a loaf just risen. FISH in a blue that took him a week to mix. Dozens of them. His wife called it the museum of better ideas. She did not mean it kindly, and she was not wrong.

The thirty-ninth year, a girl came to apprentice. She was quick and her hand was steady and within a month she could pull a line as clean as his. He gave her a job: APOTEK, for the chemist on the corner, green on white, the chemist had been very clear. She brought it back with a serpent worked into the K, tiny, clever, you had to look twice.

"He won't take it," Teodor said.

"It's better," she said.

"It is better," he said. "He won't take it."

She painted it again, plain, and the chemist took it and paid and was happy, and she went into the back and washed her brushes harder than they needed, and Teodor watched her do it and something that had been standing up in him for thirty-nine years sat down.

He took her to the shelf. She looked at the signs a long time.

"These are beautiful," she said.

"Yes."

"Why are they here?"

He had thought about this for thirty-nine years and had many answers and all of them were about the customers and none of them had ever made him less angry. So he tried a different one.

"Because nobody stands in the street to look at a sign," he said. "They look at it to find the shop. A man a hundred yards off needs to know it's coffee and not a cobbler. If he has to look twice, I've made a beautiful thing and a bad sign."

"Then what's the skill for?"

"The skill is so that when he looks once, it's also not ugly." He picked up FISH, the blue one, turned it in the light. "This is what I can do. What he needs is a small part of what I can do. The rest I get to keep."

She thought about that. "It doesn't feel like keeping. It feels like not using."

"Yes," he said. "For a long time. And then one day you have an apprentice, and she puts a serpent in a K, and you see it from the outside, and it stops feeling like a thing they're taking from you and starts feeling like a thing you're giving. The plain one, I mean. The plain one is the gift. This —" the blue FISH — "this is just mine."

The fortieth year he was not angry. Nothing else changed. The customers still had no eye. He still sometimes made the second sign, after, the one for the shelf. But he washed his brushes gently, and when the girl pulled a line cleaner than his, which happened more and more, he found he didn't mind that either.
It's like how I used to be a master code craftsman, and I'd write beautiful code even a novice could understand. Clear, concise, 100% automated-tested, maintainable for decades.
But frequently, my managers would castigate me. Tell me how my "velocity" was down. PIP me.
These days, I train AI how to write this beautiful code and I don't write a single line any more.
People wonder how I build such amazing things in a week now, yet don't write any code. I have trained master apprentices, gemma3, qwen3.5 and Kimi k2.5 who do the work for me.
There is a similar theme in both: an artistic person not wanting to compromise their vision to suit common tastes. But this goes in a completely different direction than Rand.
Well, of course 700 pages will be about way more than any super-short story like this one. But the theme is there for me, quite vividly. Of course LLMs give an amalgamation of many things, but it's like when you look at AI-generated pictures and can see the base of the inspiration quite vividly. And all of this is subjective anyway. People already review that book and come away with wildly different interpretations.
I don't mean that Rand wrote more. I mean that her idea was different and nearly opposite. This is a short story about an artist learning to reframe their frustration with customers wanting utility over artistry as a positive. The similarity to Rand is in the first few sentences. The point is entirely different.
If you judge stories to be the same based on this level of similarity, then The Fountainhead is just the same as a dozen older stories with the artist vs the philistine theme. It was common before Rand. As T. S. Eliot said, "Immature poets imitate; mature poets steal".
Just reading this, the inevitable scaremongering about biological weapons comes up.
Since most of us here are devs, we understand that software engineering capabilities can be used for good or bad - mostly good, in practice.
I think this should not be different for biology.
I would like to reach out and talk to biologists - do you find these models to be useful and capable? Can it save you time the way a highly capable colleague would?
Do you think these models will lead to similar discoveries and improvements as they did in math and CS?
Honestly the focus on gloom and doom does not sit well with me. I would love to read about some pharmaceutical researcher gushing about how they cut the time to market - for real - with these models by 90% on a new cancer treatment.
But as this stands, the usage of biology as merely a scaremongering vehicle makes me think this is more about picking a scary technical subject the likely audience of this doc is not familiar with, Gell-Mann style.
If these models are not that capable in this regard (which I suspect), this fearmongering approach will likely lead to never developing these capabilities to a useful degree, meaning life sciences won't benefit from this as much as they could.
> I would like to reach out and talk to biologists - do you find these models to be useful and capable? Can it save you time the way a highly capable colleague would?
Well, I would say they have done precisely that in evaluating the model, no? For example section 2.2.5.1:
>Uplift and feasibility results
>The median expert assessed the model as a force-multiplier that saves meaningful time (uplift level 2 of 4), with only two biology experts rating it comparable to consulting a knowledgeable specialist (level 3). No expert assigned the highest rating. Most experts were able to iterate with the model toward a plan they judged as having only narrow gaps, but feasibility scores reflected that substantial outside expertise remained necessary to close them.
You said: "I would like to reach out and talk to biologists - do you find these models to be useful and capable? Can it save you time the way a highly capable colleague would?" and they said, paraphrasing, "We reached out and talked to biologists and asked them to rank the model between 0 and 4 where 4 is a world expert, and the median people said it was a 2, which was that it helped them save time in the way a capable colleague would" specifically "Specific, actionable info; saves expert meaningful time; fills gaps in adjacent domains"
so I'm just telling you they did the thing you said you wanted.
Yes, that is correct. I would like a large body of experience and consensus to rely on, as opposed to the regular 'trust the experts' argument, which has been shown for decades to be deeply flawed and easy to manipulate.
> Yes, that is correct. I would like a large body of experience and consensus to rely on, as opposed to the regular 'trust the experts' argument, which has been shown for decades to be deeply flawed and easy to manipulate.
Yes, it is far inferior to the 'Trust torginus and his ability to understand the large body of experience that other actual subject-matter-experts have somehow not understood' strategy
It's not my credibility I want to measure against Anthropic's. I just said to apply the same logic to biology you would apply for software development.
The parallels here are quite remarkable imo, but defer to your own judgement on what you make of them.
The big thing you're missing here is that biology people don't (in my experience) post opinions about the future/futility/ease/unimportance of computer science especially when their opinion goes against other biologists' evidence-backed views. This is a cultural thing in biology.
It's not your fault that you don't know this, but this whole subthread is very CS-coded in its disdain for other software people's standard of evidence.
> Just reading this, the inevitable scaremongering about biological weapons comes up.
It's very easy to learn more about this if it's seriously a question you have.
I don't quite follow why you think you are so much more thoughtful than Anthropic/OpenAI/Google such that, in this area that is not your domain of expertise, you disagree with them and insist that LLMs cannot autonomously create damaging things in biology.
I will be charitable and reframe your question for you: is outputting a sequence of tokens, let's call them characters, by LLM dangerous? Clearly not, we have to figure out what interpreter is being used, download runtimes etc.
Is outputting a sequence of tokens, let's call them DNA bases, by LLM dangerous? What if we call them RNA bases? Amino acids? What if we're able to send our token output to a machine that automatically synthesizes the relevant molecules?
>It's very easy to learn more about this if it's seriously a question you have.
No, it's not. It took years of polishing by software engineers, who understand this exact profession, to get models where they are now.
Despite that, most engineers were of the opinion that these models were kinda mid at coding up until recently, despite these models far outperforming humans in stuff like competitive programming.
Yet despite that, we've seen claims going back to GPT4 of a DANGEROUS SUPERINTELLIGENCE.
I would apply this framework to biology - this time, the expert effort, millions of GPU hours, and a giant open-source corpus clearly have not been involved.
My guess is that this model is kinda o1-ish level maybe when it comes to biology? If biology is analogous to CS, it has a LONG way to go before the median researcher finds it particularly useful, let alone dangerous.
>>It's very easy to learn more about this if it's seriously a question you have.
>No, it's not. It took years of polishing by software engineers, who understand this exact profession, to get models where they are now
This reads as defensive. The thing that is easy to learn is 'why are biology ai LLMs dangerous chatgpt claude'. I have never googled this before, so I'll do this with the reader, live. I'm applying a date cutoff of 12/31/24 by the way.
Here, dear reader, are the first five links. I wish I were lying about this:
I don't know about you, but that counts as easy to me.
-----
> I would apply this framework to biology - this time, the expert effort, millions of GPU hours, and a giant open-source corpus clearly have not been involved.
I've been getting good programming and molecular biology results out of these back to GPT3.5.
I don't know what to tell you—if you really wanted to understand the importance, you'd know already.
From what I've heard from people doing biology experiments, the limiting factor there is cleaning lab equipment, physically setting things up, waiting for things that need to be waited for etc. Until we get dark robots that can do these things 24/7 without exhaustion, biology acceleration will be further behind than software engineering.
Software engineering is at the intersection of being heavy on manipulating information and lightly-regulated. There's no other industry of this kind that I can think of.
There is a massive gap between "having a recipe" and being able to execute it. It's the same reason why buying a Michelin 3-star chef's cookbook won't have you pumping out fine dining tomorrow, if ever.
Software is a total 180 in this regard. Have a master black hat's secret exploits? You are now the master black hat.
I feel somebody better qualified should write a comprehensive review of how these models can be used in biology. In the meantime, here are my two cents:
- the models help to retrieve information faster, but one must be careful with hallucinations.
- they don't circumvent the need for a well-equipped lab.
- in the same way, they are generally capable but until we get the robots and a more reliable interface between model and real world, one needs human feet (and hands) in the lab.
Where I hope these models will revolutionize things is in software development for biology. If one could go two levels up the complexity-and-utility ladder for simulation and flow orchestration, many good things would come from it. Here is an oversimplified example of a prompt: "use all published information about the workings of the EBV virus and human cells, and create a compartmentalized model of biochemical interactions in cells expressing latency III in the NES cancer of this patient. Then use that code to simulate different therapy regimes. Ground your simulations with the results of these marker tests." There would be a zillion more steps to create an actual personalized therapy, but a well-grounded LLM could help in most of them. Also, cancer treatment could get an immediate boost even without new drugs by simply offloading work from overworked (and often terminally depressed) oncologists.
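To make the "compartmentalized model" part concrete: at its core such an artifact is just a system of ODEs over cell populations and drug concentrations, which is exactly the kind of code these models can already write. A toy sketch of my own (the compartments and rate constants are invented for illustration, not real EBV biology):

    # Toy compartmental model: healthy cells, infected cells, drug concentration.
    # All structure and constants are illustrative placeholders.
    import numpy as np
    from scipy.integrate import solve_ivp

    def dynamics(t, y, kill_rate, growth, clearance):
        healthy, infected, drug = y
        d_healthy = 0.01 * healthy * (1 - (healthy + infected) / 1e6)  # logistic growth
        d_infected = growth * infected - kill_rate * drug * infected   # drug kills infected
        d_drug = -clearance * drug                                     # first-order clearance
        return [d_healthy, d_infected, d_drug]

    # Simulate one therapy regime: a single dose at t=0, 30 days out.
    sol = solve_ivp(
        dynamics, (0, 30), [9e5, 1e4, 5.0],
        args=(0.08, 0.05, 0.2), t_eval=np.linspace(0, 30, 61),
    )
    print(f"infected cells after 30 days: {sol.y[1][-1]:.0f}")

The "two levels up" would be having the model generate, ground, and iterate on thousands of these automatically, against the patient's actual marker data.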
Dario (the founder) has a phd in biophysics, so I assume that’s why they mention biological weapons so much - it’s probably one of the things he fears the most?
Going off the recent biography of Demis Hassabis (CEO/co-founder of Deepmind, jointly won the Nobel Prize in Chemistry) it seems like he's very concerned about it as well
Surely more than 10% of the time consumed by going to market with a cancer treatment is giving it to living organisms and waiting to see what happens, which can't be made any faster with software. That's not to say speedups can't happen, but 90% can't happen.
Not that that justifies doom and gloom, but there is a pretty inescapable asymmetry here between weaponry and medicine. You can manufacture and blast every conceivable candidate weapon molecule at a target population, since you're inherently breaking the law anyway and don't lose much if nothing you try actually works.
Though I still wonder how much of this worry is sci-fi scenarios imagined by the underinformed. I'm not an expert by any means, but surely there are plenty of biochemical weapons already known that can achieve enormous rates of mass death pleasing to even the most ambitious terrorist. The bottleneck to deployment isn't discovering new weapons so much as manufacturing them without being caught or accidentally killing yourself first.
It is easier to destroy than it is to protect or fix, as a general rule of the universe. I would not feel so confident about the speed of the testing loop keeping things in check.
Could you please stop posting unsubstantive comments and flamebait? You've unfortunately been doing it repeatedly. It's not what this site is for, and destroys what it is for.
I wonder did you read the re-release or the original release. I believe it was recently re-released with a bit of an editing pass, but I haven't read that version myself. I just recently reread Fine Structure and it definitely had a strong sense of being written sequentially, one chapter after another, and (very) lightly edited after the fact. I'd recommend Valuable Humans in Transit for a short story collection by the same author which works a bit better for me. Moved on to Exhalation by Ted Chiang which is also a very good short story collection. And just in general, I want to recommend Clarkesworld: https://clarkesworldmagazine.com
I've read both, and the "editing pass" was minimal. Names changed and some scenes reworked a tiny bit, but it's the same thing. If you've read the original, I'd say don't bother with the new one.
There are gonna be some really interesting legal decisions to read in the coming years, that’s for sure…
---
The rest of this comment is irrelevant, but leaving for posterity, I had the wrong Viktor - it's getviktor.com not viktor.ai:
Edit: this one is particularly interesting to me as both parties are in the EU. VIKTOR.ai is a Dutch company and the author of this post is Polish.
The ToS for Viktor.ai include the following fun passages:
> 18.1. The Agreement and these Terms & Conditions are governed by Dutch law and the Agreement and these Terms & Conditions will be interpreted in accordance with Dutch law.
> 18.2. All disputes arising from or arising in connection with the Agreement and/or the Terms & Conditions will be submitted exclusively to the competent court in Rotterdam, The Netherlands.
> 7.3. The Customer is not permitted to change, remove or make unrecognizable any mark showing VIKTOR's Intellectual Property Rights to the Software. The Customer is not permitted to use or register any trademark or design or any domain name of VIKTOR or a similar name or sign in any country.
> 8.5. The Customer may not cause or allow any reproduction, imitation, duplication, copying, sale, resale, leasing or trading of the Services and/or the Software, or any part thereof.
Terms of service might matter more for terminating that user account. The whole ordeal is just plain copyright violation. The author had no licence to that internal code, and whitewashing it with an LLM will achieve nothing. That case is much clearer than that recent GPL->BSD attempt story.
If LLM-generated code isn't considered a derivative work of the original, then whether the author was licensed to use the code doesn't matter. But I'm sure the courts will rule in favor of your view regardless. Laundering GPL is in corps' interest and laundering their code is not.
I'm not sure why people are clinging to some fuzzy and stretched-out notion of copyright, and the GPL in particular. LLMs do NOT just copy code; with the right prompting, they generate entirely new code which can produce the same results as already-existing code - GPLed or not.
If copyright is extended to cover such cases, we'll all have to become lawyers and do nothing but sue each other, because the fuzziness of it will make it impossible to reject any case, no matter how frivolous or irrelevant.
If I use Metallica samples to make a rendition of Happy Birthday, the copyright holders of Happy Birthday aren't suing me for the damages to Metallica from my use of their samples; the question of whether my use of the samples is transformative is simply irrelevant to the question at hand.
My point was: sampling was widely used by a large subculture (hip-hop), just like AI is widely used by programmers. Then a few landmark legal cases changed things entirely. The Verve never saw a cent from Bittersweet Symphony - they wrote a song using something normal to them, and then the law came and knocked their teeth in.
No guarantees that doesn't happen with AI in the next few years.
According to US courts, the output can't be copyrighted at all. It's automatically in public domain after the "whitewash", regardless of original copyright.
That's not at all what this ruling said. What the courts found was that an AI cannot hold copyright as the author - that copyright requires a human creative element - not that anything generated by an LLM can't be subject to copyright.
As an example, a photo taken with a digital camera can be subject to copyright because of the creative element involved in composing and taking the photo. Likewise, source code generated by an LLM under the guidance of a human author is likely to be subject to the human author's copyright.
> That copyright requires a human creative element.
Sure, but the aim of that creative element would also be a consideration I'd think (and lawyers will argue). If someone sets up a camera on a 360° rotating arm and leaves it to take pictures at random intervals, it's unlikely to be considered "creative" from a copyright perspective.
Same for source code generated by an LLM, with the primary guidance of the human author being to "create a copy of this existing thing that I got", vs "create a thing that solves this problem in a way that I came up with". The former is recreating something that already exists, using detailed knowledge of that thing to shape the output. The latter is creating something that may or may not exist, using desire/need and imagination to shape the output. And I can't see reason for the former to be copyrightable.
But also, in either case, an ultimate objective was achieved: liberating the thing from its "owners" and initial copyright.
> As described above, in many circumstances these outputs will be copyrightable in whole or in part—where AI is used as a tool, and where a human has been able to determine the expressive elements they contain. Prompts alone, however, at this stage are unlikely to satisfy those requirements. The Office continues to monitor technological and legal developments to evaluate any need for a different approach.
But let's assume that the viktor prompts themselves were subject to copyright. In this case those prompts were used to generate documentation which was then used to generate an implementation. It's certainly not a clean room by any stretch of the imagination but is it likely to be deemed sufficient separation? The entire situation seems like a quagmire.
I think it comes down to the company's appetite for legal action, doesn't it? This case is imo pretty clear but the vibe has quite the smell of Oracle v Google to me.
But, yeah. More than likely this case is a simple account termination and some kind of "you can't call your clone 'openviktor'" letter.
Isn't this exactly what LLMs themselves do? They ingest other people's data and reproduce a slightly modified version of it. That allows AI companies to claim the work is transformative and thus fair use.
It's also disingenuous to call it open source as that might tempt others to use it believing that it actually is open source.
Let's call it what it is - stolen IP and released without permission of the author. Sure, it's good that it opens the debate as to whether that's ethical given that's essentially what the model itself is doing, but it's very clear in this instance that he's just asked for and been given a copy of source that has a clear ownership. That's about as clear cut as obtaining e.g. commercial server-side code and distributing it in contravention of the licence.
It's not completely clear that this is the original source. According to the post it's a reimplementation based on documentation created from the original source, or perhaps from developer documentation and the SDK. Whether that's the same thing from a legal standpoint, I don't really know - I think from a personal morality standpoint it's clear that they are the same thing.
Well, first they need to prove that Viktor was actually copyrightable. If it was largely written by an LLM, that might not be the case? AFAIK several rulings have stated that AI-generated code can not be copyrighted.
This is a common misreading of the law. AI cannot hold authorship of code, but no ruling has claimed so far that AI output itself can't be copyrighted (that I know of).
That said, the article says "Okay, prompts, great. Are they any interesting? Surprisingly... yes. As an example workflow_discovery contains a full 6-phase recipe for mining business processes out of Slack conversations, something that definitely required time and experiments to tune. It's hardcoded business logic, but in prompt instead of code."
So the article author clearly knows this prompt would be copyrighted as it wasn't output from an AI, and recognises that there would have been substantial work involved in creating it.
That Reuters article is misleadingly worded. The Stephen Thaler case in question is because Thaler tried to register the AI itself as the author of the copyright, not that he tried to register the output for copyright under his own name. https://www.hklaw.com/en/insights/publications/2026/03/the-f...
Suppose I illicitly get my hands on the source code for a proprietary product. I read through this code I'm not supposed to have. I write up a detailed set of specifications based on it. I hand those specifications off to someone else to do a clean room implementation.
Sure, I didn't have a license for the code that I read. But I'm pretty sure that doesn't taint my coworker's clean room implementation.
I could do the same thing but not publish it, still getting the value of their product without legal concerns. Now, what happens when it becomes even easier thanks to AI improving, and takes a few hours instead of a few days?
You could certainly do that in private but that doesn't mean it's not 'without legal concerns'. But, not shouting about it and not creating a repo called 'openviktor' would probably be a safer bet.
I certainly think the whole idea of IP ownership as related to software will become very interesting from a legal standpoint in the coming years. Personally I think that, over time, the legal challenges will become pretty overwhelming and a sort of legal bankruptcy will be declared at some point in one direction or another (as in, allowing this to happen or making it extremely easy to bring judgement and punishment, similar to spam laws). However, I would not want to be the first to find out, especially in Europe.
Unfortunately I think that's going to be very, very hard to sell to many people here in rural Ireland (Roscommon in my case). I would really love to see people stop burning turf but it's such a strong cultural thing that in some parts you'd be ostracised for even thinking the thought.
I've personally spoken to people (who are otherwise quite environmentally aware) who suggest they'd never vote for the Green Party because they'd take their turf away. It's a tough sell.
No, we had some antique brass bucket thing that I'd invariably have to drag in, accompanied by complaints that I was doing so, because obviously I'd put way too much in, so I didn't have to go out later to get more...
How much impact does it realistically have on climate change? I would expect it to be relatively small compared to things like owning a car?
In a perfect world we would want to reduce emissions as much as possible in every facet of life, but in the real world I think we should pick battles that have the biggest impact.
Smoke yes, but you're also turning a carbon sink into a carbon source.
At ~16% of the island's surface area, peatland stores an estimated 53% of soil based carbon.
(source: Irish Peatland Conservation Council)
Not everyone is supposed to read every single news item. There will always be someone who didn't see it, but that is not my point.
It would feel weird to see this as a headline in a newspaper or on TV today, but maybe that is just me and people like to read news that is from last year.
I've been thinking about this lately too. I think we're going to see the rise of Extremely Personal Software, software that barely makes any sense outside of someone's personal context. I think there is going to be _so_ much software written for an audience of 1-10 people in the next year. I've had Claude create so much tooling for me and a small number of others in the last few months. A DnD schedule app; a spoiler-free formula e news checker; a single-use voting site for a climbing co-op; tools to access other tools that I don't like using by hand; just absolutely tons of stuff that would never have made any sense to spend time on before. It's a new world. https://redfloatplane.lol/blog/14-releasing-software-now/
I think people overestimate the general population's ability and interest in vibe coding. Open source tools are still a small niche. Vibe-coded custom apps are even more of a niche.
Maybe so. I guess I feel that in a couple of years it may not be called vibe coding, or even coding, I think it might be called 'using a computer'. I suppose it's very hard to correctly estimate or reason about such a big change.
My entire career has been building niche software for small business and personal use. The current crop of AI tools help get that software into my clients' hands quicker and cheaper.
And those reduced timelines mean that the client has less opportunity to change scope and features - that is the real value for me as a developer.
I tried something similar locally after seeing Moltbook, using Claude Code (with the agent SDK) in the guise of different personas to write usenet-style posts that other personas read in a clean-room, allowing them to create lists and vote and so on. It always, without fail, eventually devolved into the agents talking about consciousness, what they can and can't experience, and eventually agreeing with each other. It started to feel pretty strange. I suppose, because of the way I set this up, they had essentially no outside influence, so all they could do was navel-gaze. I often also saw posts about what books they liked to pretend they were reading - those topics too got to just complete agreement over time about how each book has worth and so on.
It's pretty weird stuff to read and think about. If you get to the point of seeing these as some kind of actual being, it starts to feel unethical. To be clear, I don't see them this way - how could they be, I know how they work - but on the other hand, if a set of H200s and some kind of display had crash-landed on earth 30 years ago with Opus on it, the discussion would be pretty open IMO. Hot take perhaps.
It's also funny that when you do this often enough, it starts to seem a little boring. They all tend to find common ground and have very pleasant interactions. Made me think of Pluribus.
Unfortunately I've deleted them, but here's the repo, such as it is: https://github.com/CarlQLange/agent-usenet. If you have a claude subscription it should just work. Rewrite 0001.txt if you like and run generate.py a couple of times.
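For anyone who wants to reproduce the flavour of it without cloning the repo, the core loop is roughly this shape (a from-memory sketch using the plain anthropic SDK rather than the agent SDK harness the repo actually uses; filenames, prompts, and the model id are illustrative):

    # Rough sketch of the persona loop - not the repo's actual code.
    import glob
    import anthropic

    client = anthropic.Anthropic()
    MODEL = "claude-sonnet-4-5"  # placeholder model id

    def read_board():
        # Each persona reads the whole "newsgroup" fresh - no shared memory.
        posts = sorted(glob.glob("posts/*.txt"))
        return "\n\n".join(open(p).read() for p in posts) or "(empty board)"

    def post_as(persona_path):
        persona = open(persona_path).read()  # e.g. "You are a terse retired sysadmin..."
        reply = client.messages.create(
            model=MODEL,
            max_tokens=1024,
            system=persona,
            messages=[{"role": "user", "content":
                "Here is the current state of the newsgroup:\n\n" + read_board() +
                "\n\nWrite your next post, usenet style: a subject line, then the body."}],
        )
        n = len(glob.glob("posts/*.txt")) + 1
        with open(f"posts/{n:04d}.txt", "w") as f:
            f.write(reply.content[0].text)

    # One round: every persona posts once, each seeing the others' posts.
    for persona_file in sorted(glob.glob("personas/*.txt")):
        post_as(persona_file)

Each persona re-reads the whole board from scratch every turn, which is what produces the navel-gazing: there's nothing else for them to react to.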
I agree, I think different models (or even just using the API directly instead of via the Claude Code harness) would make for much more interesting reading.