Hacker Newsnew | past | comments | ask | show | jobs | submit | cvwright's commentslogin

Unless the data is a lagging indicator

Are Mississippi and Louisiana at the top of the pay scale?

Then why are their reading scores improving so dramatically compared to wealthier states? Especially for under-privileged populations?


https://en.wikipedia.org/wiki/Mississippi_Miracle

> This embrace of phonics education and the near-complete rejection of whole language theory was a key component of the program's success.


Because the internet addictions/phones everywhere mean the average dumbass kid is reading more actual text than any kid beforehand on average. This is why the missippi miracle is happening. Well, that, and reducing the amount of actual corporal punishment administrators can doll out (paddling students in public school is legal in the shithole Deep South )

I only saw modest improvement in reading scores.

Because the Brits will never be accepted by the locals as a Thai or Vietnamese or whatever, and they know it.

Whereas in western countries we have “new Germans” or “new British” from all over the world.


It’s easy to find sketchy lines of code in any large C project.

The big advance that they are claiming with Mythos is the ability to triage all the hundreds of candidate vulns and automatically generate exploits to prove that the real ones are real. And if they’re really finding 27-yr-old 0-days in OpenBSD, then it’s not just hype.


I do not think you need a great model to do this, just great automation. There’s a reason they haven’t open sourced the actual process in which did this, stubbing out the mythos model itself.

About five minutes in in this video: https://www.youtube.com/watch?v=1sd26pWhfmg

They also say publicly in their Opus 4.6 post (https://red.anthropic.com/2026/zero-days/):

>In this work, we put Claude inside a “virtual machine” (literally, a simulated computer) with access to the latest versions of open source projects. We gave it standard utilities (e.g., the standard coreutils or Python) and vulnerability analysis tools (e.g., debuggers or fuzzers), but we didn’t provide any special instructions on how to use these tools, nor did we provide a custom harness that would have given it specialized knowledge about how to better find vulnerabilities. This means we were directly testing Claude’s “out-of-the-box” capabilities, relying solely on the fact that modern large language models are generally-capable agents that can already reason about how to best make use of the tools available.


Again, marketing materials by Anthropic. You realize this is by anthropic themselves right? And again, not reproducible by outsiders. So useless.

You've moved goalposts from "they haven't open-sourced the process" to "these are marketing materials by Anthropic".

I think you're right to be skeptical, but they _have_ talked about the process publicly.

And I don't think there's anything there that is not reproducible by outsiders? They have access to the same Opus 4.6 that you and I do; though not having to pay for the tokens certainly helps.

I'm pretty sure if you wanted to burn a couple thousand bucks, you'd reproduce at least some of these findings.


The goal post is the same, reproducible. Talking about a process isn’t reproducible. This entire discussion is why I feel developers are so gullible. You are defending a process that’s entirely opaque and you can’t even use. It’s crazy.

What's the CVE for the 27-yr-old 0-day in OpenBSD?

Depends on the impact? CVE scores are known to be a worthless metric when looking at the actual impact.

Linux now labels every single bug as a CVE.


I think they mean what is the actual vulnerability and not the score.

Right. It’s things like Baltimore (when I lived there) requiring that high speed internet had to roll out in poor areas first, before it could go into the rich neighborhoods.

But this was the early 2000s and the internet was still “new”. Only the richer areas cared and were willing to pay the price. Letting them have first (or even equal!) access would have made it easier to fund the rollout in low income areas.


I thought that was kind of how the hard sciences work already?

My grad school friend who was a physicist would write his talk just before his conferences, and then submit the paper later. My experience in CS was totally backwards from that.


Find-then-patch only works if you can fix the bugs quicker than you’re creating new ones.

Some orgs will be able to do this, some won’t.


"Find me vulnerabilities in this PR."


You’re conveniently ignoring the Olympic boxing champion from 2024 who beat the absolute shit out of the female competitors.


Are you talking about Imane Khelif? The woman who was born a woman, competed her whole life as a woman and is still last time I checked, a woman?


A woman with an SRY gene undergoing treatment to reduce testosterone levels to typical female level. https://nypost.com/2026/02/06/sports/imane-khelif-opens-up-o...


Or maybe this is like fondly remembering the busted economy car that you drove around with your friends? I have my first 386DX sitting on my desk right now and it looks exactly like the top left of that photo.

The hot car that we all lusted after was maybe something like a SGI Indy or an O2.


You’re not wrong, but you probably could have built the thing with Claude in the time it took you to write this comment.


Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: