
retinaros · 2026-05-14T10:15:30 1778753730

that is for sure what everyone does. also, they train on evals, using the datasets that they would be benchmarked against.

tedsanders · 2026-05-14T16:21:20 1778775680

What do you mean by this? We don’t train on evals, and if we did I’d quit on the spot.

(The loose version of this that’s true is that there may exist eval data contamination in pretraining. This is a hard problem to fully solve.)

retinaros · 2026-05-14T18:04:37 1778781877

it's not that loose of a version. it's the reality, and it is probably a focus of dedicated post-training: RL-ing on these kinds of GitHub repos. of course you would train specifically on the task. you would mix this eval data with other data across thousands of GitHub repos.

retinaros · 2026-05-14T07:05:54 1778742354

there is no digital sovereignty in European cloud. first, they would bow down to any bigger power that asks for their data, and being smaller companies than Azure, AWS, or GCP, they wouldn't have the firepower to fight back against governments. Like this one: https://www.theregister.com/off-prem/2025/11/27/canadian-dat...

second, Europe has the most digitally aggressive roadmap in the democratic world right now. they plan to ban VPNs, enforce aggressive data laws that give authorities and governments full power to legally extract your data from your "sovereign cloud", remove anonymity from the web, enforce a cashless dystopia where they can track everything and block you from using your own cash, and punish you with laws against hate speech, where governments define hate speech depending on who is in power.

finally, on his choices: Mistral, while riding the European sovereignty wave, is in fact an American-owned company with European founders, with the French gov trying to kill anything they don't like touching Mistral.

OVH, while a good company, is definitely not providing US cloud-level data resiliency, and recent events are pretty worrisome, from data loss in a fire to hacks on customer data.

Proton, maybe the only company that ever looked for sovereignty, is thinking about leaving Switzerland due to these oppressive laws. https://www.techradar.com/vpn/vpn-privacy-security/we-would-...

also, he kept the only company that is vibecoding in prod (Cloudflare) and is proud of it, while laying off people based on the AI religion.

It is like he made all the wrong choices if his goal was, as he says, to own his data and know "where is the data".

retinaros · 2026-05-12T17:14:13 1778606053

We can tell they are using AI

retinaros · 2026-05-12T17:12:47 1778605967

It's not really 60%. It accelerates code creation a lot. It saves some time on admin tasks. That is it.

retinaros · 2026-05-12T17:11:32 1778605892

Could you list some of the capabilities you use that bring value besides “summarize my email”?

morelandjs · 2026-05-12T17:21:32 1778606492

Yes, we can crawl our entire internal documentation via LLM. Want to know if someone is already working in the space of your latest idea? Ask Claude, it hits the internal search APIs and finds docs and references directly relevant to your query. There are a lot of separate document stores so this took a lot of effort previously. I can also query Slack, Outlook, etc. I don’t understand the cynicism in your comment.

retinaros · 2026-05-12T17:54:02 1778608442

That is “summarize my wiki”. Nice search feature.

Leynos · 2026-05-12T19:00:36 1778612436

The trouble is, it's here now, and it wasn't before.

That may be an "enterprise SaaS is shit" problem, but I'm just happy that my employer now has a wiki search that works.

jkingsbery · 2026-05-12T17:18:56 1778606336

Not OP, but within Amazon we have pretty good connectors around integrating with our task system (so you can pretty easily ask your GenAI tool "look up the next item in our sprint board, let me know if you have any clarifying questions, but otherwise start implementing it"). We have decent integration with internal wiki and search systems, so it's easier now to figure out the best Amazon way to do some coding task. And Amazon being a big doc-writing company, there are lots of great tools for helping improve all phases of writing.

thisoneisreal · 2026-05-13T00:24:36 1778631876

I found it very useful running a TDD workflow the other day. It created a test plan, generated tests, documented them, implemented and modified existing code, and added structured logging. It also identified really good refactor candidates and explained them to me after I noted a core design issue in the code we were modifying. This wasn't autonomous: I spent some time correcting it and sending it in new directions. Still, it was a pretty nice feeling to not have to go manually configure Logback (it one shotted a nice basic config), not have to write a bunch of repetitive test setup code, etc. It even pulled in a newer JUnit feature that I didn't know about that was perfect for what I was doing. Definitely not the silver bullet a lot of people are trying to sell, but still a very powerful tool.

harimau777 · 2026-05-12T20:56:09 1778619369

A company requires a specific % of code coverage but doesn't give developers enough time to actually write tests. AI can be used to generate the tests needed to get past the code coverage requirement and avoid being fired for not working fast enough.

retinaros · 2026-05-12T17:10:01 1778605801

Vibecoded ppt, docs, and frontends are an even bigger scam than crypto ever was. Ofc people are getting sucked into it.

traderj0e · 2026-05-12T20:09:53 1778616593

Are the AI tokens fungible though?

retinaros · 2026-05-11T22:23:00 1778538180

Is there a Polymarket for when it's gonna be layoffs due to hantavirus overhiring?

retinaros · 2026-05-11T22:18:18 1778537898

Not really. Middle management is there to be in meetings all day long with nothing produced but identifying low performers.

retinaros · 2026-05-11T22:16:06 1778537766

« See how agentic AI transforms software delivery »

retinaros · 2026-05-10T07:33:15 1778398395

iPhone is a vanity item. this is something unique and handcrafted.

fwipsy · 2026-05-10T14:16:03 1778422563

I'm an Android user but iPhones are good value. Most people use their phone a lot so it's worth paying extra for something capable, reliable, easy to use.