its not that loose of a version. its the reality and as probably is surely a focus of a dedicated post training RL-ing these kind of githubs. of course you would train specifically on the task. you would mix this eval data with others in thousands of githubs repos.
there is no digital sovereignty in european cloud. first they would bow down to any bigger instance of power that ask for their data and being smaller companies than azure aws gcp they wouldnt have the firepower to fight back against governments. Like this one : https://www.theregister.com/off-prem/2025/11/27/canadian-dat...
second, europe has the most digitally agressive roadmap in the democratic world right now. they plan to ban vpn, enforce agressive data laws that give full power to authorities and gov to extract legally your data from your "sovereign cloud", remove anonymity from the web, enforce a cashless distopia where they can track everything and block you from using your own cash, punish you with laws against hate speech where governements decide what they define as hate speech depending on who is in power.
finally for his choices. Mistral while riding the european sovereignity wave is in fact an american owned company with european founders and the french gov trying to kill anything that they dont like touching Mistral.
OVH while a good company is definitly not providing US cloud-level data resiliency and recent events are pretty worrisome from data loss fire and hacks on customer data
Yes, we can crawl our entire internal documentation via LLM. Want to know if someone is already working in the space of your latest idea? Ask Claude, it hits the internal search APIs and finds docs and references directly relevant to your query. There are a lot of separate document stores so this took a lot of effort previously. I can also query Slack, Outlook, etc. I don’t understand the cynicism in your comment.
Not OP, but within Amazon we have pretty good connectors around integrating with our task system (so you can pretty easily ask your GenAI tool "look up the next item in our sprint board, let me know if you have any clarifying questions, but otherwise start implementing it"). We have decent integration with internal wiki and search systems, so it's easier now to figure out the best Amazon way to do some coding task. And Amazon being a big doc-writing company, there are lots of great tools for helping improve all phases of writing.
I found it very useful running a TDD workflow the other day. It created a test plan, generated tests, documented them, implemented and modified existing code, and added structured logging. It also identified really good refactor candidates and explained them to me after I noted a core design issue in the code we were modifying. This wasn't autonomous: I spent some time correcting it and sending it in new directions. Still, it was a pretty nice feeling to not have to go manually configure Logback (it one shotted a nice basic config), not have to write a bunch of repetitive test setup code, etc. It even pulled in a newer JUnit feature that I didn't know about that was perfect for what I was doing. Definitely not the silver bullet a lot of people are trying to sell, but still a very powerful tool.
A company requires a specific % of code coverage but doesn't give developers enough time to actually write tests. AI can be used to generate the tests needed to get pass the code coverage and avoid being fired for not working fast enough.
I'm an Android user but iPhones are good value. Most people use their phone a lot so it's worth paying extra for something capable, reliable, easy to use.
reply