Tesla FSD is quickly becoming less of a software problem and more of a problem of semantics.
If the car drives someone to and from work 30 days in a row without a problem, is it truly FSD? What about 300 days? Where do you draw the line? 1000x safer than the average human?
Same thing here with AI. How many conversations with GPT-X need to happen without a stupid response from GPT before we call it real-world AI?
How about first getting to "as safe/performant as a non-drunk, non-sleep-deprived, non-brand-new driver with 0 human intervention" before asking more advanced questions?