Awesome work! I've built similar projects like this but nothing at this scale, or this polished. I have a bunch of questions:
How do you deal with bad sentences/mistakes in the source data? Even the best dictionaries I look at often have very odd example sentences (at least in Japanese/English dictionaries). Do you have any plans to vet for things like this?
You measure word difficulty by frequency, but do you do any heuristics for sentence difficulty?
Do you have any idea if you method works? I'm not attacking you, what your site does is very similar to what I did on my own to learn a second language, but having hard data would be great.
Again, I really love the site, keep up the good work!
- bad sentences/mistakes - users can report sentences with errors. I'm notified and those sentences are then removed from their queue. Pro users can also ignore sentences, I'm thinking I may make this a free feature in the future as well.
- Difficulty is just by word frequency at the moment, what kind of heuristics do you have in mind?
- I'd definitely like to be able to measure the effectiveness of Clozemaster somehow, but I'm not sure what kind of hard data I could come up with. Perhaps in the future I can come up with some kind of test/experiment to compare traditional single word flashcards vs. Clozemaster, or test reading comprehension somehow after playing Clozemaster for a certain period of time.
How do you deal with bad sentences/mistakes in the source data? Even the best dictionaries I look at often have very odd example sentences (at least in Japanese/English dictionaries). Do you have any plans to vet for things like this?
You measure word difficulty by frequency, but do you do any heuristics for sentence difficulty?
Do you have any idea if you method works? I'm not attacking you, what your site does is very similar to what I did on my own to learn a second language, but having hard data would be great.
Again, I really love the site, keep up the good work!