Add a Rikai-chan based parsing of Japanese texts (or something like that!).
Use Rikai-chan (Firefox Japanese plugin) to parse Japanese texts into words. Current scheme either counts kanji or sentences, any serious reader wouldn't have the time to go through putting spaces in every text....
LWT is a tool for many languages, not only Japanese. Please understand that I cannot implement algorithms, etc. for specific languages. You can use a standalone Japanese word splitter, put in your text and paste the result into LWT. Or you put into LWT the 10000 or 20000 most important Japanese terms, LWT will automatically recognize these.
1 comment
-
Kevin
commented
OK, that is understandable. I have since discovered how to use MeCab manually to parse spaces into files. If you have any reference to a list of 10K or 20K important Japanese terms formatted for LWT that would be nice :D Thanks for the response!