I'm trying to parse some Japanese text, and I can't seem to figure out the output encoding.
This is the output I'm getting:
これは ̾��,����,*,*,*,*,*
本 ̾��,����,*,*,*,*,*
です ̾��,����,*,*,*,*,*
。 ̾��,������³,*,*,*,*,*
EOS
Steps I took:
git clone https://github.com/taku910/mecabcd mecab/mecab./configure --enable-utf8-only --with-charset=utf8makesudo make installmecab -o ~/Desktop/output.txt ~/Desktop/input.txt, whereinput.txtcontains "これは本です。"
Using OSX 10.15.3