-
Notifications
You must be signed in to change notification settings - Fork 48
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unable to process newer mrd/gramtab files #27
Comments
Welp, the fix is actually quite easy. The newer gramtab/mrd files are in UTF-8 instead of Win1251. Changing the encoding resulted in successful generation of a newer morph.info. |
Could you explain which files need to be updated to get a newer version of the dictionaries? |
Or just use this one: https://github.com/Evengard/LuceneNetRussianMorphologyNetCore/blob/master/LuceneNetRussianMorphology/Resources/ru_morph.info - which I already generated myself 5 months ago or so. |
I tried to generate a new morph.info file from newer mrd/gramtab files from https://github.com/sokirko74/morph_dict/tree/master/data/Russian. Unfortunately, it failed - the resulting morph.info was way too small and didn't contain anything useful.
Could you please fix the parsing of theese files to allow using a more fresh and complete dictionary?
The text was updated successfully, but these errors were encountered: