Highlights
- Produce builds for Python 3.12 (#236)
- Add a simple configuration API
- Add surface projections (#230)
Surface Projections
- For chiTra compatibility, SudachiPy can now directly produce different tokens in the surface field.
- The original surface is accessible via the Morpheme.raw_surface() method.
- Projection can be customized dictionary-wide, by passing a Config object when creating a dictionary, or for a single pre-tokenizer.
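
To illustrate the idea behind projections, here is a minimal toy model in plain Python. It is not SudachiPy's implementation and does not import the library: the names ToyMorpheme and resolve_projection, and the projection labels used here, are hypothetical and chosen only for this sketch. It shows a surface field that returns a projected form while the raw surface stays available, and a dictionary-wide default that a single pre-tokenizer could override.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ToyMorpheme:
    """Hypothetical stand-in for a morpheme; not the SudachiPy class."""

    raw: str                        # text exactly as it appeared in the input
    normalized: str                 # normalized form of the token
    reading: str                    # reading form of the token
    projection: str = "surface"     # which form surface() should return

    def raw_surface(self) -> str:
        # Always the original text, whatever projection is active.
        return self.raw

    def surface(self) -> str:
        # Return the projected form; fall back to the raw text.
        if self.projection == "normalized":
            return self.normalized
        if self.projection == "reading":
            return self.reading
        return self.raw


def resolve_projection(dictionary_default: str,
                       override: Optional[str] = None) -> str:
    """Pick the per-pre-tokenizer override if given, else the dictionary default."""
    return override if override is not None else dictionary_default


# Dictionary-wide default is "normalized"; one pre-tokenizer overrides it.
default_choice = resolve_projection("normalized")
override_choice = resolve_projection("normalized", override="reading")

m_default = ToyMorpheme("附属", "付属", "フゾク", projection=default_choice)
m_override = ToyMorpheme("附属", "付属", "フゾク", projection=override_choice)
```

In this sketch, m_default.surface() yields the normalized form and m_override.surface() yields the reading, while raw_surface() returns the original text in both cases. The actual configuration fields and accepted projection values should be checked against the SudachiPy documentation.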