Previous release was not as well-tested as it should have been. Sorry 😢
This version fixes a problem with the encoding while processing a file using cucco CLI.
Cucco is back with a minor release.
This version fixes issue #42, adding the possibility to normalize a single file using cucco CLI. The library itself has suffered an small change too. Normalization function 'remove_extra_whitespaces' has been renamed to 'remove_extra_white_spaces'. This means that any previous code or config file using this function will break if using the last version. Sorry for that 😞
Happy normalization 🐔
This minor release is a needed step to be able to use the library in an API.
remove_stop_words function allows to specify the language to use. Also, the little cucco in not as lazy as before and will always load the stop words file for the language indicated in the Configuration class. If lazy_load is not used, all of them will be loaded.
Yay! Cucco has reached version 2 and it comes with some nice goodies.
But before some words from our sponsors. I'll give the floor to Mike, CEO of Feathers & CO:
Shut your beak and tell me what's new
Thank you all for this invitation.
To open this event I would like to talk about...
Ok ok, so here is the list of new features for cucco:
Not yet, but I'm working on this. All the docs for cucco will be available at cucco.io cucco-soon. In the meantime...
Run Mike! Run for your life!
This version improve the stop words removal functionality adding support for 50 languages and a simplified format for the stop words files (one word per line without comments).
Almost two years after the first release of the Python text normalizer, version 1.0.0 is released.
What is new?
Special thanks to @feinsteinben who helped to extend the library and, more important, helped me to get some motivation to keep improving it.