Skip to content

Releases: roshan-research/hazm

Hazm 0.10.0

16 Jan 16:49
270e897
Compare
Choose a tag to compare

Download pretrained-models

Full Changelog: v0.9.4...v0.10.0

Hazm 0.9.4

01 Oct 20:12
Compare
Choose a tag to compare

Download pretrained-models

Full Changelog: v0.9.3...v0.9.4

Hazm 0.9.3

19 Jul 15:25
Compare
Choose a tag to compare

Fixed

  • Fix critical bug in Lemmatizer that caused incorrect lemmatization of certain words. @sir-kokabi.
  • Fix bug caused WikipediaReader to not work as before #287. @sir-kokabi.
  • Fix missing imports for WikipediaReaderand PersianPlainTextReader #286. @sir-kokabi.
  • Fix some issues in the demo to make it compatible with the latest version of Hazm. @sir-kokabi.
  • Fix a few issues related to tests and mkdocs build. @sir-kokabi.
  • Improve documentation. @sir-kokabi.
  • improve dependency tree visualization on the demo page. @sir-kokabi.

Download pretrained-models

Full Changelog: v0.9.2...v0.9.3

Hazm 0.9.2

08 Jul 12:48
Compare
Choose a tag to compare

Added

Fixed

Download pretrained-models

Full Changelog: v0.9...v0.9.2

Hazm 0.9

20 May 15:48
548c4b1
Compare
Choose a tag to compare

Added

  • Windows compaitiblity by using Python-crfsuite instead of Wapiti. @E-Ghafour.
  • Pretrained Chunker and POSTagger models with Python-crfsuite. @E-Ghafour.
  • new parameters in Normalizer to better text processing. @sir-kokabi.
  • Three regex patterns in Normalizer to fix ZWNJs and spacing issues. @sir-kokabi.
  • 400 Non-standard unicode characters to be replaced in Normalizer. @sir-kokabi.
  • 40,000+ new words to improve Lemmatizer and Tokenizer. @sir-kokabi.
  • train function for Word2vec and Sent2vec modules in Embedding. @E-Ghafour.
  • Implement keywordExtraction with the embedRank approach as a sample of Hazm usage. @E-Ghafour.
  • Support Universal tags in POSTagger. @E-Ghafour.
  • Support universal POS mapper in PeykareReader & DadeganReader (#239). @phsfr.
  • PersianPlainTextReader to process raw text datasets (#120). @mhbashari.
  • Support EZ tag in PeykareReader. @E-Ghafour.
  • Slash & back-slash (/ ) support in Tokenizer (#102). @elahimanesh.
  • Conjugation class to handle verb conjugation. @sir-kokabi.

Fixed

Changed

  • Drop Python 2 support and migrate all code to Python 3. @sir-kokabi.
  • Use data_maker function instead of patterns in SequenceTagger. @E-Ghafour.
  • Refactor IOBTagger and POSTagger to be compatible with data_maker. @E_Ghafour.
  • Change می روم to می‌روم in example (#203). @SMSadegh19.
  • Overhaul the project structure and GitHub repo. @sir-kokabi.

Download Pretrained models

Full Changelog: v0.8.2...v0.9

Hazm 0.8

29 Nov 12:03
Compare
Choose a tag to compare

Release notes:

  • Add WordEmbedding (Download the pre-trained model(Fasttext) from here)
  • Add SentenceEmbedding (Download the pre-trained model from here)
  • Add Documentation webpage (link)
  • Improve normalizer, informal normalizer, and tokenizer
  • Add Degarbayan and MirasText corpus reader

What's Changed

New Contributors

Full Changelog: v0.7...v0.8.2

Hazm 0.7

12 Oct 12:03
Compare
Choose a tag to compare
v0.7

update release version

Hazm 0.5

20 Mar 11:01
Compare
Choose a tag to compare
v0.5

fix travis script

Hazm 0.4

16 Dec 04:30
Compare
Choose a tag to compare
v0.4

update setup file

Hazm 0.3

29 Aug 02:35
Compare
Choose a tag to compare
v0.3

fix tests for new release