vendor/tomotopy/README.rst in tomoto-0.1.4 vs vendor/tomotopy/README.rst in tomoto-0.2.0

- old
+ new

@@ -30,30 +30,29 @@ * Hierarchical LDA (`tomotopy.HLDAModel`) * Multi Grain LDA (`tomotopy.MGLDAModel`) * Pachinko Allocation (`tomotopy.PAModel`) * Hierarchical PA (`tomotopy.HPAModel`) * Correlated Topic Model (`tomotopy.CTModel`) -* Dynamic Topic Model (`tomotopy.DTModel`). +* Dynamic Topic Model (`tomotopy.DTModel`) +* Pseudo-document based Topic Model (`tomotopy.PTModel`). Please visit https://bab2min.github.io/tomotopy to see more information. -The most recent version of tomotopy is 0.10.2. - Getting Started --------------- You can install tomotopy easily using pip. (https://pypi.org/project/tomotopy/) :: $ pip install --upgrade pip $ pip install tomotopy The supported OS and Python versions are: -* Linux (x86-64) with Python >= 3.5 -* macOS >= 10.13 with Python >= 3.5 -* Windows 7 or later (x86, x86-64) with Python >= 3.5 -* Other OS with Python >= 3.5: Compilation from source code required (with c++11 compatible compiler) +* Linux (x86-64) with Python >= 3.6 +* macOS >= 10.13 with Python >= 3.6 +* Windows 7 or later (x86, x86-64) with Python >= 3.6 +* Other OS with Python >= 3.6: Compilation from source code required (with c++14 compatible compiler) After installing, you can start tomotopy by just importing. :: import tomotopy as tp @@ -259,10 +258,32 @@ `tomotopy` is licensed under the terms of MIT License, meaning you can use it for any reasonable purpose and remain in complete ownership of all the documentation you produce. History ------- +* 0.12.0 (2021-04-26) + * Now `tomotopy.DMRModel` and `tomotopy.GDMRModel` support multiple values of metadata (see https://github.com/bab2min/tomotopy/blob/main/examples/dmr_multi_label.py ) + * The performance of `tomotopy.GDMRModel` was improved. + * A `copy()` method has been added for all topic models to do a deep copy. + * An issue was fixed where words that are excluded from training (by `min_cf`, `min_df`) have incorrect topic id. Now all excluded words have `-1` as topic id. + * Now all exceptions and warnings that generated by `tomotopy` follow standard Python types. + * Compiler requirements have been raised to C++14. + +* 0.11.1 (2021-03-28) + * A critical bug of asymmetric alphas was fixed. Due to this bug, version 0.11.0 has been removed from releases. + +* 0.11.0 (2021-03-26) (removed) + * A new topic model `tomotopy.PTModel` for short texts was added into the package. + * An issue was fixed where `tomotopy.HDPModel.infer` causes a segmentation fault sometimes. + * A mismatch of numpy API version was fixed. + * Now asymmetric document-topic priors are supported. + * Serializing topic models to `bytes` in memory is supported. + * An argument `normalize` was added to `get_topic_dist()`, `get_topic_word_dist()` and `get_sub_topic_dist()` for controlling normalization of results. + * Now `tomotopy.DMRModel.lambdas` and `tomotopy.DMRModel.alpha` give correct values. + * Categorical metadata supports for `tomotopy.GDMRModel` were added (see https://github.com/bab2min/tomotopy/blob/main/examples/gdmr_both_categorical_and_numerical.py ). + * Python3.5 support was dropped. + * 0.10.2 (2021-02-16) * An issue was fixed where `tomotopy.CTModel.train` fails with large K. * An issue was fixed where `tomotopy.utils.Corpus` loses their `uid` values. * 0.10.1 (2021-02-14) @@ -271,10 +292,10 @@ * An issue was fixed where `tomotopy.HLDAModel.infer` generates wrong `tomotopy.Document.path`. * Since a new parameter `freeze_topics` for `tomotopy.HLDAModel.train` was added, you can control whether to create a new topic or not when training. * 0.10.0 (2020-12-19) * The interface of `tomotopy.utils.Corpus` and of `tomotopy.LDAModel.docs` were unified. Now you can access the document in corpus with the same manner. - * __getitem__ of `tomotopy.utils.Corpus` was improved. Not only indexing by int, but also by Iterable[int], slicing are supported. Also indexing by uid is supported. + * `__getitem__` of `tomotopy.utils.Corpus` was improved. Not only indexing by int, but also by Iterable[int], slicing are supported. Also indexing by uid is supported. * New methods `tomotopy.utils.Corpus.extract_ngrams` and `tomotopy.utils.Corpus.concat_ngrams` were added. They extracts n-gram collocations using PMI and concatenates them into a single words. * A new method `tomotopy.LDAModel.add_corpus` was added, and `tomotopy.LDAModel.infer` can receive corpus as input. * A new module `tomotopy.coherence` was added. It provides the way to calculate coherence of the model. * A paramter `window_size` was added to `tomotopy.label.FoRelevance`. * An issue was fixed where NaN often occurs when training `tomotopy.HDPModel`.