diff --git a/README.rst b/README.rst
index e6315f9..d86ca9b 100644
--- a/README.rst
+++ b/README.rst
@@ -93,12 +93,6 @@ Limits
 Installation
 ============
 
-.. warning::
-
-    This package is currently in pre-release testing phase for version 0.9.0. The latest release candidate is available
-    on PyPI and can be only installed when explicitly stating the exact version and release candidate, e.g. with
-    ``pip install tmtoolkit[recommended]==0.9.0rc3``.
-
 The package *tmtoolkit* is available on `PyPI `_ and can be installed via Python package manager *pip*. It is highly recommended to install tmtoolkit and its dependencies in a
diff --git a/doc/source/data/news_articles_100.xlsx b/doc/source/data/news_articles_100.xlsx
index 50ae51d..c498e20 100644
Binary files a/doc/source/data/news_articles_100.xlsx and b/doc/source/data/news_articles_100.xlsx differ
diff --git a/doc/source/install.rst b/doc/source/install.rst
index 37ab46e..37d879e 100644
--- a/doc/source/install.rst
+++ b/doc/source/install.rst
@@ -3,12 +3,6 @@
 Installation
 ============
 
-.. warning::
-
-    This package is currently in pre-release testing phase for version 0.9.0. The latest release candidate is available
-    on PyPI and can be only installed when explicitly stating the exact version and release candidate, e.g. with
-    ``pip install tmtoolkit[recommended]==0.9.0rc3``.
-
 The package *tmtoolkit* is available on `PyPI `_ and can be installed via Python package manager *pip*.
 It is highly recommended to install tmtoolkit and its dependencies in a `Python Virtual Environment ("venv") `_ and upgrade to the latest
diff --git a/doc/source/intro.rst b/doc/source/intro.rst
index 4a7eb09..5a4321e 100644
--- a/doc/source/intro.rst
+++ b/doc/source/intro.rst
@@ -1,6 +1,8 @@
 tmtookit: Text mining and topic modeling toolkit
 ================================================
 
+|pypi| |pypi_downloads| |rtd| |travis| |coverage|
+
 *tmtoolkit* is a set of tools for text mining and topic modeling with Python developed especially for the use in the social sciences. It aims for easy installation, extensive documentation and a clear programming interface while offering good performance on large datasets by the means of vectorized operations (via NumPy) and parallel computation
@@ -112,3 +114,24 @@ to follow along using these notebooks, you can
 There are also a few other examples as plain Python scripts available in the `examples folder `_ of the GitHub repository.
+
+
+.. |pypi| image:: https://badge.fury.io/py/tmtoolkit.svg
+    :target: https://badge.fury.io/py/tmtoolkit
+    :alt: PyPI Version
+
+.. |pypi_downloads| image:: https://img.shields.io/pypi/dm/tmtoolkit
+    :target: https://pypi.org/project/tmtoolkit/
+    :alt: Downloads from PyPI
+
+.. |travis| image:: https://travis-ci.org/WZBSocialScienceCenter/tmtoolkit.svg?branch=master
+    :target: https://travis-ci.org/WZBSocialScienceCenter/tmtoolkit
+    :alt: Travis CI Build Status
+
+.. |coverage| image:: https://raw.githubusercontent.com/WZBSocialScienceCenter/tmtoolkit/master/coverage.svg?sanitize=true
+    :target: https://github.com/WZBSocialScienceCenter/tmtoolkit/tree/master/tests
+    :alt: Coverage status
+
+.. |rtd| image:: https://readthedocs.org/projects/tmtoolkit/badge/?version=latest
+    :target: https://tmtoolkit.readthedocs.io/en/latest/?badge=latest
+    :alt: Documentation Status
diff --git a/doc/source/preprocessing.ipynb b/doc/source/preprocessing.ipynb
index 1c4b085..1112127 100644
--- a/doc/source/preprocessing.ipynb
+++ b/doc/source/preprocessing.ipynb
@@ -3154,7 +3154,7 @@
 {
 "data": {
 "text/plain": [
- "(140408456194312, 140408456194312)"
+ "(139932303304072, 139932303304072)"
 ]
 },
 "execution_count": 79,
@@ -3258,7 +3258,7 @@
 {
 "data": {
 "text/plain": [
- "(140408456194480, 140408456194312)"
+ "(139931125296264, 139932303304072)"
 ]
 },
 "execution_count": 84,
@@ -3609,31 +3609,31 @@ "
\n", " \n", " \n", - " \n", - " \n", + " \n", + " \n", " \n", " \n", - " \n", - " \n", - " \n", - " \n", - " \n", - " \n", - " \n", - " \n", - " \n", - " \n", - " \n", - " \n", - " \n", - " \n", - " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", + " \n", " \n", - " \n", - " \n", - " \n", - " \n", - " \n", + " \n", + " \n", + " \n", + " \n", + " \n", " \n", "
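- | doc | position | token | meta_length | meta_upper
- ▪▪▪▪▪▪▪▪▪▪▪▪▪▪▪▪▪▪▪▪
+ | doc | position | token | meta_upper | meta_length
+ ▪▪▪▪▪▪▪▪▪▪▪▪▪▪▪▪▪▪▪▪
- 0 | NewsArticles-1 | 0 | Betsy | 5 | 0
- 1 | NewsArticles-1 | 1 | DeVos | 5 | 0
- 2 | NewsArticles-1 | 2 | Confirmed | 9 | 0
- 3 | NewsArticles-1 | 3 | as | 2 | 0
- 4 | NewsArticles-1 | 4 | Education | 9 | 0
- 5 | NewsArticles-1 | 5 | Secretary | 9 | 0
- 6 | NewsArticles-1 | 6 | , | 1 | 0
- 7 | NewsArticles-1 | 7 | With | 4 | 0
- 8 | NewsArticles-1 | 8 | Pence | 5 | 0
- 9 | NewsArticles-1 | 9 | Casting | 7 | 0
- 10 | NewsArticles-1 | 10 | Historic | 8 | 0
- 11 | NewsArticles-1 | 11 | Tie-Breaking | 12 | 0
- 12 | NewsArticles-1 | 12 | Vote | 4 | 0
- 13 | NewsArticles-1 | 13 | Michigan | 8 | 0
- 14 | NewsArticles-1 | 14 | billionaire | 11 | 0
+ 0 | NewsArticles-1 | 0 | Betsy | 0 | 5
+ 1 | NewsArticles-1 | 1 | DeVos | 0 | 5
+ 2 | NewsArticles-1 | 2 | Confirmed | 0 | 9
+ 3 | NewsArticles-1 | 3 | as | 0 | 2
+ 4 | NewsArticles-1 | 4 | Education | 0 | 9
+ 5 | NewsArticles-1 | 5 | Secretary | 0 | 9
+ 6 | NewsArticles-1 | 6 | , | 0 | 1
+ 7 | NewsArticles-1 | 7 | With | 0 | 4
+ 8 | NewsArticles-1 | 8 | Pence | 0 | 5
+ 9 | NewsArticles-1 | 9 | Casting | 0 | 7
+ 10 | NewsArticles-1 | 10 | Historic | 0 | 8
+ 11 | NewsArticles-1 | 11 | Tie-Breaking | 0 | 12
+ 12 | NewsArticles-1 | 12 | Vote | 0 | 4
+ 13 | NewsArticles-1 | 13 | Michigan | 0 | 8
+ 14 | NewsArticles-1 | 14 | billionaire | 0 | 11
- 2,452,721 | NewsArticles-999 | 589 | article | 7 | 0
- 2,452,722 | NewsArticles-999 | 590 | was | 3 | 0
- 2,452,723 | NewsArticles-999 | 591 | n't | 3 | 0
- 2,452,724 | NewsArticles-999 | 592 | funny | 5 | 0
- 2,452,725 | NewsArticles-999 | 593 | ? | 1 | 0
+ 2,452,721 | NewsArticles-999 | 589 | article | 0 | 7
+ 2,452,722 | NewsArticles-999 | 590 | was | 0 | 3
+ 2,452,723 | NewsArticles-999 | 591 | n't | 0 | 3
+ 2,452,724 | NewsArticles-999 | 592 | funny | 0 | 5
+ 2,452,725 | NewsArticles-999 | 593 | ? | 0 | 1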
\n", "