Chatterbot-corpus

Latest version: v1.2.2

1.1.0

- Additional dialog contributions from the community
- Removed the math and swear-word data files

1.0.1

**Bug fixes**

- Fix where category is stored https://github.com/gunthercox/chatterbot-corpus/pull/39

**Testing**

- Add line length test to ensure all dialog is under 400 characters https://github.com/gunthercox/chatterbot-corpus/pull/36
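A check like this can be sketched in a few lines of Python. This is only an illustration of the idea, not the project's actual test: the function name `find_long_lines` and the nested-list shape of `sample` are assumptions made for the example, and "under 400 characters" is read as strictly fewer than 400.

```python
MAX_DIALOG_LENGTH = 400  # limit stated in the release note


def find_long_lines(conversations, limit=MAX_DIALOG_LENGTH):
    """Return (conversation_index, statement_index, length) for every
    statement that is not strictly shorter than the limit.

    `conversations` is assumed to be a list of conversations, each of
    which is a list of strings (hypothetical shape for this sketch).
    """
    problems = []
    for c_index, conversation in enumerate(conversations):
        for s_index, statement in enumerate(conversation):
            if len(statement) >= limit:
                problems.append((c_index, s_index, len(statement)))
    return problems


# Hypothetical sample data: the last statement is exactly 400 characters.
sample = [
    ["Hello", "Hi there!"],
    ["How are you?", "x" * 400],
]
violations = find_long_lines(sample)
```

Returning the positions of the offending statements, rather than a bare pass/fail flag, makes it easier to report exactly which dialog line needs shortening.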

1.0.0

**Breaking changes**

- Change the format of corpus files from JSON to YAML (`.yml`) https://github.com/gunthercox/chatterbot-corpus/pull/25
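To give a feel for what such a migration involves, here is a rough sketch that converts one JSON document into YAML text. Both layouts are assumptions made for illustration (a JSON object keyed by corpus name, and a YAML file with `categories` and `conversations` keys); the release note does not spell either out. The emitter is hand-written so the example needs only the standard library, whereas a real conversion would more likely use a YAML library.

```python
import json


def corpus_to_yaml(conversations, categories):
    """Render a corpus as YAML text.

    Strings are quoted with json.dumps, whose output is also a valid
    YAML double-quoted scalar. Assumes every conversation has at least
    one statement.
    """
    lines = ["categories:"]
    for category in categories:
        lines.append("- " + json.dumps(category))
    lines.append("conversations:")
    for conversation in conversations:
        for index, statement in enumerate(conversation):
            # The first statement opens a new nested list item.
            prefix = "- - " if index == 0 else "  - "
            lines.append(prefix + json.dumps(statement))
    return "\n".join(lines) + "\n"


# Hypothetical old-style JSON corpus: one key naming the corpus.
old_json = '{"greetings": [["Hello", "Hi"], ["How are you?", "I am fine."]]}'
data = json.loads(old_json)
name, conversations = next(iter(data.items()))
yaml_text = corpus_to_yaml(conversations, [name])
print(yaml_text)
```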

**Features**

- Allow corpus name to be accessed as a category https://github.com/gunthercox/chatterbot-corpus/pull/26
- Add support for multiple category tags for a corpus https://github.com/gunthercox/chatterbot-corpus/pull/28
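The two features above can be pictured with a small lookup sketch. The data structure and the helper `corpora_in_category` are hypothetical and are not part of the `chatterbot-corpus` package; they only illustrate a corpus matching either by its own name or by any of several category tags.

```python
# Hypothetical in-memory view of three corpora and their category tags.
corpora = [
    {"name": "greetings", "categories": ["conversation"]},
    {"name": "trivia", "categories": ["facts", "conversation"]},
    {"name": "science", "categories": ["facts"]},
]


def corpora_in_category(corpora, category):
    """Return the names of corpora whose own name or any tag matches."""
    matches = []
    for corpus in corpora:
        if category == corpus["name"] or category in corpus["categories"]:
            matches.append(corpus["name"])
    return matches


by_tag = corpora_in_category(corpora, "conversation")
by_name = corpora_in_category(corpora, "science")
```

Here `by_tag` collects every corpus tagged `conversation`, while `by_name` shows the corpus name itself acting as a category.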

**Testing**

- Add test cases to verify that file paths to custom corpora can be specified https://github.com/gunthercox/chatterbot-corpus/pull/27

0.0.2

- Various corpus contributions

0.0.1

First Release of the ChatterBot Corpus

This is the first release of this dialog corpus module, which was originally part of the [ChatterBot](https://github.com/gunthercox/ChatterBot) package.

Splitting this data out into its own module is intended to speed up the release of new training data to ChatterBot users, because `chatterbot-corpus` can now be updated independently of the `chatterbot` module.

In addition, this change helps make the corpus dialog data more widely available to other projects. As this project moves forward, I hope to make this collection of data as useful as possible to the community of Python developers, and possibly to developers working in other programming languages as well.

Additional changes

- Add a Bangla corpus, contributed by orinify https://github.com/gunthercox/chatterbot-corpus/pull/10
