Python-ucto

Latest version: v0.6.8

The latest version of python-ucto with no known security vulnerabilities is 0.6.8. We recommend installing version 0.6.8.

The information on this page was curated by experts in our Cybersecurity Intelligence Team.

Latest release
v0.6.8, released Sept. 12, 2024
License
GPL-3.0-only (GNU General Public License v3.0 only)

Description

This is a Python binding to the tokeniser ucto. Tokenisation is one of the first steps in almost any Natural Language Processing task, yet it is not always as trivial as it appears. This binding makes the power of the ucto tokeniser available to Python. Ucto itself is an advanced, extensible, regular-expression-based tokeniser written in C++ (https://languagemachines.github.io/ucto).
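To illustrate why tokenisation is less trivial than it looks, and what "regular-expression based" means in practice, here is a minimal sketch using only Python's standard library. It does not use python-ucto or reproduce ucto's rules. The pattern and the `tokenize` helper are hypothetical and exist only for illustration.

```python
import re

# Illustrative rules only; these are NOT ucto's actual rules.
# Alternatives are tried left to right, so more specific patterns come first.
TOKEN_PATTERN = re.compile(
    r"""
    \d+(?:[.,]\d+)*      # numbers such as 3.50 or 1,000
  | \w+(?:[-']\w+)*      # words, allowing internal hyphens and apostrophes
  | [^\w\s]              # any other single non-space symbol
    """,
    re.VERBOSE,
)


def tokenize(text):
    """Split text into tokens using the illustrative pattern above."""
    return TOKEN_PATTERN.findall(text)


sentence = "Dr. Smith paid $3.50, didn't he?"

# Naive whitespace splitting leaves punctuation glued to words.
print(sentence.split())

# The rule-based pattern separates punctuation but keeps "3.50" and "didn't" whole.
print(tokenize(sentence))
```

Even this small pattern needs ordered rules to keep a decimal number or a contraction in one piece, and it still splits the abbreviation "Dr." into two tokens. Handling cases like that across languages is the kind of work a dedicated tokeniser such as ucto is designed for.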

Vulnerabilities

No known vulnerabilities found

Versions (23)

  • 0.6.8
  • 0.6.7
  • 0.6.6
  • 0.6.5
  • 0.6.4
  • 0.6.3
  • 0.6.2
  • 0.6.1
  • 0.6.0
  • 0.5.3
  • 0.5.2
  • 0.5.1
  • 0.5.0
  • 0.4.7
  • 0.4.5
  • 0.4.4
  • 0.4.3
  • 0.4.2
  • 0.3.0
  • 0.2.4