Miditok

Latest version: v3.0.5.post1


Page 8 of 11

1.1.9

Changes
* d2a34041bec2748f27016f5e3b4e506ea23ae889 22c15f2f928ba417ef9ed6286b356d08deb860ab When tokenizing, files not found can now be logged
* 1f74e3a57aa28c150fb74ad7934f033091a437fe MASK tokens are now available; use your_tokenizer.vocab.add_mask() to add one to your vocabulary.
* 1f74e3a57aa28c150fb74ad7934f033091a437fe Fix for a possible bug when registering tokens with custom vocabulary indexes and then without them. Indexes are now checked before tokens are registered.
* 2f877654d0a0507ae2d4980325031e87f4ddb740 merge_tracks() now also merges sustain pedal, control change and pitch bend messages with the effects arg, and now handles List[Instrument] as well as MidiFile objects.
* 2e77eec91ea215186727881c57e618705dc72ce8 MIDI-Like token_types_errors() now looks for a Note-Off token for each Note-On token.
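
As an illustration of the Note-On / Note-Off check mentioned in the last item, here is a minimal sketch. It is not MidiTok's implementation: the function name `count_unclosed_note_ons` and the `(type, pitch)` tuple representation of events are assumptions made for this example, and the matching rule is deliberately simple (any later Note-Off with the same pitch closes a Note-On).

```python
from typing import List, Tuple


def count_unclosed_note_ons(events: List[Tuple[str, int]]) -> int:
    """Count Note-On events with no later Note-Off of the same pitch.

    Hypothetical helper: events are (type, pitch) tuples, which is an
    assumed representation, not MidiTok's own event class.
    """
    errors = 0
    for i, (kind, pitch) in enumerate(events):
        if kind != "Note-On":
            continue
        # Look ahead for any Note-Off carrying the same pitch.
        closed = any(
            k == "Note-Off" and p == pitch for k, p in events[i + 1:]
        )
        if not closed:
            errors += 1
    return errors
```

With this sketch, a sequence such as `[("Note-On", 60), ("Note-On", 64), ("Note-Off", 60)]` would report one unclosed note (pitch 64).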

Compatibility
All good !

1.1.8

Changes
* 3c3c5da557aec9d497b2c8b3e68f7e5822e24f3c TimeSignature tokens are implemented for Octuple ! Thanks ilya16 for this great contribution ! These tokens are optional and can be set with the additional_tokens parameter.
* df1edd16d4c3ca2a258de3a2d03d4d99779c8d72 Added a fail-check for Bar / Position based tokenizations, for when a token sequence begins with a Position token before any Bar token.
* 5ab55f42fdcc76298c7f3b8ae049e85608391ef1 Bugfix when loading tokenizer params from config file with tempos.
* 08540a2526cd159ff552155d29aff72d12dfa37a SOS and EOS tokens are no longer assigned the indexes -1 and -2, as this could lead to issues.
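
The Bar / Position fail-check listed above can be pictured with the following sketch. It is only an illustration under assumptions: the function name `starts_with_position_before_bar` and the use of plain strings for token types are hypothetical, and the library's actual check may differ.

```python
from typing import List


def starts_with_position_before_bar(token_types: List[str]) -> bool:
    """Return True if a Position token appears before the first Bar token.

    Hypothetical helper: token types are plain strings ("Bar",
    "Position", ...), an assumed representation for this example.
    """
    for token_type in token_types:
        if token_type == "Bar":
            return False  # a Bar came first, the sequence is well-formed
        if token_type == "Position":
            return True  # a Position with no preceding Bar
    return False  # neither token type found
```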

Compatibility
SOS and EOS tokens saved with v1.1.7 and earlier are no longer compatible.
You can, however, convert them easily: map SOS (-1) to len(tokenizer.vocab) and EOS (-2) to len(tokenizer.vocab) + 1.
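
A minimal sketch of that conversion, assuming token sequences are stored as plain lists of integers. The helper name `remap_sos_eos` and the `vocab_size` parameter are assumptions made for this example; the caller would presumably pass `len(tokenizer.vocab)`.

```python
from typing import List


def remap_sos_eos(tokens: List[int], vocab_size: int) -> List[int]:
    """Remap legacy SOS (-1) and EOS (-2) indexes to the new scheme.

    Hypothetical helper: vocab_size is assumed to be len(tokenizer.vocab),
    as described in the compatibility note.
    """
    mapping = {-1: vocab_size, -2: vocab_size + 1}
    # Tokens other than -1 and -2 are left unchanged.
    return [mapping.get(token, token) for token in tokens]
```

For example, with a vocabulary of 100 tokens, `remap_sos_eos([-1, 5, 7, -2], 100)` yields `[100, 5, 7, 101]`.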

1.1.7

Changes
* 195d549e291745824fe5cc8dc7561ff23f9b9559 Tokenizer classes are renamed: the 'Encoding' suffix is removed. The old class names still work but will be removed in the future (a warning is raised when they are used).
* 195d549e291745824fe5cc8dc7561ff23f9b9559 **[constants](https://github.com/Natooz/MidiTok/blob/main/miditok/constants.py) import modified**: constants must now be accessed as _miditok.constants.A_CONSTANT_
* 3ed553207daf560bbf24607fd7aee1facd29c8e9 The _PAD_ token type is now handled in token_types_errors methods

1.1.6

Changes
* 714f5f559a6d443586c1f8bb7e2536902f0b2d0e Speed up duration / time-shift computations
* 5fe18beca1a4304d3f9c0c2e5121e9e34c5de8e5 Speed up quantization of velocity and tempo values

Special thanks to ilya16 !

1.1.5

Changes
* c7169feb862d1687f8db1df1ff570e915bb52e5c rests no longer append a bar token when crossing a new bar (bugfix)
* 945717847157bb9cc46c5b7a26003e225a4b1f2c fix in token types graph for REMI / CP Word
* 8a6da142f3d20b366ad268d2351ba0aa76dcbe46 events_to_tokens and tokens_to_events are no longer protected methods, so they can be used for debugging

Compatibility
* MIDI files tokenized with REMI and CP Word using **Rests** with v1.1.4 and below might not be compatible, as the decoding process changed (c7169feb862d1687f8db1df1ff570e915bb52e5c)

1.1.4

* 7d1c5bc3e41869068655077eed4c1fe1727569b0 af90e72a16bc89ac1ce48527494a686e192a3b4b Rest detection was inaccurate, now fixed for REMI, CP Word and MIDI-Like
* a2daaa2a61c2cfd8b008b250cb730968354c5b6c Bugfix when using MuMIDI with chords
* [Colab Notebooks !](https://github.com/Natooz/MidiTok/tree/main/colab-notebooks)

