Added
- Added the Bokmål and Nynorsk part-of-speech tagging (POS) and dependency
parsing (DEP) parts of the Norwegian Dependency Treebank (NDT) dataset. They
can be loaded as `ndt-nb-pos`, `ndt-nn-pos`, `ndt-nb-dep` and `ndt-nn-dep`,
respectively, from both the CLI and the `Benchmark` class.
Removed
- Removed the `EuroparlSubj` and `TwitterSubj` datasets, as they were too easy
and did not meaningfully differentiate between models.
- Removed the abstract `SentimentClassificationBenchmark` and
`BinaryClassificationBenchmark` classes to simplify the class structure. There
is now only one `TextClassificationBenchmark`, which always evaluates with
macro-F1.
Changed
- Renamed `europarl-sent` to `europarl`, as `europarl-subj` no longer exists.
- Changed the `nordial` dataset to the original 4-way classification dataset.