First release 🎉
The first release of TextMachina includes:
- **Dataset generators** for detection, attribution, and boundary detection tasks.
- Five **model providers**: Anthropic, Cohere, HuggingFace (local and remote), OpenAI, and Vertex AI.
- Six **extractors** to fill prompt templates: Auxiliary, Entities, Nouns, Sentence prefix, Word prefix, and Combined.
- One **decoding constrainer**: Length constrainer.
- Five **metrics** to assess task difficulty and dataset quality: MAUVE, Perplexity, Repetition, Diversity, and baseline models.
- **Post-processing functions** to improve the quality of the datasets and prevent common biases.
- **CLI** to generate and explore datasets.
- **Configuration examples**, under the folder `etc/examples`, to test different tasks and model providers.
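
To make the extractor idea above more concrete, here is a minimal sketch of how a word-prefix extractor could fill a prompt template. It is not TextMachina's actual API: the function names `word_prefix` and `fill_template`, the `{prefix}` placeholder, and the sample text are all hypothetical and chosen only for illustration.

```python
# Illustrative sketch only: these helpers are hypothetical and are NOT
# part of TextMachina's API.


def word_prefix(text: str, k: int = 5) -> str:
    """Return the first k whitespace-separated words of a text."""
    return " ".join(text.split()[:k])


def fill_template(template: str, **extracted: str) -> str:
    """Fill the named placeholders of a prompt template."""
    return template.format(**extracted)


# Hypothetical human-written sample and prompt template.
human_text = "The committee met on Tuesday to discuss the new budget proposal."
template = "Continue the following text: {prefix}"

prompt = fill_template(template, prefix=word_prefix(human_text, k=5))
print(prompt)
```

The resulting prompt would then be sent to one of the model providers, and the generated continuation would be paired with the human text to build a dataset example.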