Indoxgen

Latest version: v0.1.0

Safety actively analyzes 681775 Python packages for vulnerabilities to keep your Python projects secure.

Scan your dependencies

0.0.9

🔄 Hybrid Synthesis: LLM + GAN

The headline feature of this release is hybrid synthesis, enabling you to generate both textual and tabular data in a unified, streamlined process:

- **LLM for Text Generation**: Use advanced language models to generate contextually rich and realistic text.
- **GAN for Tabular Data**: Produce high-quality numerical and categorical tabular data with GANs.
- **Seamless Integration**: Effortlessly combine both approaches into a single synthesis pipeline for consistent dataset generation.

🛠️ New `TextTabularSynth` Class

We’re introducing the `TextTabularSynth` class to manage this hybrid synthesis process. Key features include:

- Integration of both LLM and GAN setups.
- A unified interface to streamline synthetic data generation.
- Ensures consistency between textual and tabular data.

📊 Enhanced Customization

- **Diversity Control**: Adjust text diversity with the `diversity_threshold` parameter.
- **Flexible LLM Integration**: Select your preferred LLM models for both generation and judging.
- **GAN Configuration**: Detailed control over GAN architecture, training parameters, and more.

🤝 Feedback and Support
We value your input! If you encounter any issues or have suggestions for future enhancements, please feel free to open an issue on our [GitHub repository](https://github.com/IndoxGen/IndoxGen) or contact our support team.

Thank you for your continued support and trust in IndoxGen. We’re excited to see how the new hybrid synthesis capability will enhance your synthetic data generation projects!

Happy synthesizing!

The IndoxGen Team

0.0.3

Documentation

Check out our [README](https://github.com/osllmai/indoxGen#quick-start-guide) for detailed examples and usage instructions:
- Standard `SyntheticDataGenerator`
- Advanced `SyntheticDataGeneratorHF` with human feedback integration
- Prompt-based generation using `DataFromPrompt`

Join the Community

We invite you to join our [Discord community](https://discord.com/invite/ossllmai) to share your experiences, get support, and be a part of our growing community of synthetic data enthusiasts.

Share Your Feedback

Your feedback continues to be crucial in helping us improve IndoxGen. Please report any issues or feature requests via our [GitHub Issues](https://github.com/osllmai/indoxGen/issues) page.

Thank you for your continued support as we strive to optimize and expand IndoxGen's capabilities!

0.0.2

Getting Started

Check out our [README](https://github.com/osllmai/indoxGen#quick-start-guide) for quick start examples using:
- Basic `SyntheticDataGenerator`
- Advanced `SyntheticDataGeneratorHF` with human feedback
- Prompt-based generation using `DataFromPrompt`

Community

Join our [Discord community](https://discord.com/invite/ossllmai) for support and discussions.

Feedback

As this is our initial release, your feedback is crucial! Please report any issues or feature requests on our [GitHub Issues](https://github.com/osllmai/indoxGen/issues) page.

Thank you for your interest in IndoxGen. We're excited to embark on this journey of empowering data-driven innovation with advanced synthetic data generation!

Links

Releases

Has known vulnerabilities

© 2024 Safety CLI Cybersecurity Inc. All Rights Reserved.