We're excited to announce the first release of DataAnalyzer!
DataAnalyzer is a Python-based tool that streamlines common data analysis tasks: loading data from CSV files, calculating statistics, detecting outliers, cleaning data, and visualizing the results.
New Features
- **Load data from CSV files**
- **Calculate statistics** such as mean, median, mode, and trimmed mean for a specified column
- **Detect outliers** in a specified column using the z-score method
- **Handle missing values** by either dropping or filling them
- **Drop duplicate rows** from the DataFrame
- **Encode categorical features** in the DataFrame
- **Split the data** into training and testing sets for machine learning tasks
- **Visualize data** by plotting a histogram for a specified column
- **Export data** to a new CSV file after processing
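
To give a feel for what the statistics and outlier features compute, here is a minimal sketch using only Python's standard library. It is not DataAnalyzer's own code or API: the function names `trimmed_mean` and `zscore_outliers`, their parameters, and the sample `ages` data are all hypothetical, and the tool itself works on DataFrame columns rather than plain lists.

```python
import statistics


def trimmed_mean(values, proportion=0.1):
    """Mean after dropping `proportion` of the values from each end (hypothetical helper)."""
    if not 0 <= proportion < 0.5:
        raise ValueError("proportion must be in [0, 0.5)")
    ordered = sorted(values)
    k = int(len(ordered) * proportion)  # number of values cut from each tail
    return statistics.fmean(ordered[k:len(ordered) - k])


def zscore_outliers(values, threshold=3.0):
    """Return the values whose absolute z-score exceeds `threshold` (hypothetical helper)."""
    mean = statistics.fmean(values)
    stdev = statistics.pstdev(values)  # population standard deviation
    if stdev == 0:
        return []  # all values are identical, so nothing stands out
    return [v for v in values if abs((v - mean) / stdev) > threshold]


# Made-up sample column with one suspicious value.
ages = [23, 25, 25, 27, 29, 31, 33, 35, 37, 120]

summary = {
    "mean": statistics.fmean(ages),
    "median": statistics.median(ages),
    "mode": statistics.mode(ages),
    "trimmed_mean": trimmed_mean(ages, proportion=0.1),
}
outliers = zscore_outliers(ages, threshold=2.5)

print(summary)
print(outliers)
```

The trimmed mean lands much closer to the median than the plain mean does, because the extreme value is cut before averaging. Note that in a sample this small a single extreme value cannot reach a z-score of 3, which is why the sketch uses a lower threshold.
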
Installation
To install DataAnalyzer, use pip:

    pip install dataanalyzer

See the [README](https://github.com/thomasthaddeus/DataAnalyzer/README.md) for detailed usage instructions.
Feedback
We'd love to hear your feedback! If you have suggestions or run into problems, please [open an issue](https://github.com/thomasthaddeus/DataAnalyzer/issues) on GitHub.
Future Plans
For the next release, we plan to add more statistical calculation methods and to enhance the data visualization capabilities.
Thanks to everyone who contributed to this release!