- โ
**Data import** from CSV, Excel, JSON, SQL, MongoDB, API
- ๐งน **Data cleaning & transformation** (mean, median, interpolation, log, Box-Cox)
- ๐ **Missing data imputation** using:
- statistical methods
- KNN and regression
- neural networks (autoencoders)
- ๐ **Exploratory and statistical analysis**
- basic statistics
- correlation matrix
- ANOVA
- time series decomposition
- ๐ค **Machine Learning:**
- Classification & regression (decision tree, SVM, KNN, random forest)
- Clustering (KMeans, DBSCAN)
- Anomaly detection (Isolation Forest, LOF)
- ๐ **AutoML support** via:
- AutoSklearn
- TPOT
- Optuna (hyperparameter tuning)
- ๐จ **Visualizations** including heatmaps, PCA, t-SNE, interactive plots
- ๐ **Report generation** in HTML via Jinja2 templates
- ๐ง **Model explanation** using SHAP & LIME (coming in next release)
---
๐ฆ Installation
bash
pip install AquaAgent
---
๐ CLI Example
bash
run-aquaagent odra_dataset.csv --target chlorophyll --mode interactive --model autosklearn --impute knn
---
๐งฌ Use Cases
- Long-term Odra River water quality modeling
- Climate and environment-aware AI modeling
- Teaching and rapid prototyping for environmental data scientists
---
๐ Acknowledgments
Special thanks to the open-source community and to scientific contributors!