Improvements
- Add Multi-GPU support using Dask-cuDF
- Add support for reading datasets from S3, GCS and HDFS
- Add 11 new operators: ColumnSimilarity, Dropna, Filter, FillMedian, HashBucket, JoinGroupBy, JoinExternal, LambdaOp, NormalizeMinMax, TargetEncoding and DifferenceLag
- Add HugeCTR integration and an example notebook showing an end to end workflow
- Signicantly faster dataloaders featuring a unified backend between TensorFlow and PyTorch