Fixed - Fix a bug when net have inverse and run inference in eval mode.
2.1.6
Fixed - Fix missing -fopenmp in linker for CPU only Removed - remove stale comment sending in CI
2.1.5
Added - Add cuda profile tool - Add python 36 support Changed - Format all code Removed - remove a unnecessary device sync and slightly improve performance.