* GrB_Matrix_reduce_BinaryOp: removed. Use a monoid instead, with
GrB_reduce or GrB_Matrix_reduce_Monoid.
* (23) bug fix: GrB_Matrix_apply_BinaryOp1st and 2nd were using the
wrong descriptors for GrB_INP0 and GrB_INP1.
* memory pool added for faster memory allocation
* calloc no longer used: using malloc+memset(0) instead
* realloc option: if not available, uses malloc+memcpy+free
* GrB MATLAB interface ported to MATLAB R2021a.
* GxB_Matrix_diag: construct a diagonal matrix from a vector
* GxB_Vector_diag: extract a diagonal from a matrix
* concat/split: added methods to concatenate and split matrices
* import/export: size of arrays now in bytes, not entries.
A new parameter, is_uniform, has been added to all import/export
methods, which indicates that the matrix values are all the same.
* (22) bug fix: SIMD vectorization was missing reduction(+,task_cnvals)
in GB_dense_subassign_06d_template.c. Caught by Jeff Huang, Texas A&M,
with his software package for race-condition detection. Good catch!
v4.0.4_GrAPL
Draft version of v4.0.4, used for results in GraAPL'21 submission.
(update: I seemed to have inadvertently left this version in Debug mode. To fix this, edit the CMakeLists.txt file, and comment out or delete line 88, "set ( CMAKE_BUILD_TYPE Debug )".