fix segfault on large e.g. 600x600x600 simulations by fixing MatGemini to write proper scalar datasets, and h5fortran to verify scalar is read into scalar as array into scalar segfaults.
Also switched allocatable actual args to have intent(inout) dummy args, as forced reallocation results otherwise, which is unintended but Fortran standard 2003 behavior.
* Added our own FindMPI as the vendor FindMPI is frequently a source of trouble as like vendor FindHDF5, it covers a large range of rare/old systems, and fails to work on our more up to date systems
* ci: added JUnit dash demo
* Find*.cmake: works better with CentOS packages
* numerous magcalc improvements, and add magcalc.run frontend
* remove Apple Silicon M1 workaround in anticipation of HWLOC 2.6, which we contributed to handling this and future Apple Silicon properly.