Matrix multiply-add, like DGEMM
BLAS has something like DGEMM which does X = aAB + b*X in one loop. This can be very fast, so allowing direct use of this would be handy.
3
votes
Gen Zhang
shared this idea