Unfortunately not, though I forgot about SIMD! It doesn’t seem to support arbitrary-sized matrices or arrays out of the box, though I guess I could index the vector type myself. Still, it doesn’t offer the operations I’d like, as far as I can tell.
Thanks though!
The closest thing I’ve been able to find so far (which seems to have been under slow development by 1-2 contributors for the past couple years) is https://github.com/MPSQUARK/BAVCL which is based on ILGPU. I’ll probably be keeping an eye on it though.