Typical question from students in a graduate Numerical Linear Algebra class: "I understand the left-looking and right-looking variants of this operation. But how does one find different variants for a new operation?" Different algorithms are required when targeting different layers of the memory hierarchy.
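To make the two named variants concrete, here is a minimal sketch that assumes the operation is an unblocked Cholesky factorization A = L L^T. The text does not say which operation is meant, so that choice, the NumPy dependency, and the function names cholesky_right_looking and cholesky_left_looking are all illustrative assumptions. Both functions compute the same factor; they differ only in when the updates are applied.

```python
import numpy as np


def cholesky_right_looking(A):
    """Right-looking (eager) variant: after finishing column j,
    immediately update the whole trailing submatrix to its right."""
    L = np.tril(np.array(A, dtype=float))  # work on a copy of the lower triangle
    n = L.shape[0]
    for j in range(n):
        L[j, j] = np.sqrt(L[j, j])
        L[j + 1:, j] /= L[j, j]
        # Rank-1 update of the trailing lower triangle.
        col = L[j + 1:, j]
        L[j + 1:, j + 1:] -= np.tril(np.outer(col, col))
    return L


def cholesky_left_looking(A):
    """Left-looking (lazy) variant: before finishing column j,
    apply the updates from all columns to its left."""
    L = np.tril(np.array(A, dtype=float))  # work on a copy of the lower triangle
    n = L.shape[0]
    for j in range(n):
        # Pull in contributions from the already computed columns 0..j-1.
        L[j:, j] -= L[j:, :j] @ L[j, :j]
        L[j, j] = np.sqrt(L[j, j])
        L[j + 1:, j] /= L[j, j]
    return L


if __name__ == "__main__":
    # Small symmetric positive definite test matrix (illustrative data).
    A = np.array([[4.0, 2.0, 2.0],
                  [2.0, 5.0, 3.0],
                  [2.0, 3.0, 6.0]])
    L_right = cholesky_right_looking(A)
    L_left = cholesky_left_looking(A)
    assert np.allclose(L_right, L_left)
    assert np.allclose(L_right @ L_right.T, A)
```

The right-looking loop writes to the entire trailing submatrix in every iteration, while the left-looking loop reads the previously computed columns and writes only to the current one. That difference in memory access pattern is one reason different variants suit different layers of the memory hierarchy.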