GPU implementation of block transforms