The hardware allows some rather complicated operations to be performed on the data during the copy; these are not implemented. Only same-format, same-size raw copies are implemented for now.
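As a rough illustration of what a same-format, same-size raw copy could look like, here is a minimal sketch. The `Surface` struct and `RawCopy` function are hypothetical names invented for this example and are not taken from the codebase; the real implementation presumably works on GPU memory rather than on `std::vector` buffers.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical surface description, used only for this sketch.
struct Surface {
    std::uint32_t format;  // opaque format id
    std::uint32_t width;
    std::uint32_t height;
    std::vector<std::uint8_t> data;
};

// Copies src into dst byte for byte. Returns false (and copies nothing) when
// the copy would need a format or size conversion, which is not implemented.
bool RawCopy(const Surface& src, Surface& dst) {
    assert(&src != &dst);  // copying a surface onto itself is not supported here
    if (src.format != dst.format || src.width != dst.width || src.height != dst.height) {
        return false;
    }
    if (src.data.size() != dst.data.size()) {
        return false;
    }
    if (!src.data.empty()) {
        std::memcpy(dst.data.data(), src.data.data(), src.data.size());
    }
    return true;
}
```

Returning `false` instead of attempting a conversion keeps the unimplemented cases visible to the caller.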
Accumulate all arguments before calling the desired method. Note: Maybe we should do the same for the NonIncreasing mode?
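The idea of accumulating arguments can be sketched as follows. This is an illustrative sketch only: `ArgumentAccumulator`, `MethodHandler`, `Begin`, and `Push` are hypothetical names, and the actual change may structure this differently.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <functional>
#include <utility>
#include <vector>

// Hypothetical handler type: receives the method id and all arguments at once.
using MethodHandler =
    std::function<void(std::uint32_t method, const std::vector<std::uint32_t>& args)>;

// Buffers the arguments of one command and invokes the handler a single time,
// once the expected number of arguments has arrived.
class ArgumentAccumulator {
public:
    explicit ArgumentAccumulator(MethodHandler handler) : handler_(std::move(handler)) {
        assert(handler_);  // an empty handler would throw when invoked
    }

    // Starts a new command that expects arg_count arguments.
    // With arg_count == 0 the handler is never called in this sketch.
    void Begin(std::uint32_t method, std::size_t arg_count) {
        method_ = method;
        expected_ = arg_count;
        args_.clear();
        args_.reserve(arg_count);
    }

    // Adds one argument. Returns true if this call triggered the handler.
    bool Push(std::uint32_t value) {
        if (args_.size() >= expected_) {
            return false;  // no command pending; the value is ignored
        }
        args_.push_back(value);
        if (args_.size() < expected_) {
            return false;  // still waiting for more arguments
        }
        handler_(method_, args_);
        expected_ = 0;
        args_.clear();
        return true;
    }

private:
    MethodHandler handler_;
    std::uint32_t method_ = 0;
    std::size_t expected_ = 0;
    std::vector<std::uint32_t> args_;
};
```

The handler sees the complete argument list in one call, so it never has to deal with a partially specified command.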
Also moved the GPU MemoryManager class to video_core since it makes more sense for it to be there.
GPU: Added a command processor to decode the GPU pushbuffers and forward the commands to their respective engines.
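A command processor of this kind might look roughly like the sketch below. The header bit layout (13-bit method, 3-bit subchannel, 13-bit argument count) is an assumption made for illustration, and `Engine`, `DecodeHeader`, `MakeHeader`, and `ProcessPushbuffer` are hypothetical names. Submission modes are ignored here: every command is treated as writing to consecutive methods.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <utility>
#include <vector>

// Hypothetical engine that simply records the (method, value) writes it receives.
struct Engine {
    std::vector<std::pair<std::uint32_t, std::uint32_t>> writes;
    void WriteReg(std::uint32_t method, std::uint32_t value) {
        writes.emplace_back(method, value);
    }
};

struct CommandHeader {
    std::uint32_t method;
    std::uint32_t subchannel;
    std::uint32_t arg_count;
};

// Assumed layout: bits 0-12 method, bits 13-15 subchannel, bits 16-28 argument count.
constexpr CommandHeader DecodeHeader(std::uint32_t word) {
    return {word & 0x1FFFu, (word >> 13) & 0x7u, (word >> 16) & 0x1FFFu};
}

// Inverse of DecodeHeader, convenient for building test pushbuffers.
constexpr std::uint32_t MakeHeader(std::uint32_t method, std::uint32_t subchannel,
                                   std::uint32_t arg_count) {
    return (method & 0x1FFFu) | ((subchannel & 0x7u) << 13) | ((arg_count & 0x1FFFu) << 16);
}

// Walks the pushbuffer, decodes each header, and forwards the arguments to the
// engine bound to the header's subchannel. Returns the number of words consumed;
// processing stops early if a command claims more arguments than remain.
std::size_t ProcessPushbuffer(const std::vector<std::uint32_t>& words,
                              const std::array<Engine*, 8>& engines) {
    std::size_t i = 0;
    while (i < words.size()) {
        const CommandHeader header = DecodeHeader(words[i]);
        if (words.size() - (i + 1) < header.arg_count) {
            break;  // truncated command
        }
        ++i;  // step past the header
        Engine* engine = engines[header.subchannel];
        for (std::uint32_t a = 0; a < header.arg_count; ++a, ++i) {
            if (engine != nullptr) {
                engine->WriteReg(header.method + a, words[i]);
            }
        }
    }
    return i;
}
```

Commands addressed to a subchannel with no bound engine are skipped rather than treated as errors, which keeps the sketch short; a real processor would likely log or assert in that case.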