As I understand, when calling draw with a transform in the RenderStates structure, the transform is applied on the GPU. Is this true?
I want to benchmark the point where large number of draw calls breaks even with a large number of transforms applied on individual vertices of a VertexBuffer with a single draw call. Knowing where the transform is performed will help me understand what is going on.