Vertex cache shenanigans

Hello.

I wrote a small test the other day which was supposed to calculate the size of the vertex cache of the GPU, but I got some very surprising results which indicate that the vertex cache isn’t working as, well, everyone expects. I’ve thrown together a small test program which does some indexed draw calls and uses ARB_pipeline_statistics_query to check the number of resulting vertex shader invocations, and then outputs its findings to a log file. I am EXTREMELY interested in knowing what kind of results people get on other hardware than my GTX 770, especially on AMD cards.

Here’s the entirety of the test source code (only requires LWJGL3): http://www.java-gaming.org/?action=pastebin&id=1475
Here’s a precompiled jar (may not run on Mac): https://drive.google.com/open?id=0B0dJlB1tP0QZbTc5ZExJeENOMWM

Please run the jar (or compile the test yourself) and post the contests of the generated log file in this thread! Although the program prints the GL_RENDERER string the driver returns, it may not show the exact GPU you have, so if possible include that information as well.

Thanks for your attention! The results of this test could heavily impact how meshes should be optimized for vertex caches!