I microbenchmarked System.arraycopy() on a bunch of different processors, and to my surprise Core.i5/i7 processors give a throughput 5 or 6 times higher than a Core 2 Duo. It can’t be the difference between DDR2 and DDR3 memory, so I’m wondering if the native code might use some instruction set not available on older processors. Any ideas?