GPU profiling?

Like in topic - is it possible to profile GPU just like CPU somehow? I don’t need to know where exactly the bottleneck is, informations about what is causing it should be enough.

There are several options, you can find a list here (click on “links”). For simple timings you can also use ARB_timer_query.