screen scraping as a hacking method?

I’m engaged in a low stakes hacking war with an unknown party who is messing with one of my java web start applications. The available evidence suggests that the most likely way this hack works is by interpreting the pictures on the screen, and feeding in mouse clicks. This can’t be a simple replay attack - some pretty sophisticated image processing must be involved, and the interpreter has to be coupled with a custom back-end robot. All in all, an improbably sophisticated package for this hack, which is definitely “just for fun” not profit or malice.

My question is what toolkits might be the basis for such a system?