Fast Math

Nate · September 1, 2009, 7:16pm

Here is my class for some fast math. Some of it was discussed here.

import java.lang.reflect.Method;

/**
 * Utility and fast math functions.
 * 
 * Thanks to:

 * Riven on JavaGaming.org for sin/cos/atan2 tables.

 * Roquen on JavaGaming.org for random numbers.

 * pjt33 on JavaGaming.org for fixed point.

 * Jim Shima for atan2_fast.

 */
public class MathUtil {
	static public final float PI = 3.1415926535897932384626433832795028841971f;

	static private final int SIN_BITS = 12; // Adjust for accuracy.
	static private final int SIN_MASK = ~(-1 << SIN_BITS);
	static private final int SIN_COUNT = SIN_MASK + 1;

	static private final float radFull = PI * 2;
	static private final float degFull = 360;
	static private final float radToIndex = SIN_COUNT / radFull;
	static private final float degToIndex = SIN_COUNT / degFull;

	static public final float[] sin = new float[SIN_COUNT];
	static public final float[] cos = new float[SIN_COUNT];
	static {
		for (int i = 0; i < SIN_COUNT; i++) {
			float a = (i + 0.5f) / SIN_COUNT * radFull;
			sin[i] = (float)Math.sin(a);
			cos[i] = (float)Math.cos(a);
		}
	}

	static public final float sin (float rad) {
		return sin[(int)(rad * radToIndex) & SIN_MASK];
	}

	static public final float cos (float rad) {
		return cos[(int)(rad * radToIndex) & SIN_MASK];
	}

	static public final float sin (int deg) {
		return sin[(int)(deg * degToIndex) & SIN_MASK];
	}

	static public final float cos (int deg) {
		return cos[(int)(deg * degToIndex) & SIN_MASK];
	}

	private static final int ATAN2_BITS = 7; // Adjust for accuracy.
	private static final int ATAN2_BITS2 = ATAN2_BITS << 1;
	private static final int ATAN2_MASK = ~(-1 << ATAN2_BITS2);
	private static final int ATAN2_COUNT = ATAN2_MASK + 1;
	private static final int ATAN2_DIM = (int)Math.sqrt(ATAN2_COUNT);
	private static final float INV_ATAN2_DIM_MINUS_1 = 1.0f / (ATAN2_DIM - 1);
	private static final float[] atan2 = new float[ATAN2_COUNT];
	static {
		for (int i = 0; i < ATAN2_DIM; i++) {
			for (int j = 0; j < ATAN2_DIM; j++) {
				float x0 = (float)i / ATAN2_DIM;
				float y0 = (float)j / ATAN2_DIM;
				atan2[j * ATAN2_DIM + i] = (float)Math.atan2(y0, x0);
			}
		}
	}

	public static final float atan2 (float y, float x) {
		float add, mul;
		if (x < 0) {
			if (y < 0) {
				y = -y;
				mul = 1;
			} else
				mul = -1;
			x = -x;
			add = -3.141592653f;
		} else {
			if (y < 0) {
				y = -y;
				mul = -1;
			} else
				mul = 1;
			add = 0;
		}
		float invDiv = 1 / (((x < y) ? y : x) * INV_ATAN2_DIM_MINUS_1);
		int xi = (int)(x * invDiv);
		int yi = (int)(y * invDiv);
		return (atan2[yi * ATAN2_DIM + xi] + add) * mul;
	}

	/**
	 * This is a very fast atan2, but not very accurate.
	 */
	static public float atan2_fast (float y, float x) {
		// From: http://dspguru.com/comp.dsp/tricks/alg/fxdatan2.htm
		float abs_y = y < 0 ? -y : y;
		float angle;
		if (x >= 0)
			angle = 0.7853981633974483f - 0.7853981633974483f * (x - abs_y) / (x + abs_y);
		else
			angle = 2.356194490192345f - 0.7853981633974483f * (x + abs_y) / (abs_y - x);
		return y < 0 ? -angle : angle;
	}

	static private Method sqrtMethod;
	static {
		try {
			sqrtMethod = Class.forName("android.util.FloatMath").getMethod("sqrt", new Class[] {float.class});
		} catch (Exception ex) {
			try {
				sqrtMethod = Class.forName("java.lang.Math").getMethod("sqrt", new Class[] {double.class});
			} catch (Exception ignored) {
			}
		}
	}

	static public final float sqrt (float value) {
		try {
			return ((Number)sqrtMethod.invoke(null, value)).floatValue();
		} catch (Exception ex) {
			throw new RuntimeException(ex);
		}
	}

	/**
	 * Fixed point multiply.
	 */
	static public int multiply (int x, int y) {
		return (int)((long)x * (long)y >> 16);
	}

	/**
	 * Fixed point divide.
	 */
	static public int divide (int x, int y) {
		return (int)((((long)x) << 16) / y);
	}

	static private int randomSeed = (int)System.currentTimeMillis();

	/**
	 * Returns a random number between 0 (inclusive) and the specified value (inclusive).
	 * @param range Must be >= 0.
	 */
	static public final int random (int range) {
		int seed = randomSeed * 1103515245 + 12345;
		randomSeed = seed;
		return ((seed >>> 15) * (range + 1)) >>> 17;
	}

	static public final int random (int start, int end) {
		int seed = randomSeed * 1103515245 + 12345;
		randomSeed = seed;
		return (((seed >>> 15) * ((end - start) + 1)) >>> 17) + start;
	}

	static public final boolean randomBoolean () {
		int seed = randomSeed * 1103515245 + 12345;
		randomSeed = seed;
		return seed > 0;
	}

	static public final float random () {
		int seed = randomSeed * 1103515245 + 12345;
		randomSeed = seed;
		return (seed >>> 8) * (1f / (1 << 24));
	}

	static public int nextPowerOfTwo (int value) {
		return 1 << (32 - Integer.numberOfLeadingZeros(value - 1));
	}
}

Couple notes:

For sqrt use FloatMath.sqrt, it is fast. The sqrt method here will call FloatMath.sqrt on Android and Math.sqrt on the desktop.
Don’t bother with fixed point, for the most part float is faster. Usually you only need fixed point for Open GL ES, and in that case the multiple and divide methods can be useful. You can add and subtract fixed point numbers the normal way.

Abuse · October 23, 2009, 12:05am

Worth noting none of those random number methods are thread safe.
Not a bad thing so long as it’s documented - though opting for a static utility class compounds the problem, as the application programmer cannot avoid the limitation by using a different instance in each thread.

Nate · October 23, 2009, 2:27am

Good point. When it does occur, it just means both threads get the same “random” value. I prefer putting up with this flaw to keep the methods static. You’re right, it should be documented.

pjt33 · October 23, 2009, 7:03am

If you’re really worried, xor the result with Thread.currentThread().hashCode()

Roquen · October 23, 2009, 8:58am

WRT: The RNG and multiple threads.

The worst case scenario is a variant of the following: Thread 1 reads the seed and is interrupted by Thread 2. Thread 2 reads the seed and is interrupted by Thread 3. Thread 1 comes back to life and generates some random numbers. Thread 2 wakes up and effectively moves the position in sequence to one past where we were at the beginning. Something to worry about? Not really. The window of opportunity is very narrow (about 5 opcodes) and must happen a couple of times in concert. The work being performed on Thread 1 must be something where this backward jump in the sequence would be possibly noticable. I’m more worry about a duck with a piece of tinsel in its bill landing on my head under a clear blue sky and then getting struck by lightning. Possible? Sure, but I’m not losing any sleep over it.

I’d add the core 32-bit generator as well for ranges of 0 to (2^n)-1. Trade two shifts, a multiply and add of the ranged version for a single shift.

WRT: Sin/Cos. You could collapse the two tables into 1 at the cost of making one of functions slightly more expense (a constant add).

Riven · October 23, 2009, 9:27am

So you’re suggesting to make my FastMath code a tad slower?

Roquen · October 23, 2009, 9:41am

I’ve gotten the impression that many people are running to memory limitiation issues with their apps. So sure it’s only a few K of savings, but if your at the wall…you gotta trim. Choosing to make one of the them slower is an option. It’s nice to consider before hand so that you can favor one over the other when the choice doesn’t matter.

Riven · October 23, 2009, 9:50am

Adjust:
static private final int SIN_BITS = 12; // Adjust for accuracy.
if you really want to reduce memory usage.

Roquen · October 23, 2009, 11:53am

Sure. You can lower the value by one and cut your memory usage in half. Your suggestion doubles the max error, mine increases the runtime cost of one of the functions. Of these two choices, the “best” option is game specific.

It might be worth checking if something like the following is faster on Android when you do need both:


static void sincos(FloatPair r, Float rad)
{
  int id = (int)(rad * radToIndex);

  r.a = sin[id & SIN_MASK];
  r.b = sin[(id + COS_OFF) & SIN_MASK];
}

CommanderKeith · November 3, 2009, 1:35pm

I was browsing through this thread and the other one (http://www.java-gaming.org/index.php/topic,20972.0.html) to find out how expensive Math.sqrt is compared to a normal plus/multiply/divide operation and couldn’t find a benchmark but there’s a good article here for anyone interested:

http://metafessional.com/blog/software-development/java-math-performance.html

clientVM	+	-	*	/	sqrt	sin	cos
integer	16	16	15	26
long	30	30	30	120
float	15	15	16	30
double	29	26	27	52	242	4205	4205
fixed	13	12	40	191	427	60	59

serverVM	+	-	*	/	sqrt	sin	cos
integer	21	115	14	21
long	24	25	25	106
float	12	12	12	29
double	24	24	25	54	87	3317	3260
fixed	14	13	32	189	314	57	64

Riven · November 3, 2009, 2:02pm

_{Pssst, your values are under the wrong columns}

Roquen · November 3, 2009, 2:45pm

This table looks very questionable. Fixed point +/- faster than integer? When it’s the same op? And the integer subtract on the serverVM? The sampling method must have been flawed.

CommanderKeith · November 3, 2009, 3:42pm

Hmm, yeah those numbers do look a bit suspect.

Sorry for quoting a dodgy reference I should have thought twice when I noticed that he didn’t post his benchmark code.

Mr_Gol · November 3, 2009, 4:23pm

static public final float PI = 3.1415926535897932384626433832795028841971f;

A float can’t hold that precision.