flipcode - Theory & Practice

Theory & Practice - Issue 01 - Collision Detection
by (17 May 2000)

Introduction

Ah, collision detection. It's an important part of many games. It's there in Pacman, in Pong, and it's in Quake 3. Collision detection can be used for physics, for restricting movement, for determining whether a weapon or bad guy affects a player, etc. Collision detection methods vary depending on what's required, but there are some general principles that are useful for many types of games. Today, I am going to talk about some of these.

In General

Basically collision detection is a system of determining whether two objects collide. If there are more than two objects, then you can consider pairs of objects individually. What constitutes an "object" varies with implementations.

Most collision detection methods usually deal with temporary positions reflecting where the object(s) are about to be. If a collision occurs with the temporary positions, the movement of the objects could be reconsidered. Perhaps they shouldn't move at all. Perhaps they should move just enough to touch each other. Usually what happens is:

1) Future Position Calculation
2) Collision Detection
3) Collision Handling

Do They Intersect?

In collision detection you usually want to know whether two objects intersect. When we say "collide" we usually assume movement in time. If we have a still picture, then we can say "intersect." Let's consider the simplest case: we have two line segments in 1 dimension, and we need to know whether or not they overlap (Figure 1).

These two segments intersect if min1<max2 and min2<max1. In other words, if the minimums are less than the maximums, the segments intersect. It's useful to remember this as a lemma, because it can be handy in many situations.

From this, you can check in a simple way whether two bounding boxes intersect (Figure 2). If the tops are higher than the bottoms and the lefts are "lefter" than the rights, then the boxes intersect. r 2 dimensions, you need 4 checks. For 3 dimensions you need 6 checks. For n dimensions you need 2n checks.

A short aside: The above checks seem pretty simple, but I've often seen people who aren't sure how to check for bounding box intersection. There are some methods which check whether at least one vertex of one box is inside the other, but these methods are more complex and aren't fullproof. For example, this method would fail in the following case:

Alright, let's move on to bounding circles. The check is a straightforward one: if the distance between the centers of the circles is less than the sum of the radii, then they intersect. Otherwise, they don't. To optimize this check, you can square both sides so you don't have to take any square roots to obtain the distance. (Figure 4)

And then, of course, you have bounding spheres that work on the same principle. If the (3D) distance between the two centers is less than the sum of the radii then the spheres intersect.

Collision in Time

Sometimes you'll want to work with more than simply hypothetical future positions in order to determine whether a collision occurred. This happens when your objects are zipping along, several units of length at a time, and might pass through something briefly, but not enough to wind up in it. Consider the following picture:

Suppose if your object was as small as the circle in the picture, and it moved 10 units each frame. Eventually you might realize that the object is now on the other side of the wall. What you want is to predict, while the object is in position 1, whether the object will go through the wall if it's moved 10 units, and if so, where exactly the object will reach the wall.

Instead of obtaining a solution to this particular problem (with the circle and the wall), I will present some general methods for solving these kinds of problems. This is where mathematical thinking begins to really help.

Think of the prediction problem as a problem in time. Two objects are moving relative to one another. Their positions are functions of time. You want to know exactly when something happens -- for example if they collide. If you find the time that happens, you can find their positions and anything else that depends on time.

What you want is to define some sort of relationship between the two objects that changes as a function of time. Here's what this kind of prediction problem would involve. We have:

P₁(t), P₂(t)	Positions P₁ and P₂ of the two objects as functions of time. (This can be in 1, 2, 3, etc. dimensions)
R(t)	The relationship between positions P₁ and P₂. Since both P₁ and P₂ are functions of time, R(P₁(t), P₂(t)) can just depend on time.
n	Some value that you want the relationship to be.

If you're a bit confused by the generality of these functions, it's okay. Basically it's like this:

The position functions can be anything you want. If an object is moving in a straight line, then you have a linear equation. If you have acceleration, such as gravity, then you might have a quadratic or cubic equation, etc.

Now you want to express the relationship you want in terms of the positions of the objects, and the value you want it to be. If you want two circles to collide, then R(P₁, P₂) would be the distance between P₁ and P₂, squared, and n would be the sum of the radii, squared. Now that you have expressed the collision conditions in terms of the object positions, and the object positions in terms of t, then you can express the entire collision scheme (the conditions) in terms of t. What you get is an equation that corresponds to: R(t) = n

Once you have that, you can usually solve for t, because R(t) is usually a polynomial. Thus you get the time at which the collision occurs. It doesn't even have to be collision, it can be some other event. Up until now you do everything on paper. What you finally put into your program is one, optimized line: the equation for t. It looks something like: t = . . .

Still confused? Let's take a look at some examples.

Example 1

We have two circles, with centers (x₁,y₁), (x₂,y₂) and radii r1, r2. The radii remain constant, but the first circle moves (2, 7) in one second, and the other moves (-1, 2) in one second. The question is, will they collide in that second?

Solution

From this we have:
P₁(t) = (x₁ + 2t, y₁ + 7t)
P₂(t) = (x₂ - t, y₂ + 2t)
Further,
R(t) = [(x₁+ 2t) - (x₂- 2t)]² + [(y₁ + 2t) - (y₂ + t)]² = (x₁ - x₂+ 3t)² + (y₁- y₂+ 5t)² =
(x₁ - x₂)² + (y₁- y₂)² + 3t(2x₁ - 2x₂+ 3t) + 5t(2y₁- 2y₂+ 5t) =
[3²+5²]t² + 2[3(x₁ - x₂) + 5(y₁ - y₂)]t + [(x₁ - x₂)² + (y₁- y₂)²]

This is a quadratic equation in t. We don't even have to solve it for t, because the original question was whether the two circles will collide. So we find the minimum of the function R(t). To do that, we take the derivative of R(t). We get:

R'(t) = 2[3²+5²]t + 2[3(x₁ - x₂) + 5(y₁ - y₂)]

And we solve for t₁ when R'(t₁) = 0 :

t₁ = -[3(x₁ - x₂) + 5(y₁ - y₂)] / [3²+5²]

It is at this point in time that the distance is least between the two centers of the circles. You can visualize this: if the circles start out moving towards each other, for example, over time the distance will decrease and then increase steadily (Figure 7).

We're almost done. First, we see if the distance between the circle centers (squared) at the time t gotten above is less than the sum of the radii (squared). In other words, we check: R(t₁) < (r₁ + r₂)² ?

If the distance between the centers smaller than the sum of the radii at that t, this means the circles collide at that t. Finally, we have to check whether 0 < t < 1. If the question asked, "will the circles ever collide if they continue moving this way forever?" we would be done. But the question asked whether the circles would "collide in that second." Checking whether t is between 0 and 1 is easy. So, if we've gotten this far, then the circles will collide.

Actually, if we analyze the method we used to get the answer, we can get a general solution to the problem of two moving circles. If circle 1 moves (a₁, b₁) units per second, and circle 2 moves (a₂, a₂) units per second, then the minimum distance time will be:

(1) t₁ = -[(a₁ - a₂)(x₁ - x₂) + (b₁ - b₂)(y₁ - y₂)] / [(a₁ - a₂)² + (b₁ - b₂)²]

In our example, a₁ - a₂ = 3 and b₁ - b₂ = 5.
In fact, we see that we don't need to know the movements of the circles in absolute coordinates. We only need to know the movement of the balls relative to each other. If we take the center of circle 1 to be the origin, then the equation for t₁ can be something like:

(2) t₁ = -[ax + by] / [a² + b²]

Compare this to equation (1).

Example 2

We have a circle and a line segment going through it. The circle is centered at (a, b) with radius r. The line segment goes from (x₁, y₁) to (x₂, y₂). Does the line segment intersect the circle?

Solution

Again, we can consider this a time problem. Let's say the line segment is really a point traveling through time. We have to get the time where the point is closest to the center of the circle, and see whether at that time the distance from the point to the circle (squared) is less than the radius of the circle (squared).

P₁(t) = (a, b)
P₂(t) = ( x₁ + [x₂- x₁]t, y₁ + [y₂- y₁]t ) -- where t goes from 0 to 1

In fact, let's use the work we've already done in Example 1. Let's consider the point to be a circle with radius 0. In that case we make use of equation (2).

t₁ = -[(0 - [x₂- x₁])(a - x₁) + (0 - [y₂- y₁])(b - y₂)] / [(0 - [x₂- x₁])² + (0 - [y₂- y₁])²] =
[(x₂- x₁)(a - x₁) + (y₂- y₁)(b - y₂)] / [(x₂- x₁)² + (y₂- y₁])²]

That's it. Solve for t₁, and see if it's between 0 and 1. If it is, plug it into R(t) [which is the distance between P₁(t) and P₂(t), squared] and see if it's less than the radius squared.

By the way, if t₁ isn't between 0 and 1, that means that an infinite line would intersect that circle, but not the line segment, which runs from (x₁, y₁) to (x₂, y₂)

Also, if the distances were relative to the center of the circle, then everything would be shorter to write. Say the circle is centered at (0, 0), has radius r, and the line segment runs from (x₁, y₁) to (x₂, y₂).

t₁ = -[(x₂- x₁)x₁ + (y₂- y₁)y₁] / [(x₂- x₁)² + (y₂- y₁)²]

Note: in this particular case there are easier ways of solving the problem. To find the minimum distance of a line AB to a point C, one could take a random point D on the line, and project CD onto the line perpendicular to AB. The absolute value of the resulting vector would be the minimum distance. This example is just to illustrate the general method of predicting collisions.

Example 3

One more example. This time with bounding boxes. We have a dot that travels from (x₁, y₁) to (x₂, y₂) in one second. We have a box with its center at (a, b), width w and height h. Does the dot go through the box? If so, when is it inside the box?

Solution

This time we'll set things up a bit differently.

P₁(t) = (a, b)
P₂(t) = ( x₁ + [x₂- x₁]t, y₁ + [y₂- y₁]t ) -- where t goes from 0 to 1

We'll have two relationship functions: the horizontal distance from the center of the box and the vertical distance.

R₁(t) = abs( x₁ + [x₂- x₁]t - a ) -- the horizontal distance
R₂(t) = abs( y₁ + [y₂- y₁]t - b ) -- the vertical distance

This time we take no derivatives. Due to the question, we need to find when the dot enters the box and when it leaves. So we solve for t in the following equations:

R₁(t) = w/2
R₂(t) = h/2

For the first equation we get two values for t. This is because of the absolute value operation in R₁(t).

t₁ = (w/2 - x₁ + a) / (x₂- x₁)
t₂ = -(w/2 + x₁ - a) / (x₂- x₁)

This is expected. As the dot travels along the line, it will "enter" and "leave" the horizontal interval of the box. And of course, the path of the dot cannot be vertical, since then the point would make no horizontal progress. This is demonstrated by the fact that, on a vertical line, (x₂- x₁)=0 and dividing by zero is not allowed.
Similarly, for vertical distance,

t₃ = (h/2 - y₁ + b) / (y₂- y₁)
t₄ = -(h/2 + y₁ - b) / (y₂- y₁)

So we have found the times when the dot "enters" and "leaves" the vertical interval of the box. (The path of the dot cannot be horizontal, for the same reasons as stated above.)

We should, as before, check whether t is between 0 and 1. All that remains after that is to see whether the time intervals (t₁, t₂) and (t₃, t₄) overlap. This can be done using everyone's favorite intersection lemma:

If t₁4 and t₃2 then the dot enters the box for a period of time on its way from (x₁, y₁) to (x₂, y₂).

Finally, we can get the start and end of this time period. To do that, we sort t₁, t₂, t₃ and t₄ in order from earliest to latest (or vice versa), and the middle two are the beginning and end of the time period the dot is inside the box.

So, what you do really depends on what you're looking for. If you want to find out when the objects begin to collide, then you solve R(t) = n for t. (You might get more than one t depending on whether R(t) is linear, quadratic, etc.) If you just want to find out whether the objects will collide in the future, you can find out when R(t) is at its minimum by solving for t in R'(t) = 0 and only check that 'minimum' case. The derivative of R(t) is simpler than R(t); for example, if R(t) is quadratic in t then R'(t) is linear.

Summing Up So Far

Here ends the first part on collision detection. We went rather quickly, but I hope you're left with some general ideas in your mind regarding general principles of collision detection. Perhaps they will help you implement the specialized collision detection you'll need in your projects.

A note: There was quite a bit of math in this discussion, but don't be put off by that fact. You don't usually have to understand all the math to grasp the underlying idea. Usually, the math in these discussions won't be very involved, and I will talk more about programming techniques. However, understanding the math behind these techniques provides a good foundation. I have found that, often, math comes in very handy in solving problems of this sort; it serves to organize your thoughts and illustrate the relationships between different qualities.

Next time, I will explore some common collision detection situations involving polygons, and the methods of solving them. Please drop me a line, and tell me what you thought. And stay tuned for the next issue :)

Links And Acknowledgements

Here's an article on Gamasutra by Jeff Lander, entitled "When Two Hearts Collide..." (Bounding Boxes)

This article, by Nick Bobic, talks about some advanced topics in collision detection, such as hierarchy trees.

Article Series:

Theory & Practice - Issue 00 - Introduction
Theory & Practice - Issue 01 - Collision Detection
Theory & Practice - Issue 02 - Collision Detection - Part 2
Theory & Practice - Issue 03 - Curved Surfaces
Theory & Practice - Issue 04 - Curved Surfaces, Part II
Theory & Practice - Issue 05 - Landscapes
Theory & Practice - Issue 06 - Event Handling Model