I tend to think of it more like you have so many peaks and nulls across the entire line due to the range in time delay that they all average out, esp the further away you get from them. And the closer they are, the better they're going to balance out, esp in the top end.
Now, if you're using a single, several-foot tall ribbon tweeter its likely not an issue, since its still a single continuous source.