They likely level-matched all the files, using the one with the highest dynamic range as the reference, which is great for direct comparisons among the various test tracks. That way you don't go from a quiet track to being blown out of your chair on the next one. Just set the volume to a comfortable level and commence your test.
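
For anyone curious what that kind of level matching might look like, here is a rough sketch. It is only a guess at the approach, not what they actually did: it assumes plain RMS as the loudness measure and crest factor (peak divided by RMS) as a stand-in for "dynamic range", and the names `rms`, `crest_factor`, and `level_match` are made up for illustration. It also assumes every track has at least one non-zero sample.

```python
import math

def rms(samples):
    # Root-mean-square level of a list of float samples in [-1.0, 1.0].
    return math.sqrt(sum(s * s for s in samples) / len(samples))

def crest_factor(samples):
    # Peak-to-RMS ratio, used here as a simple proxy for dynamic range.
    peak = max(abs(s) for s in samples)
    return peak / rms(samples)

def level_match(tracks):
    # tracks: dict mapping a track name to its list of samples.
    # The track with the highest crest factor becomes the reference.
    ref_name = max(tracks, key=lambda name: crest_factor(tracks[name]))
    target = rms(tracks[ref_name])
    matched = {}
    for name, samples in tracks.items():
        gain = target / rms(samples)  # linear gain that brings this track to the target RMS
        matched[name] = [s * gain for s in samples]
    return ref_name, matched

tracks = {
    "loud": [0.5, -0.5, 0.5, -0.5],
    "dynamic": [0.8, 0.0, 0.0, 0.0],
}
ref_name, matched = level_match(tracks)
```

The reason for picking the most dynamic track as the reference is that, if all the files were peak-normalized to begin with, it has the lowest average level, so every other track gets turned down rather than up and nothing is pushed into clipping. A real tool would more likely use a perceptual loudness measure than raw RMS.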
