There are a surprising number of considerations that go into heatsink design. These include the acceptable maximum operating temperature of the devices you're trying to cool, how well the package of the device transfers heat to whatever is contacting it, how the device is coupled to the heatsink (a place where you can lose a lot of performance!), how many devices you're using, and the maximum load you are intending to support.
When it comes to the heatsink itself, apparently small things can make a significant difference, such as the fin design and surface treatment.
Then there's the weird reality of production costs. Extrusion dies are as a rule quite expensive. Depending on your product's expected production quantity and the cost of materials (aluminum in this case), it might make more sense to use an over-specified heatsink in a low-power design if it means you can use the fruits of the same die in a larger one.
For the same power output, lateral power MOSFETs do need slightly higher power supply rails compared to BJT amplifiers, which means for the same maximum output power they will need more cooling. Exactly how much more depends on implementation. For amplifiers using source follower outputs, especially impactful is whether the driver stage is powered from higher rails than the output stage (best for thermals) or, as was the case with many early MOSFET amps, lower voltage rails (worst for thermals).
So, given all the variables, it's quite difficult to speculate why Hafler (or anyone else) used the heatsinks they did. But I hope this gives you a better understanding of the complexities involved in heatsink design and why they can vary so much.