Emotional musicality and dynamics are the real magic. Common speaker measurements like 93 dB/2.83 volts at one meter are way overblown. Our ears hear sound differently at every common listening distance. The exception, of course, is one meter itself. About three meters/yards is a very common home listening distance. You're probably pressurizing the smaller room correctly, too.
To me, dynamics at 3 meters or so is the real test. I'd be very interested in comparing speakers tested at 1 meter versus measurements taken at a truer listening distance. Lab tests are often way overblown. IMHO, meeting that one-meter reference spec is helpful - but overrated, much like THD was when it was the buzz years ago. It's a helpful, standardized lab spec to check before we personally listen for musical magic in playback and dynamic performance.
I've found that many "great" full-range or near-full-range speakers with dazzling 1 meter measurements often sound "choked off" or anemic at 3 or so meters. To me, great musical dynamics are best graded the way you did it - purely subjective listening tests using real-world listening distances and environments.
Bravo for going beyond lab specs by using your own ears to judge that special musical MoJo. Specs are a good starting point. But no spec, however well standardized, can overrule the MoJo impressions of real-world listening dynamics.

