The problem, of course, is that subjective results are not transitive. This means that if you make something .3dB more intense, most people are going to think it sounds better. And, comparing .3 and .6dB more intense, they will vote for the .6dB. HOWEVER, if you compare the .6 (or .9 or 1.2 or something farther along, depends on source and person) with the original, they will dislike the higher level.
In other words if A>B is true, B>C is true, and C>D is true, that does not mean that A>C or A>D is true.
As to the A-flat remover, it wasn't a particularly good idea in the first place. When we heard about it at Bell Labs in the Acoustics Research Department, the reaction was a mix of amusement and exasperation. The argument that it would take a contrived signal to detect it was at least moderately wrong, and the idea that it would work at all was also pretty strange, given noise modulation in cassettes, distortions in LP's, FM noise, and so on. It was before any perceptual coder (it was an analog system), and perceptual coding would have driven it completely bonkers, too.
No disagreement there JJ, but this just emphasises the point that preference and audibility/quality performance do go hand in hand when talking scientfically.
Although maybe part of the problem is the initial post needs to be expanded because we can all take a slightly different interpretation as it stands and its context; because Harman group is a classic example where their scientific studies do include preference/performance/cognitive sighted-blind as part of a study research paper.
So I accept some of us may be misunderstanding the context of that 1st post.
Cheers
Orb