well, the thing is that when you do the LRGB merge onto your RGB image, you are replacing the implicit L channel of the RGB with the L image. if the two images do not have similar histograms, the merged result is going to look off, because the implicit L that the colour data was stretched alongside will be dimmer or brighter than the L image that replaces it.
using STF to compute the HT for both images virtually ensures that the resulting images have very similar histograms, since the auto-stretch aims both at the same target background level. that's all.
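
to make that concrete, here is a rough python/numpy sketch of the idea, not what STF actually does internally. it uses the standard midtones transfer function and picks the midtones balance so that each image's median lands on the same target. the target of 0.25, the names `mtf` and `auto_stretch`, and the synthetic data are all just assumptions for illustration, and the shadows clipping that a real auto-stretch also applies is left out.

```python
import numpy as np

def mtf(m, x):
    """Midtones transfer function: maps 0 -> 0, m -> 0.5, 1 -> 1."""
    x = np.asarray(x, dtype=float)
    return ((m - 1.0) * x) / ((2.0 * m - 1.0) * x - m)

def auto_stretch(img, target_median=0.25):
    """Stretch img (values in [0, 1]) so its median lands on target_median.

    Hypothetical helper: a simplified stand-in for an STF-style auto-stretch,
    without shadows clipping.
    """
    med = float(np.median(img))
    # Midtones balance m chosen so that mtf(m, med) == target_median.
    m = mtf(target_median, med)
    return mtf(m, img)

# Synthetic linear data (assumed): the L frame is brighter than the
# lightness implied by the RGB frame. Odd sample count so the median
# is an actual sample value.
rng = np.random.default_rng(0)
base = 0.05 * rng.random(10001) ** 4
lum = base                 # stand-in for the L image
rgb_lightness = 0.4 * base # stand-in for the implicit L of the RGB

lum_stretched = auto_stretch(lum)
rgb_stretched = auto_stretch(rgb_lightness)

print("medians before:", np.median(lum), np.median(rgb_lightness))
print("medians after: ", np.median(lum_stretched), np.median(rgb_stretched))
```

the two medians start out different and end up at the same target after the stretch. this only matches the medians, so the histograms come out similar rather than identical, which is all you need before the merge.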