Which one is correct?
Both. The black and white reference masks are
binarized masks. A binary mask is either black (0) or white (1). However, there is an independent mask for each channel of the reference image(s). So if for example there is a white pixel on the red channel of the background mask which is black on the green and blue channels, then you'll see it as pure red as represented on the screen.