In references like the university course
Modern Observational Techniques or the
Lowell Observatory Exposure Time Calculator (at the bottom of the page there is a short documentation as a PDF file) some of the noise terms are weighted by an aperture term. In this discussion, every noise term has the same weight, implying a single-pixel aperture. If this happens here for simplification reasons, I have to express my worries:
What is the purpose of dithering? Besides better outlier rejection, it tries to "break the correlation" of noise sources which have spatial correlations. These noise sources live in the spatial domain and have a typical correlation length
l. Such a length term (or an estimate of it) must be present somewhere in the formulas. Remember, in practice we have good dithering if we pick a random direction angle and a random length
d with
d>
l.
In other words, with dithering we benefit from randomization in the spatial domain because we live in the imperfect world of Bayer matrices and confined between the bars of CCD columns. A single-pixel approach forgets about any such spatial correlations. Somewhere a term with units of length should be present, and (in my opinion) even if we don't account for dithering.
I'll end my math/theoretical mumble with the notes of another
course. Lecture 5 may be of interest! My experimentalist part of self agrees with Charles, it takes less time doing it than think about it