PG needs for age numbers need to be there somewhere because without them
there's no future hope for controlled/moderated text refinement. We need them
to match up the canonical text with the canonical image and quickly verify that
a proposed correction is legitimate.

Whether the page number needs to be included in the downloaded "plain text"
version, or whether the "plain text" version should be the canonical version are
separate matters.