Re: [gutvol-d] Scan file naming -- another comment

22 Jul 2005

Bowerbird wrote:
> ...
> the overarching rule is to make the system as complex
> as it needs to be, and no more.

The system I propose for scan file naming is *simpler* than yours and
more flexible -- it will handle *any* publisher page numbering/naming
convention one throws at it (even backwards page numbering or where the
same number is used twice) without the use of special prefix letters to
describe which major structure of the book the page numbers belong to.
It also integrates better into the QC system.

Here's your two-part (really three-part) system as best I understand it:

   BookID : PublisherPageID(withalphabetprefixes)

Here's my system:

   BookID : ScanSeq# : PublisherPageID(asis)

In your system, you are asking the person who is naming the files to
not only read the PublisherPageID, but then also prepend some letter (as
needed) to handle the different numbering schemes used in the book
(typically two: Roman and Arabic). In my system, the person only needs
to read the PublisherPageID and enter that without having to figure
out any letter prefixes -- this is easier and more reliable. It will
also handle cases your (and Marcello's) system won't handle, such as
backward-numbered pages and where page numbers are repeated (this
example was actually brought up).
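
To make the comparison concrete, here is a minimal sketch of the three-field name. The "_" separator, the zero-padded four-digit ScanSeq#, the ".png" extension, and the book ID "12345" are all assumptions made for illustration (":" is only notation above and is not allowed in Windows filenames); `scan_name` is a hypothetical helper, not an existing tool.

```python
def scan_name(book_id, scan_seq, page_id, ext="png"):
    """Compose BookID_ScanSeq#_PublisherPageID.ext; the page ID is used as printed."""
    return f"{book_id}_{scan_seq:04d}_{page_id}.{ext}"


# Hypothetical book: Roman front matter, then a repeated "12" followed by
# a page numbered backwards ("11"). No letter prefixes are needed.
page_ids = ["i", "ii", "1", "12", "12", "11"]
names = [
    scan_name("12345", seq, page_id)
    for seq, page_id in enumerate(page_ids, start=1)
]

for name in names:
    print(name)
```

Because the ScanSeq# differs for every scan, the two pages both numbered "12" still get distinct names, and a plain alphabetical sort of the files reproduces the scan order.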

My system also integrates well (not saying yours doesn't) into the
natural work flow of the scanning and QC process:

1) Scans are sequentially made from the front to back, including all
   blank pages. Each scan is given a simple ScanSeq#.

2) During the next stage where a human being is looking at each scan,
   they append the *actual* publisher-supplied page number (or string)
   to the filename from (1). No need to add any letter prefixes or
   anything -- they use the *actual* string "as it is".

3) Then, if needed, the BookID (whether it is a DP database record ID
   or a PG text number) is prepended to the whole set. This is
   trivially done with a script. In Windows, I could run a command-line
   .bat file to do this if I wanted.
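
Steps (2) and (3) can be sketched as follows. This is only an illustration under stated assumptions: the "_" separator, the "*.png" pattern, and the helper names `add_page_id` and `prepend_book_id` are hypothetical, and a .bat file would do the same job for step (3).

```python
from pathlib import Path

SEP = "_"  # assumed separator; ":" is not allowed in Windows filenames


def add_page_id(scan_file, page_id):
    """Step 2: append the publisher's page string, as printed, to a step-1 name."""
    path = Path(scan_file)
    return path.with_name(f"{path.stem}{SEP}{page_id}{path.suffix}")


def prepend_book_id(directory, book_id, pattern="*.png"):
    """Step 3: rename every matching file to '<book_id>_<old name>'.

    Returns the new file names in sorted order.
    """
    renamed = []
    # sorted() materializes the listing before any file is renamed.
    for path in sorted(Path(directory).glob(pattern)):
        if path.name.startswith(f"{book_id}{SEP}"):
            continue  # already prefixed, so re-running the script is harmless
        target = path.with_name(f"{book_id}{SEP}{path.name}")
        path.rename(target)
        renamed.append(target.name)
    return renamed
```

For example, `add_page_id("0003.png", "1")` gives the path `0003_1.png`, and `prepend_book_id("scans", "12345")` would then turn it into `12345_0003_1.png`, where "scans" and "12345" are placeholder values.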

Let's not forget that during post-processing (deskewing, cropping,
color reduction, etc.) we generate derivative scan sets whose names
must be distinguishable from those of the other derivative scan sets
(and of the master scan set). This was a problem I had. By adding a
fourth field to the filename, we can differentiate between scan sets
of the same project, which we may wish to preserve (at least in the
working database).
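
As a rough sketch of the fourth field: where the field goes and what it contains are not specified above, so putting it last and using labels such as "master" or "deskew" are assumptions, as are the "_" separator and the helper names.

```python
from typing import NamedTuple


class ScanName(NamedTuple):
    book_id: str
    scan_seq: int
    page_id: str   # publisher's page string, as printed
    scan_set: str  # assumed labels, e.g. "master", "deskew", "crop"


def build_scan_name(name, ext="png"):
    """Compose BookID_ScanSeq#_PublisherPageID_ScanSet.ext."""
    return f"{name.book_id}_{name.scan_seq:04d}_{name.page_id}_{name.scan_set}.{ext}"


def parse_scan_name(filename):
    """Split a four-field name back into its parts.

    The first two and the last field are taken from the ends, so the
    page ID in the middle may itself contain the separator.
    """
    stem = filename.rsplit(".", 1)[0]
    book_id, scan_seq, rest = stem.split("_", 2)
    page_id, scan_set = rest.rsplit("_", 1)
    return ScanName(book_id, int(scan_seq), page_id, scan_set)
```

With this layout, the master and a deskewed derivative of the same page differ only in the last field, for example `12345_0004_12_master.png` and `12345_0004_12_deskew.png`.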

This is not complicated at all, and provides a lot of flexibility.

Jon

Jon Noring