
On Thu, 6 Jul 2006, Greg Newby wrote:
2) "The" as the first word in the title is an artifact of the back-end catalog. These basically need to be fixed by hand. (Yes, I'm sure some could be automated.... Marcello would like to hear your thoughts on this, I'm certain.)
If you are taking your information from the PG online catalog, there should be a "non-filing characters" field for each title, etc., which indicates how many leading characters to ignore for sorting purposes. This is used for initial articles, and the count includes the space that follows the article. For example:

  "The Adventures of Billy"       4
  "An adventure with Billy"       3
  "Das Leben Billys"              4
  "A Long day with Billy"         2
  "Les Amours de Billie"          4
  "La Maraj Vojagxoj de Bilio"    3

Right now, as new titles are added, the common initial articles for German and English are handled automatically. I've dealt with many of the French ones manually.

Hope this helps,
Andrew
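
P.S. In case it is useful, here is a rough sketch of how a non-filing character count could be applied when sorting. It assumes you already have each title paired with its count; the function name filing_key and the list layout are made up for illustration and are not part of the PG catalog software.

```python
def filing_key(title, non_filing):
    """Return a sort key: the title minus its leading non-filing characters.

    `non_filing` is the count described above (article plus trailing space).
    casefold() makes the comparison case-insensitive.
    """
    return title[non_filing:].casefold()


# (title, non-filing character count) pairs, taken from the examples above.
titles = [
    ("The Adventures of Billy", 4),
    ("An adventure with Billy", 3),
    ("Das Leben Billys", 4),
    ("A Long day with Billy", 2),
    ("Les Amours de Billie", 4),
    ("La Maraj Vojagxoj de Bilio", 3),
]

# Sort by the filing key so that initial articles are ignored.
sorted_titles = sorted(titles, key=lambda pair: filing_key(pair[0], pair[1]))

for title, _ in sorted_titles:
    print(title)
```

With this, "The Adventures of Billy" files under "A" rather than "T", and "Das Leben Billys" files under "L".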