
On Mon, Jan 17, 2005 at 05:53:47PM -0500, D Garcia wrote:
On Sunday 16 January 2005 05:25 pm, Greg Newby wrote:
On Sun, Jan 16, 2005 at 05:10:39PM -0500, D Garcia wrote:
It would be excellent to have a nightly-updated feed of appropriately anonymized Cleared clearances published by PG in the same or similar format as the Catalog feed.
This is doable, but what do you want the records for?
The same purpose as David's list, to look up what's In Progress (i.e. "Cleared") so as to avoid duplication of effort. The advantage of PG doing it automatically is *currency*. I know David does the best he can, but it's a huge amount of work and is frequently a month behind. I know from Juliet that recently there have been about 100 clearances being done per day, though that's probably more than usual. Still though, at say 50 a day for a month ... that's a significant lag time where different volunteers could each be getting clearances for the same works. It's happened to me twice recently, and several times before.
Is this style enough, from two recent clearances: OK 20050104154143pergaud Le roman de Miraut, chien de chasse Louis Pergaud 1913:c OK 20050104152050malot En famille Hector Malot 1895:c OK, based on library stamp.--Juliet That's tab-delimited.... Something in XML or with field labels is also easy, though what's above is straight out of the log file the whitewashers use. -- Greg
Everything is in a database now...though the older data are not divided into appropriate fields.
That's good to know, and simplifies what I'm talking about below:
I can forsee some difficulty in tying old clearances to current PG usernames, though many should be mappable via email address.
What do you need usernames for? Generally speaking, we try to keep clearance submitters' personal information "need to know," which many have requested.
*I* don't need to know them. But anyone transforming the old gbn clearances into the new style would need the email on that one to try to match it to the current PG username. Simply an observation from a data manipulation standpoint, and strictly backend.
Hope that clarifies it for you! David _______________________________________________ gutvol-d mailing list gutvol-d@lists.pglaf.org http://lists.pglaf.org/listinfo.cgi/gutvol-d