
here i finish up my thoughts on italics, by myself... as i said earlier, to get accuracy through the book, it's almost a necessity that more than one person go through the book. it would be silly to assume that a sole producer, no matter how dedicated or skilled or tenacious, had located _every_ instance. (i am not saying it cannot be done; i'm saying that it would be silly for us to _assume_ that it was so.) so, then, a different sort of question poses itself: how do you motivate additional volunteers to do the job the second, third, or fourth time around? it's usually easy enough to find someone to do it the _first_ time, because it's virgin territory, and every find is a new hit, and _feels_like_progress._ it's the second, and subsequent, times through it that the job gets unfulfilling, because most of the instances -- 80% or 90% -- are _already_ marked, so there are fewer cases where you get the "rush" of finding a new one. (and fixing a false-positive, although valuable, is rare, and even less thrilling.) *** but there _is_ a way to get the best of both sides. and that's to _hide_ the visibility of current italics, so the person can "have the fun" of finding them. then, because we do know about the presence of many of the instances, we can reward the person with _recognition_ that they "found" what we "hid". and we can also show them any that they missed. so we're both (1) rewarding their successes, and (2) training them, by showing any shortcomings. this keeps 'em motivated, and makes 'em better. *** moreover, we can give them _extra_credit_ when they find an instance that we didn't already know. after all, that's why we do the additional checks; so people find new cases we didn't know about... so that's what we're gonna give good rewards for. in my "italics game" demo, if the italics you clicked had a blue bounding-box, it meant, "hey, good job! you found one of the instances that we had hidden." in our game, this "find" might earn you one point. (or 1000 points, given the point-inflation today.) but if the italics you clicked had a _red_ bounding, it meant that you'd _discovered_a_new_instance_... (this isn't just something arbitrary that i made up; the words with red bounding-boxes are instances that were _actually_ missed by either david or jim, and usually both, in their digitizations of "huck". so, according to them, you _did_ find a new one.) so in our game, this "discovery" of a new instance might earn two points, or five, or 2000, or 5000. (a nice twist might be to base it on the number of rounds so far; it'd be 2 points in the second round, and 3 points in the third, 5 in the fifth, and so on. thus, the more people who'd missed it before you, the higher the reward you'll receive for finding it.) we could have other game-play considerations too. as one, of course, there would also be _penalities_ -- _deducted_ points -- for any missed instances. we could decide how heavily we wanted to charge people for such missed instances, but i'd say that it should be at least _twice_ as much as we reward them for "re-finding" a known instance, since the whole purpose of this exercise is to find new ones, so we want subsequent checkers to be _vigilant._ and we might want to base this penalty, too, on the round, to balance the increased "reward" for finding new instances. so you'd lose 2 points in the second round for misses, 3 in the third, etc. as another wrinkle, we might want to plant some "false misses" in the mix. here, we would _say_ the person had "missed" an instance, but we will then point to an _unitalicized_ word. this would be the equivalent of a "false positive", and thus we'll want the person to _appeal_ the judgment. that way, if round 2 incorrectly italicizes a word, then round 3 will be penalized for not finding it, but the player in round 3 will see that the word is _not_ supposed to be italicized, and _appeal_. and our referee will come and uphold the appeal, thus _removing_ the incorrect mark from round 2. (and yes, retroactively penalize the round 2 player.) so the false penalties teach the players to appeal, and that benefits us by removing false-positives... *** like any social game, you'd post the "standings", and give players a message-board to chat, boast, trashtalk, bellyache, and just generally have fun... *** so, would this kind of "game" thingee work? i dunno. don't care, really. i'm just a coder, building a tool to help automate a dull task... if you want an example of a non-game use of it, see:
http://zenmagiclove.com/misc/italicscheck241.html http://zenmagiclove.com/misc/italicscheck258.html http://zenmagiclove.com/misc/italicscheck271.html http://zenmagiclove.com/misc/italicscheck303.html http://zenmagiclove.com/misc/italicscheck313.html http://zenmagiclove.com/misc/italicscheck320.html http://zenmagiclove.com/misc/italicscheck326.html
these are the same 7 pages i had set up as a game, but here we just have the interface of the tool itself. you'll see that each instance of italics is _highlighted_ by a colorized box in both the left and right margins. (lines with multiple instances display multiple boxes.) these highlights are created by analyzing the text for the page, and seeing which lines have italics marked, then putting the margin-highlights at the appropriate place on the scan where we computed that line to be. or -- if people think it works better -- we can also mark each of those words themselves, with borders, the same way we did with the "game" version earlier. (to see this, click the "turn them on" button up top.) what the checker is checking for, then, is congruence with an italicized word in a line and a highlight for it. any noncongruence -- italics without a highlight, or highlight without italicized word -- needs attention. also note that, new in this version, i've included a _button_ in the right margin, for each line of words. when a line "needs attention", you'll click its button. this will send a report that that line needs attention. this is for when the text is sufficiently correct that changes will be rare, and we would want 'em to be confirmed by someone before they are committed. *** now let's recall how i described the current workflow:
look at the pagescan and note where there are italics, then check each instance in the text, and mark it in the event that it hasn't already been marked. and then, just to be safe, make sure that every italics word that _is_ marked in the text is actually an italics in the scan.
so there are 2 entities in front of you on your screen. one is the pagescan, and the other is the text-file... and most of what you're doing is shifting between one and the other, and then back to the first one... look at the scan, then the text, then back to the scan. repeat, repeat, repeat. like watching a tennis match... haha! my goodness: "like watching a tennis match." sometimes i'm so funny i crack _myself_ up! ;+) now look at how this tool makes that more efficient. first, the text-file is removed from the picture entirely. all edits could be registered automatically by the tool. (however, i suggest instant review by an administrator, so that false-positives will not even get a toehold, and to address anomalies such as don's partial-word italics. this is the "confirmation" that i was mentioning earlier.) still, a person who's _checking_ doesn't mess with text. this means there's no loss-of-attention involved in the task-switch from examining-the-scan to text-editing. moreover, it is unnecessary to look at the text-file to see if an italicized word is marked, because the mark is shown _right_on_the_scan_itself_, where it's needed. you _know_ if a word's marked, or if it still needs to be. so if you see an italic _with_ a mark, you know it's fine. if you see one _without_ a mark, you know to mark it... and if there is a false-positive, you can see that as well, because there is a mark, but no italic-word alongside it. so all that tedious back-and-forth-and back movement of your eyes and attention from the scan to text to scan is eliminated. your eyes and attention stay on the scan. this is the essence of a _huge_ boost in task-efficiency. and that's why we build tools in the first place. are there any questions? -bowerbird
participants (1)
-
Bowerbird@aol.com