
I believe the problem in handling UTF-8 submissions is unitame, the tool that the WWers use to recode UTF-8 to ISO Latin-1. It cannot handle simple things like bullets and Greek, but it would be very easy to extend it to handle these characters and more (I did). Extending unitame only requires editing an ISO Latin-1 file, unitame.dat. A slightly more complicated extension would let unitame accept a submission-specific patch file, looking first for a unitame.dat in the current directory and then falling back to the global unitame.dat.

This could be much easier to finalize than fixing gutcheck. A UTF-8 gutcheck could then be obtained by piping unitame output to "classic" gutcheck. Possibly even a quick and dirty version of unitame.dat, one that replaces unhandled characters with # instead of producing the current verbose output that suggests manual handling, would be enough to use gutcheck with UTF-8 files. Of course, manual tweaking would still be needed if some WWers want to post ISO Latin-1 or ASCII files anyway.

I can contribute a version of unitame.dat that transliterates Greek (although with some suboptimal transliterations) and handles dingbats.

Carlo
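
To make the layered-table idea concrete, here is a rough Python sketch. It is not unitame's actual code and does not read the real unitame.dat format. The tables, the names `GLOBAL_TABLE`, `LOCAL_PATCH`, `merge_tables` and `to_latin1`, and the sample mappings are all hypothetical. It only illustrates the behaviour described above: a submission-specific table overrides the global one, and anything still unhandled becomes #.

```python
# Sketch only: hypothetical tables and function names, not the real
# unitame.dat format or unitame's implementation.

# Stand-in for the global table (assumed mappings, for illustration).
GLOBAL_TABLE = {
    "\u2022": "*",  # bullet
    "\u03b1": "a",  # Greek small alpha
    "\u03b2": "b",  # Greek small beta
}

# Stand-in for a submission-specific patch found next to the submission.
LOCAL_PATCH = {
    "\u2022": "-",  # this submission prefers a dash for bullets
}


def merge_tables(global_table, local_patch):
    """Return a new table in which local entries override global ones."""
    merged = dict(global_table)
    merged.update(local_patch)
    return merged


def to_latin1(text, table, placeholder="#"):
    """Replace every character outside Latin-1 using the table.

    Characters already representable in Latin-1 pass through unchanged;
    characters missing from the table become the placeholder.
    """
    out = []
    for ch in text:
        if ord(ch) < 256:
            out.append(ch)
        elif ch in table:
            out.append(table[ch])
        else:
            out.append(placeholder)
    return "".join(out)
```

Because every character of the result is below U+0100, the returned string can be encoded with `.encode("latin-1")` and handed to a tool that expects Latin-1 input.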