
25 Jan 2012, 10:23 p.m.
Most have argued that UTF-8 requires Unicode. Technically you can UTF-8 encode any set of code points, but for this project it would serve no purpose.

Lee>Unicode without composition.

I suggest that PG has historically "reserved" some commonly used Unicode code points for its own special purposes, and it would at least be wise to take the opportunity to choose much less commonly used code points for those purposes, or alternatively to use uncommon code sequences. As the situation stands right now, one cannot write automatic tools that reliably process PG "utf-8" files.
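
To make the tooling problem concrete, here is a rough Python sketch. The marker choices are my own assumptions for illustration, not a list of what PG actually reserves: `_` (U+005F) stands in for a commonly used code point doing double duty as markup, and U+E000 (the first code point of the BMP Private Use Area) stands in for a rarely used alternative. The helper names are hypothetical too.

```python
# Hypothetical marker choices, for illustration only.
COMMON_MARKER = "_"       # U+005F, also appears in ordinary text
RARE_MARKER = "\ue000"    # U+E000, first code point of the BMP Private Use Area


def marker_positions(text, marker):
    """Return the index of every occurrence of the marker code point."""
    return [i for i, ch in enumerate(text) if ch == marker]


def is_ambiguous(text, marker):
    """A paired marker should occur an even number of times.

    An odd count means at least one occurrence is literal text (or the
    markup is broken), and a tool has no way to tell which one it is.
    """
    return len(marker_positions(text, marker)) % 2 == 1


# The same sentence marked up two ways; "my_notes" contains a literal underscore.
sample_common = "See the file my_notes for the _first_ draft."
sample_rare = "See the file my_notes for the \ue000first\ue000 draft."

print(is_ambiguous(sample_common, COMMON_MARKER))  # True
print(is_ambiguous(sample_rare, RARE_MARKER))      # False

# Both variants are valid UTF-8; the encoding itself is not the problem.
assert sample_rare.encode("utf-8").decode("utf-8") == sample_rare
```

The odd/even check is only a crude heuristic. An even count can still hide two literal uses of a common marker, which is the real argument for picking a code point that ordinary text is unlikely to contain.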