
26 Jan
2012
7:45 p.m.
On Thu, January 26, 2012 12:16 pm, Marcello Perathoner wrote:
On 01/26/2012 07:29 PM, Lee Passey wrote:
But if the file is not UTF-16, then at least the first line is guaranteed to be ASCII.
Wrong. It can also be UCS-4, UCS-2, or EBCDIC.
I stand corrected. The first line is guaranteed to start with "<?xml", so what a program needs to do is read the first 5 bytes and then figure out which of the "other encodings" yields that string. Would PG ever consider using one of these "other encodings" or is this discussion just academic?
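
For illustration, here is a minimal sketch of that kind of sniffing in Python. It is not anything PG actually runs. It assumes the file begins with an XML declaration (optionally preceded by a byte-order mark), and it compares only the first four bytes, the same prefix the autodetection appendix of the XML 1.0 spec uses, because the full string "<?xml" takes 10 bytes in UTF-16 and 20 in UCS-4. The function name and the returned labels are made up for this example.

```python
def sniff_xml_encoding(head: bytes) -> str:
    """Guess the encoding family of an XML file from its first bytes.

    Hypothetical helper: the labels returned here are illustrative only.
    """
    # Byte-order marks are checked first. The 4-byte UCS-4 marks must come
    # before the 2-byte UTF-16 marks, because FF FE 00 00 starts with FF FE.
    byte_order_marks = [
        (b"\x00\x00\xfe\xff", "ucs-4-be"),
        (b"\xff\xfe\x00\x00", "ucs-4-le"),
        (b"\xef\xbb\xbf", "utf-8"),
        (b"\xfe\xff", "utf-16-be"),
        (b"\xff\xfe", "utf-16-le"),
    ]
    # Without a mark, look at how "<?xm" is laid out in each encoding family.
    declaration_prefixes = [
        (b"\x00\x00\x00<", "ucs-4-be"),
        (b"<\x00\x00\x00", "ucs-4-le"),
        (b"\x00<\x00?", "utf-16-be"),
        (b"<\x00?\x00", "utf-16-le"),
        (b"<?xm", "ascii-compatible"),   # ASCII, UTF-8, ISO-8859-x, ...
        (b"\x4c\x6f\xa7\x94", "ebcdic"),  # "<?xm" in EBCDIC
    ]
    for prefix, label in byte_order_marks + declaration_prefixes:
        if head.startswith(prefix):
            return label
    return "unknown"


if __name__ == "__main__":
    # Encode the start of a declaration in a few ways and sniff each one.
    for codec in ("ascii", "utf-16-le", "utf-16-be", "utf-32-be", "cp037"):
        print(codec, "->", sniff_xml_encoding("<?xml".encode(codec)))
```

The result only identifies the encoding family. A real parser would then decode the rest of the declaration in that family and read the encoding="..." attribute to get the exact encoding.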