Re: [mp2] "Unrecognized character" error when running scripts in utf-8

[prev] [thread] [next] [Date index for 2004/11/07]

From: Markus Wichitill
Subject: Re: [mp2] "Unrecognized character" error when running scripts in utf-8
Date: 20:19 on 07 Nov 2004
Stas Bekman wrote:
> Markus Wichitill wrote:
>> No, Perl can deal with a BOM at the start of a file. But 
>> convert_script_to_compiled_handler() puts the BOM in the middle of the 
>> string that is eval'ed. A bunch of raw bytes has no place in the 
>> middle of a script.
> 
> Aha! Now it's clear. Thanks Markus.
> 
> Do we know that it's always coming in the first few chars of the script?

Yes, it's always the first 2-4 bytes.

http://www.unicode.org/faq/utf_bom.html#BOM

        -- 
        Report problems: http://perl.apache.org/bugs/
Mail list info: http://perl.apache.org/maillist/modperl.html
List etiquette: http://perl.apache.org/maillist/email-etiquette.html

(message missing)

[mp2] =?koi8-r?Q?=22?=Unrecognized character=?koi8-r?Q?=22=20?=error when running scripts in utf-8
=?koi8-r?Q?=E9=C7=CF=D2=D8=20=EB=D5=C4=C1=DB=C5=D7=20?= 13:23 on 07 Nov 2004

Re[2]: [mp2] =?koi8-r?Q?=22?=Unrecognized character=?koi8-r?Q?=22=20?=error when running scripts inutf-8
=?koi8-r?Q?=E9=C7=CF=D2=D8=20=EB=D5=C4=C1=DB=C5=D7=20?= 12:09 on 08 Nov 2004

Re: [mp2] "Unrecognized character" error when running scripts in utf-8
Markus Wichitill 20:19 on 07 Nov 2004

Generated at 11:26 on 21 Dec 2004 by mariachi v0.52