Re: [mp2] "Unrecognized character" error when running scripts in utf-8

[prev] [thread] [next] [Date index for 2004/11/07]

From: Stas Bekman
Subject: Re: [mp2] "Unrecognized character" error when running scripts in utf-8
Date: 20:39 on 07 Nov 2004
Markus Wichitill wrote:
> Stas Bekman wrote:
> 
>> Markus Wichitill wrote:
>>
>>> No, Perl can deal with a BOM at the start of a file. But 
>>> convert_script_to_compiled_handler() puts the BOM in the middle of 
>>> the string that is eval'ed. A bunch of raw bytes has no place in the 
>>> middle of a script.
>>
>>
>> Aha! Now it's clear. Thanks Markus.
>>
>> Do we know that it's always coming in the first few chars of the script?
> 
> 
> Yes, it's always the first 2-4 bytes.
> 
> http://www.unicode.org/faq/utf_bom.html#BOM

Thanks for the link, Markus.

I suppose we could add your s/// wrapper to RegistryCooker as a function, 
so to save time to those who want to write a subclass to handle that.


        -- 
        __________________________________________________________________
Stas Bekman            JAm_pH ------> Just Another mod_perl Hacker
http://stason.org/     mod_perl Guide ---> http://perl.apache.org
mailto:stas@xxxxxx.xxx http://use.perl.org http://apacheweek.com
http://modperlbook.org http://apache.org   http://ticketmaster.com

-- 
Report problems: http://perl.apache.org/bugs/
Mail list info: http://perl.apache.org/maillist/modperl.html
List etiquette: http://perl.apache.org/maillist/email-etiquette.html

(message missing)

[mp2] =?koi8-r?Q?=22?=Unrecognized character=?koi8-r?Q?=22=20?=error when running scripts in utf-8
=?koi8-r?Q?=E9=C7=CF=D2=D8=20=EB=D5=C4=C1=DB=C5=D7=20?= 13:23 on 07 Nov 2004

Re[2]: [mp2] =?koi8-r?Q?=22?=Unrecognized character=?koi8-r?Q?=22=20?=error when running scripts inutf-8
=?koi8-r?Q?=E9=C7=CF=D2=D8=20=EB=D5=C4=C1=DB=C5=D7=20?= 12:09 on 08 Nov 2004

Re: [mp2] "Unrecognized character" error when running scripts in utf-8
Stas Bekman 20:39 on 07 Nov 2004

Generated at 11:26 on 21 Dec 2004 by mariachi v0.52