Hi all,

I am trying to use Orbeon with UTF-8 encoding, but I get "Illegal HTML character: decimal 146".

I am using orbeon-hello-world for testing (https://github.com/protocol7/orbeon-hello-world). I changed the characters (in xforms:label) to Russian and added <?xml version="1.0" encoding="UTF-8"?> to the beginning of the HTML file (orbeon-hello-world/src/main/webapp/test/test.html). As a result I get "Illegal HTML character: decimal 146". What is wrong?

If I use a JSP with <%@page pageEncoding="UTF-8" %>, the page is transformed perfectly.

Dmitry Rzhevskiy
www.rzhevskiy.info
Jabber: [hidden email]

--
You receive this message as a subscriber of the [hidden email] mailing list. To unsubscribe: mailto:[hidden email] For general help: mailto:[hidden email]?subject=help OW2 mailing lists service home page: http://www.ow2.org/wws
Administrator
Dima,

Which version of Orbeon Forms are you using?

I think you might be hitting the common "Windows-1252 kind of converted to pseudo-UTF-8" issue, which we fixed in February 2011.

Character 146 is one of those Windows-1252 characters (a single quote), which should not appear in UTF-8, but sometimes does when text is incorrectly converted from Windows-1252.

See this blog post:
http://blog.orbeon.com/2011/05/auto-fixing-windowsunicode-character.html

And this doc:
http://wiki.orbeon.com/forms/doc/developer-guide/processors-converters#TOC-Setting-custom-serializers

If you have a version after February 2011, things should work out of the box.

Otherwise, you could try to fix your text to not include incorrect UTF-8 characters.

-Erik
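To make the "Windows-1252 converted to pseudo-UTF-8" issue concrete: byte 0x92 is a right single quote (U+2019) in Windows-1252, but if the text is converted as though it were ISO-8859-1 it ends up as the control character U+0092, which is decimal 146. The following is only a minimal sketch of one way to repair such text; the class name C1Repair and the method repair are made up for illustration, and this is not Orbeon's actual code.

```java
import java.nio.charset.Charset;

public class C1Repair {
    private static final Charset WINDOWS_1252 = Charset.forName("windows-1252");

    // Hypothetical helper: re-reads every C1 control character (U+0080..U+009F)
    // as the Windows-1252 byte with the same value and appends what it decodes to.
    static String repair(String text) {
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (c >= 0x80 && c <= 0x9F) {
                out.append(new String(new byte[] { (byte) c }, WINDOWS_1252));
            } else {
                out.append(c);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        String broken = "don\u0092t";            // U+0092 is decimal 146
        String fixed = repair(broken);
        System.out.println((int) fixed.charAt(3)); // prints 8217, i.e. U+2019
    }
}
```

Characters outside the U+0080..U+009F range pass through unchanged, so correctly encoded text is not affected by this sketch.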
I use UTF-8 encoding in my files. The default locale of my operating system (Linux) is "ru_RU.UTF-8". I attach the file that produces the exception.

I do not clearly understand how to use custom serializers. Where is the file with the settings for custom serializers located?

Dmitry Rzhevskiy

Attachment: test.html (812 bytes)
Administrator
Curiously, I don't see an exception with the file you sent, when I load it as:

    <page path-info="/test" view="test.xhtml"/>

How do you run this, and with which OF version?

The configuration for the serializers would go in your properties-local.xml file.

-Erik
I use the orbeon-3.9.0.201105152046-CE.zip version, and I am trying to convert to HTML using org.orbeon.oxf.servlet.OrbeonXFormsFilter (separate deployment).

Previously I put the file into the tomcat/webapps/orbeon/xforms-jsp/guess-the-number directory and changed the extension to jsp. Without <%@page pageEncoding="UTF-8" %> added to the header, the page produces the same error.

Dmitry Rzhevskiy
Do you have any idea why the transformation does not work with an internationalized page?

Dmitry Rzhevskiy
Administrator
Dmitry,

I think I found out why: by default, the JSP spec says the JSP page itself is encoded with ISO-8859-1. Your XML declaration at the beginning of the file doesn't change anything, because plain JSP doesn't care about that. So the file is incorrectly read as ISO-8859-1.

You can change that default to UTF-8 by placing this configuration in your web.xml:

    <jsp-config>
        <jsp-property-group>
            <url-pattern>*.jsp</url-pattern>
            <page-encoding>UTF-8</page-encoding>
        </jsp-property-group>
    </jsp-config>

With this change, your file is now working (make a change in your JSP to make sure it is recompiled).

You might also be able to use the "page" directive to change this on a page by page basis.

-Erik
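The effect of reading a UTF-8 file as ISO-8859-1 can be reproduced in a few lines. This is a sketch under assumptions: the contents of the attached test.html are not shown in the thread, so the Cyrillic letter В (U+0412) is used here only as an example of a character whose UTF-8 encoding contains the byte 0x92, and the class name JspEncodingDemo is made up for illustration.

```java
import java.nio.charset.StandardCharsets;

public class JspEncodingDemo {
    // Encodes the text as UTF-8, then decodes the bytes as ISO-8859-1,
    // imitating a reader that assumes the ISO-8859-1 default for a UTF-8 file.
    static String misreadAsLatin1(String text) {
        byte[] utf8 = text.getBytes(StandardCharsets.UTF_8);
        return new String(utf8, StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        // U+0412 (Cyrillic capital VE) is the two bytes 0xD0 0x92 in UTF-8.
        String misread = misreadAsLatin1("\u0412");
        for (char c : misread.toCharArray()) {
            System.out.println((int) c); // prints 208, then 146
        }
    }
}
```

The second character is U+0092, decimal 146, which matches the value in the reported error message. Declaring the page encoding as UTF-8 means the two bytes are decoded as a single Cyrillic letter instead.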