unclean HTML?

classic Classic list List threaded Threaded
8 messages Options
Reply | Threaded
Open this post in threaded view
|

unclean HTML?

Sean Hogan (CLIXtec)
How are people dealing with using Orbeon Forms from JSP that contains plain old messy, real-world HTML?
 
I'm trying to work XForms into an existing application that uses !DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd".  Orbeon (or more precisely whatever it uses to parse XML) strongly dislikes the content of loose.dtd, since it barfs processing it.  It dislikes the strict DTD as well.  If I remove the DTD references entirely, then I get farther but even relatively innocuous things like ampersand-nbsp-semicolon trip up the processing.
 
I understand why Orbeon wants to see proper XHTML, but the reality is that there's a lot of old junk out there.  In my case there are hundreds of JSPs including each other all over the place, with lots of cruft that just happens to work and so hasn't been touched in years.  What is the most practical way to deal with this situation?
 
Thanks,
    Sean
---
Sean Hogan
CLIXtec IT Consulting Inc.
m: (709) 687-2275
e: [hidden email]
e: [hidden email]
w: http://clixtec.ca


--
You receive this message as a subscriber of the [hidden email] mailing list.
To unsubscribe: mailto:[hidden email]
For general help: mailto:[hidden email]?subject=help
OW2 mailing lists service home page: http://www.ow2.org/wws
Reply | Threaded
Open this post in threaded view
|

Re: unclean HTML?

Erik Bruchez
Administrator
Good idea.

How things work now:

* Filter stores JSP output as String into request
* xforms-renderer.xpl runs
* oxf:scope-generator reads String and tries to parse it as XML.

One idea would be to allow oxf:scope-generator to support "tag soup"  
input and to clean the HTML with a recent version of TagSoup (JAR is  
actually already checked into our CVS), which I think is able to keep  
foreign elements (and maybe attributes).

It would be great if you could give this a try.

-Erik

On Apr 18, 2009, at 4:59 PM, Sean Hogan (CLIXtec) wrote:

> How are people dealing with using Orbeon Forms from JSP that  
> contains plain old messy, real-world HTML?
>
> I'm trying to work XForms into an existing application that uses !
> DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd 
> ".  Orbeon (or more precisely whatever it uses to parse XML)  
> strongly dislikes the content of loose.dtd, since it barfs  
> processing it.  It dislikes the strict DTD as well.  If I remove the  
> DTD references entirely, then I get farther but even relatively  
> innocuous things like ampersand-nbsp-semicolon trip up the processing.
>
> I understand why Orbeon wants to see proper XHTML, but the reality  
> is that there's a lot of old junk out there.  In my case there are  
> hundreds of JSPs including each other all over the place, with lots  
> of cruft that just happens to work and so hasn't been touched in  
> years.  What is the most practical way to deal with this situation?
>
> Thanks,
>     Sean
> ---
> Sean Hogan
> CLIXtec IT Consulting Inc.
> m: (709) 687-2275
> e: [hidden email]
> e: [hidden email]
> w: http://clixtec.ca
>
> --
> You receive this message as a subscriber of the [hidden email]  
> mailing list.
> To unsubscribe: mailto:[hidden email]
> For general help: mailto:[hidden email]?subject=help
> OW2 mailing lists service home page: http://www.ow2.org/wws
--
Orbeon Forms - Web Forms for the Enterprise Done the Right Way
http://www.orbeon.com/



--
You receive this message as a subscriber of the [hidden email] mailing list.
To unsubscribe: mailto:[hidden email]
For general help: mailto:[hidden email]?subject=help
OW2 mailing lists service home page: http://www.ow2.org/wws
Reply | Threaded
Open this post in threaded view
|

Re: Re: unclean HTML?

Sean Hogan (CLIXtec)
"Good idea"?  I don't think I made any constructive suggestions - though I
like yours. :-)  Unfortunately I don't have the Orbeon Forms background
(yet) to make that sort of change.  Since this is only a proof of concept
for the project on which I'm consulting, I may need to back out and take a
different approach.

Regards,
    Sean

--------------------------------------------------
From: "Erik Bruchez" <[hidden email]>
Sent: Monday, April 20, 2009 7:16 PM
To: <[hidden email]>
Subject: [ops-users] Re: unclean HTML?

> Good idea.
>
> How things work now:
>
> * Filter stores JSP output as String into request
> * xforms-renderer.xpl runs
> * oxf:scope-generator reads String and tries to parse it as XML.
>
> One idea would be to allow oxf:scope-generator to support "tag soup"
> input and to clean the HTML with a recent version of TagSoup (JAR is
> actually already checked into our CVS), which I think is able to keep
> foreign elements (and maybe attributes).
>
> It would be great if you could give this a try.
>
> -Erik
>
> On Apr 18, 2009, at 4:59 PM, Sean Hogan (CLIXtec) wrote:
>
>> How are people dealing with using Orbeon Forms from JSP that
>> contains plain old messy, real-world HTML?
>>
>> I'm trying to work XForms into an existing application that uses !
>> DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
>> "http://www.w3.org/TR/html4/loose.dtd
>> ".  Orbeon (or more precisely whatever it uses to parse XML)
>> strongly dislikes the content of loose.dtd, since it barfs
>> processing it.  It dislikes the strict DTD as well.  If I remove the
>> DTD references entirely, then I get farther but even relatively
>> innocuous things like ampersand-nbsp-semicolon trip up the processing.
>>
>> I understand why Orbeon wants to see proper XHTML, but the reality
>> is that there's a lot of old junk out there.  In my case there are
>> hundreds of JSPs including each other all over the place, with lots
>> of cruft that just happens to work and so hasn't been touched in
>> years.  What is the most practical way to deal with this situation?
>>
>> Thanks,
>>     Sean
>> ---
>> Sean Hogan
>> CLIXtec IT Consulting Inc.
>> m: (709) 687-2275
>> e: [hidden email]
>> e: [hidden email]
>> w: http://clixtec.ca
>>
>> --
>> You receive this message as a subscriber of the [hidden email]
>> mailing list.
>> To unsubscribe: mailto:[hidden email]
>> For general help: mailto:[hidden email]?subject=help
>> OW2 mailing lists service home page: http://www.ow2.org/wws
>
> --
> Orbeon Forms - Web Forms for the Enterprise Done the Right Way
> http://www.orbeon.com/
>
>


>
> --
> You receive this message as a subscriber of the [hidden email] mailing
> list.
> To unsubscribe: mailto:[hidden email]
> For general help: mailto:[hidden email]?subject=help
> OW2 mailing lists service home page: http://www.ow2.org/wws
>


--
You receive this message as a subscriber of the [hidden email] mailing list.
To unsubscribe: mailto:[hidden email]
For general help: mailto:[hidden email]?subject=help
OW2 mailing lists service home page: http://www.ow2.org/wws
Reply | Threaded
Open this post in threaded view
|

Re: Re: Re: unclean HTML?

Erik Bruchez
Administrator
Well I took it as a suggestion ;)

Note you may not have to modify Orbeon Forms for a first shot. Just  
write a filter to clean your JSP output by calling TagSoup:

   http://home.ccil.org/~cowan/XML/tagsoup/

That may still be too much work for the time you have, but I thought I  
would mention it.

-Erik

On Apr 20, 2009, at 7:04 PM, Sean Hogan (CLIXtec) wrote:

> "Good idea"?  I don't think I made any constructive suggestions -  
> though I like yours. :-)  Unfortunately I don't have the Orbeon  
> Forms background (yet) to make that sort of change.  Since this is  
> only a proof of concept for the project on which I'm consulting, I  
> may need to back out and take a different approach.
>
> Regards,
>   Sean
>
> --------------------------------------------------
> From: "Erik Bruchez" <[hidden email]>
> Sent: Monday, April 20, 2009 7:16 PM
> To: <[hidden email]>
> Subject: [ops-users] Re: unclean HTML?
>
>> Good idea.
>>
>> How things work now:
>>
>> * Filter stores JSP output as String into request
>> * xforms-renderer.xpl runs
>> * oxf:scope-generator reads String and tries to parse it as XML.
>>
>> One idea would be to allow oxf:scope-generator to support "tag soup"
>> input and to clean the HTML with a recent version of TagSoup (JAR is
>> actually already checked into our CVS), which I think is able to keep
>> foreign elements (and maybe attributes).
>>
>> It would be great if you could give this a try.
>>
>> -Erik
>>
>> On Apr 18, 2009, at 4:59 PM, Sean Hogan (CLIXtec) wrote:
>>
>>> How are people dealing with using Orbeon Forms from JSP that
>>> contains plain old messy, real-world HTML?
>>>
>>> I'm trying to work XForms into an existing application that uses !
>>> DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd
>>> ".  Orbeon (or more precisely whatever it uses to parse XML)
>>> strongly dislikes the content of loose.dtd, since it barfs
>>> processing it.  It dislikes the strict DTD as well.  If I remove the
>>> DTD references entirely, then I get farther but even relatively
>>> innocuous things like ampersand-nbsp-semicolon trip up the  
>>> processing.
>>>
>>> I understand why Orbeon wants to see proper XHTML, but the reality
>>> is that there's a lot of old junk out there.  In my case there are
>>> hundreds of JSPs including each other all over the place, with lots
>>> of cruft that just happens to work and so hasn't been touched in
>>> years.  What is the most practical way to deal with this situation?
>>>
>>> Thanks,
>>>    Sean
>>> ---
>>> Sean Hogan
>>> CLIXtec IT Consulting Inc.
>>> m: (709) 687-2275
>>> e: [hidden email]
>>> e: [hidden email]
>>> w: http://clixtec.ca
>>>
>>> --
>>> You receive this message as a subscriber of the [hidden email]
>>> mailing list.
>>> To unsubscribe: mailto:[hidden email]
>>> For general help: mailto:[hidden email]?subject=help
>>> OW2 mailing lists service home page: http://www.ow2.org/wws
>>
>> --
>> Orbeon Forms - Web Forms for the Enterprise Done the Right Way
>> http://www.orbeon.com/
>>
>>
>
>
>
>>
>> --
>> You receive this message as a subscriber of the [hidden email]  
>> mailing list.
>> To unsubscribe: mailto:[hidden email]
>> For general help: mailto:[hidden email]?subject=help
>> OW2 mailing lists service home page: http://www.ow2.org/wws
>
> --
> You receive this message as a subscriber of the [hidden email]  
> mailing list.
> To unsubscribe: mailto:[hidden email]
> For general help: mailto:[hidden email]?subject=help
> OW2 mailing lists service home page: http://www.ow2.org/wws
--
Orbeon Forms - Web Forms for the Enterprise Done the Right Way
http://www.orbeon.com/



--
You receive this message as a subscriber of the [hidden email] mailing list.
To unsubscribe: mailto:[hidden email]
For general help: mailto:[hidden email]?subject=help
OW2 mailing lists service home page: http://www.ow2.org/wws
Reply | Threaded
Open this post in threaded view
|

Release date for 3.7

Trond Hjelmaas
Hi Erik/Orbeon,
 
Do you have any indication of the 3.7 release date?
 
Cheers,
Trond
 


--
You receive this message as a subscriber of the [hidden email] mailing list.
To unsubscribe: mailto:[hidden email]
For general help: mailto:[hidden email]?subject=help
OW2 mailing lists service home page: http://www.ow2.org/wws

winmail.dat (3K) Download Attachment
Reply | Threaded
Open this post in threaded view
|

Re: Release date for 3.7

Erik Bruchez
Administrator
Trond,

Every week it's one week later :-(

Current plan is:

* Switch to an even/odd release numbering system:
** 3.7.x, 3.9.x etc. are "dev" builds
** 3.8.x, 4.0.x etc. are "stable" builds
* Release ASAP a version called 3.7.1:
** it's going to be a release, but a "dev" release still because after  
all it has lots of bleeding-edge features
** many users have kind of been assuming they were using 3.7 all  
along ;)

After that the idea is to do dev releases more often, and when when we  
are happy with a particular dev release, to branch it and make it a  
stable release.

-Erik

On Apr 20, 2009, at 10:19 PM, Trond Hjelmaas wrote:

> Hi Erik/Orbeon,
>
> Do you have any indication of the 3.7 release date?
>
> Cheers,
> Trond
>
> <winmail.dat>
> --
> You receive this message as a subscriber of the [hidden email]  
> mailing list.
> To unsubscribe: mailto:[hidden email]
> For general help: mailto:[hidden email]?subject=help
> OW2 mailing lists service home page: http://www.ow2.org/wws
--
Orbeon Forms - Web Forms for the Enterprise Done the Right Way
http://www.orbeon.com/



--
You receive this message as a subscriber of the [hidden email] mailing list.
To unsubscribe: mailto:[hidden email]
For general help: mailto:[hidden email]?subject=help
OW2 mailing lists service home page: http://www.ow2.org/wws
Reply | Threaded
Open this post in threaded view
|

Re: Release date for 3.7

Ronthe
This post was updated on .
Erik,

Sorry for *that difficult question* again, but do you an update on the release date of a production ready version with Firefox3 support? We're really looking forward to this :)

Rgds,
Ronny


ps: sorry if this is a resend, I wasn't subscribed to the mailing list before sending
Reply | Threaded
Open this post in threaded view
|

Re: Release date for 3.7

Alessandro Vernet
Administrator
Ronny,

Ronthe wrote
Sorry for *that difficult question* again, but do you an update on the release date of a production ready version with Firefox3 support? We're really looking forward to this :)
As mentioned by Erik earlier, the next release will be a "dev" version, numbered 3.7.x. The next "production" version will be a 3.8.x. We are now really focusing the next "dev" release which we hope to have out very soon. We'll then see when we can release the next "production" version. Is that enough for an evasive answer? ;)

Alex