Reading WWN 284 I noticed the following line:
stabilizing for release &.
And sure enough the source XML is buggy:
stabilizing for release &<g&>.
I believe the intent was the following to deal with the double parsing:
stabilizing for release &lt;g&gt;.
But really this double parsing is a major pain. Can't we get rid of it? Because it's causing a lot of bugs:
$ egrep "<.*>" wwn/*.sgml | grep -v "who=" | wc -l 190
And it's not just missing grins either. For instance in WWN 125:
gdb <gdb remote protocol> winedbg <Win32 debug API> wineserver
and all the reader sees is:
gdb winedbg wineserver
which is totally wrong of course.
And in WWN 149:
#include <comcat.h> <br /> #include <oaidl.h><br /> #include <objbase.h><br /> #include <objidl.h><br />
...
all the reader sees is:
#include #include #include #include
...
Not very informative!
Yeah, this is a long standing issue of using HTML output in an XML document. In reality, you should not embed HTML in XML. I've done some tricks in our parser to get it to work, but it still fails in certain areas. The XML parser will convert any < to < and that is returned to the browser. Any text in between < and > is ignored by your browser. It assumes it is an HTML tag. What I do is convert < to &lt;. This then allows the parser to return < Messy, very messy.
I used to have this in the parser, then I removed it thinking I didn't need it anymore. Wrong, this is still needed so I re-added back to our code.
IIRC, this trick still causes errors with other things needing &'s. I checked and at least it does not break any URLs with & as delimiters.
If there is a better way to do this, I'd love to know how.
On Thu, 2005-07-28 at 13:06 +0200, Francois Gouget wrote:
Reading WWN 284 I noticed the following line:
stabilizing for release &.
And sure enough the source XML is buggy:
stabilizing for release &<g&>.
I believe the intent was the following to deal with the double parsing:
stabilizing for release &lt;g&gt;.
But really this double parsing is a major pain. Can't we get rid of it? Because it's causing a lot of bugs:
$ egrep "<.*>" wwn/*.sgml | grep -v "who=" | wc -l 190
And it's not just missing grins either. For instance in WWN 125:
gdb <gdb remote protocol> winedbg <Win32 debug API> wineserver
and all the reader sees is:
gdb winedbg wineserver
which is totally wrong of course.
And in WWN 149:
#include <comcat.h> <br /> #include <oaidl.h><br /> #include <objbase.h><br /> #include <objidl.h><br />
...
all the reader sees is:
#include #include #include #include
...
Not very informative!