Re: [PATCH 1/5] hhctrl.ocx: Add HTML to Unicode parsing capability.

Jacek Caban

8 Jun 2012, 8:58 a.m.

Hi Erich,

On 06/07/12 23:09, Erich E. Hoover wrote:

...

/* Post the HTML text to the document */
array = SafeArrayCreateVector(VT_VARIANT, 0, 1);
if(!array)
    goto cleanup;
hr = SafeArrayAccessData(array, (LPVOID*)&array_param);
if (FAILED(hr))
    goto cleanup;
V_VT(array_param) = VT_BSTR;
V_BSTR(array_param) = SysAllocString(html_fragment);
hr = SafeArrayUnaccessData(array);
if (FAILED(hr))
    goto cleanup;
hr = IHTMLDocument2_write(html_parsing_doc, array);

Did you test indexes like '<script>alert("really!?")</script>' ? :) Seriously, HTMLDocument is not the right tool for the job.

Cheers, Jacek

Erich E. Hoover

8 Jun 2012, 2:10 p.m.

On Fri, Jun 8, 2012 at 2:58 AM, Jacek Caban jacek@codeweavers.com wrote:

...

... Did you test indexes like '<script>alert("really!?")</script>' ? :) Seriously, HTMLDocument is not the right tool for the job.

I can. Do you have a suggestion for an alternative?

Erich

Jacek Caban

2:17 p.m.

On 06/08/12 16:10, Erich E. Hoover wrote:

...

On Fri, Jun 8, 2012 at 2:58 AM, Jacek Caban jacek@codeweavers.com wrote:

...
... Did you test indexes like '<script>alert("really!?")</script>' ? :) Seriously, HTMLDocument is not the right tool for the job.

I can. Do you have a suggestion for an alternative?

I don't know of any helper API for that. Writing a decoder for HTML-encoded characters sounds like a good solution.

Cheers, Jacek

Erich E. Hoover

9:18 p.m.

On Fri, Jun 8, 2012 at 8:17 AM, Jacek Caban jacek@codeweavers.com wrote:

...

... I don't know of any helper API for that. Writing a decoder for HTML-encoded characters sounds like a good solution.

How does something like the attached sound?

Erich

Jacek Caban

10 Jun 2012, 5:19 p.m.

On 6/8/12 11:18 PM, Erich E. Hoover wrote:

...

On Fri, Jun 8, 2012 at 8:17 AM, Jacek Caban jacek@codeweavers.com wrote:

...
... I don't know of any helper API for that. Writing a decoder for HTML-encoded characters sounds like a good solution.

How does something like the attached sound?

A few comments:

You definitely don't need a new header file for just one function declaration. Even the implementation probably doesn't need a separate file (it's <200 lines of code that is unlikely to grow).

+#include "hhctrl.h"
+#include <mshtml.h>

Probably left from the previous patch?

+    spc = strchr(amp, ' ');
+    if(spc && spc < sem)
+        break; /* cannot have a space between the ampersand and the semicolon */

This should not be needed (see above).

+    /* Convert the characters prior to the HTML encoded character */
+    wlen = MultiByteToWideChar(CP_ACP, 0, h, len, NULL, 0);
+    MultiByteToWideChar(CP_ACP, 0, h, len, w, wlen);

One call should be enough. You can just pass the remaining space in the output buffer as its length.

+    if(amp[0] != '#')
+    {
+        for(i=0;i<sizeof(html_encoded_symbols)/sizeof(html_encoded_symbols[0]);i++)
+        {
+            const char *encoded_symbol = html_encoded_symbols[i].html_code;
+
+            if(strncmp(encoded_symbol, amp, len) == 0)
+            {
+                symbol = html_encoded_symbols[i].ascii_symbol;
+                break;
+            }
+        }
+    }

Binary search sounds like a good choice here (although just a FIXME comment would be fine for the patch).
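For illustration only, a binary search over the entity table could look roughly like the sketch below. It is not taken from the patch: the struct, the four-entry table, and the names `entity_cmp` and `lookup_entity` are all hypothetical, and the table is assumed to be kept sorted by entity name in strcmp order. The key carries an explicit length because the name in the input is terminated by ';' rather than by a NUL.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical table: a tiny subset of entities, sorted by name (strcmp order)
 * so that bsearch() can be used on it. */
struct html_entity
{
    const char *name;   /* entity name without the '&' and the ';' */
    char symbol;        /* character the entity decodes to */
};

static const struct html_entity entities[] =
{
    { "amp",  '&' },
    { "gt",   '>' },
    { "lt",   '<' },
    { "quot", '"' },
};

/* Search key: the name as it appears in the input, plus its length. */
struct entity_key
{
    const char *name;
    size_t len;
};

static int entity_cmp(const void *key, const void *elem)
{
    const struct entity_key *k = key;
    const struct html_entity *e = elem;
    int r = strncmp(k->name, e->name, k->len);

    if (r)
        return r;
    /* The first len characters match: it is a hit only if the table entry
     * ends here too, otherwise the key is a shorter prefix and sorts first. */
    return e->name[k->len] ? -1 : 0;
}

/* Returns the decoded character, or 0 if the name is not in the table. */
static char lookup_entity(const char *name, size_t len)
{
    struct entity_key key = { name, len };
    const struct html_entity *e;

    e = bsearch(&key, entities, sizeof(entities) / sizeof(entities[0]),
                sizeof(entities[0]), entity_cmp);
    return e ? e->symbol : 0;
}
```

With this sketch, `lookup_entity("amp;", 3)` yields '&', while a mere prefix such as `lookup_entity("a;", 1)` yields 0 because the comparison also checks that the table entry ends at the same length.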

+    {
+        int tmp;
+
+        sscanf(amp, "%d", &tmp);
+        symbol = tmp;
+    }

This will decode "&#123xxx;" as 123 instead of an invalid char. If you get it right, the earlier check for space won't be needed. strtol is probably a better tool for this.
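A minimal sketch of the strtol approach, under stated assumptions: the helper name `decode_numeric_ref` is made up for illustration, the pointer passed in is assumed to sit just past the '#', and -1 is an arbitrary "invalid" marker. The point is that the end pointer returned by strtol shows where parsing stopped, so anything other than ';' at that position (trailing garbage or a space) can be rejected in one check.

```c
#include <errno.h>
#include <stdlib.h>

/*
 * Hypothetical helper, not from the patch under review.
 * 'ref' points just past the '#' of a numeric reference such as "&#123;".
 * Returns the decoded value, or -1 when the digits are not followed
 * directly by ';' (for example "123xxx;" or "12 3;").
 */
static long decode_numeric_ref(const char *ref)
{
    char *end;
    long value;

    /* strtol() skips leading whitespace and accepts a sign, neither of
     * which is valid here, so require a digit up front. */
    if (*ref < '0' || *ref > '9')
        return -1;

    errno = 0;
    value = strtol(ref, &end, 10);
    if (errno == ERANGE || *end != ';')
        return -1;
    return value;
}
```

Here `decode_numeric_ref("123;")` gives 123, while `decode_numeric_ref("123xxx;")` and `decode_numeric_ref("12 3;")` both give -1, so no separate scan for a space is needed.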

+    wlen = MultiByteToWideChar(CP_ACP, 0, &symbol, 1, NULL, 0);
+    MultiByteToWideChar(CP_ACP, 0, &symbol, 1, w, wlen);

Same here, two calls are not needed.

Cheers, Jacek

wine-devel@winehq.org
