Kees Cook wrote:
On Mon, Oct 25, 2004 at 11:49:14PM -0400, jon@...18... wrote:
- try to reproduce the bug
Can anyone give me a crash-course (har har) in setting up utf-8 character paths? I found the localization options for changing languages in windows, but I'm not sure how to generate non-ascii characters.
I'd be happy to do testing under w2k if I knew how to create the test case. ;)
In general... use the character map accessory. It's under Start|Programs|Accessories|System tools or some such. Remember to switch to advanced view.
Oh, here's the first Google hit on it: http://www.dewassoc.com/support/win2000/windows_2000_character_map.htm
Windows CodePage-1252 is "Windows Western".
ASCII has values from 0 through 127. Latin-1 has values form 0 through 255
The first 128 Latin-1 values are the same as ASCII. The first 256 Unicode values are the same as Latin-1
Windows 1252 is the same as Latin-1 *except* for the 32 characters from $80 through $9F. Here's a chart of 1252 http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1252.TXT
So the test cases should go
ASCII (0-127) Latin-1 ($9F-$FF) Windows Western ($80-$9F) Unicode ($100 and up, except for those in Windows Western)
Cyrillic might be good for Unicode. Japanese is also, if your fonts show it.