.:∴ Archives ∴:.
Tierra Verde

Tuesday, July 27

Microsoft makes unicode entities SO fun. NOT!  


I just added every character from 128 to 255 to my url2mail.c program to encode. I thought I was going to have to, but it became obvious once I looked at the Unicode font set for that space. Microsoft uses a bunch of it for their own purposes (Euro, and other things that normally go elsewhere in the code set are there. So I just encoded them to their actual names, which took care of everything but two, Z caron and zcaron. Ž and ž didn't work for them, although Š works for S caron. Go figure. Who knows what happens if I were to try to use this program under Linux or Mac OS.

Oh, and 5 characters come out as undisplayable (question marks under Mozilla, empty boxes under IE), so I just replace them with question marks.

Update: fixed my spelling mistake in the title. Sigh. :(

Permanent link posted by bytehead @ 7/27/2004 11:46:00 AM   Edit this entry 0 comments Links to this post

0 Comments:

Post a Comment

Links to this post:

Create a Link

 

Article Index

Microsoft makes unicode entities SO fun. NOT!
Copyright © 2003-2008, Bryan Price. Licensed by Creative Commons License: Creative Commons License
Home| This page| Colophon| Disclaimer| Privacy Statement