Ticket #161 (WorksForMe)Mon Jan 14 20:48:53 UTC 2008
Phoenix - misreads hard space
Reported by: | Bryn Evans (177) | Severity: | Minor |
Part: | RISC OS: Application | Release: | 2nd public site release |
Milestone: | Status | WorksForMe |
Details by Bryn Evans (177):
When Phoenix [ version 2.13 ] encounters a Hard Space, character 160, a capital letter A circumflex,
character 194, is displayed.
I seem to remember that this happened with !Browse, prior to version 2.xx
Changelog:
Modified by Andrew Hodgkinson (6) Thu, February 28 2008 - 13:57:21 GMT
Sorry for the delay following this up. A few questions:
- What version of RISC OS are you using?
- Did you build the Phoenix version yourself, or did it come from ROOL CD 1?
- What does the output of “*Alphabet” say on your machine?
Thanks!
Modified by Trevor Johnson (329) Mon, January 24 2011 - 13:51:46 GMT
How many people are going to use Phoenix?
I suggest changing status to ‘WontFix’.
Modified by Sprow (202) Sun, January 12 2020 - 21:54:32 GMT
- Status changed from Open to WorksForMe
Prompted by unexpected micro update Browse 2.17, took a look at this. It turns out to not specifically be hard spaces being “misread”, any HTML entity is affected, for example © comes out as A-circumflex-copyright.
Rummaging in HTMLLib I see it has a lookup table of entities to character codes, then (depending on the state of UNIFONT) looks it up either via UCS4_to_UTF8() or __html_iso10646_to_latin1(). And that’s the key. My assertion is Byrn tried Unicode on Latin1 so saw lots of A-circumflexes which are the UTF8 escape character.
So:
- If you’re using a Unicode build of Browse, you need the system to use UTF8 (ie. have done *Alphabet UTF8)
- For non Unicode builds, you can use the default alphabet (most likely, *Alphabet Latin1)