Encodings
Jeffrey Lee (213) 6048 posts |
Also, well-formed Unicode text files should start with a Unicode BOM. |
Matthew Phillips (473) 721 posts |
Well, I’d agree if you said it the other way round: a file starting with a Unicode BOM is much more likely to be a Unicode file, but for UTF-8 a BOM is not required. The Wikipedia article gives various reasons in favour or against use of a BOM with UTF-8. |
nemo (145) 2552 posts |
Clive wrote
Yes, it’ll always be ‘hungarumlaut’ in my head because that’s its Postscript name. ;-)
Oh very nice. |
nemo (145) 2552 posts |
Indeed. BOMs are a screaming hint to ancient systems that can’t be bothered to recognise what they’ve got. They are much more useful for UTF-16 than UTF-8. |
Clive Semmens (2335) 3276 posts |
Oh, I know that. I wasn’t blaming you!! |
Glen Walker (2585) 469 posts |
Sorry for resurrecting an old thread…but I’ve been out of the loop somewhat. I did start work on an editor that could do as you ask (although I was only going to have it save/load between UTF-8 and the Latin1 in RISC OS). Haven’t got very far at all with it but its on a list of things I would like to do with RISC OS at some point! If anyone is interested progress will likely be on here when progress happens: |