This character encoding is all becoming very frustrating…..

I think we need to do some basic testing ourselves, because what Roman is saying contradicts what he said to me before. And when I did some rudimentary testing re TI-Nspire I didn’t notice any of these issues.
From what I understand or have observed:
- When creating either a legacy DAT or an XML DAT (will refer to these as ‘DAT’ and ‘XMLDAT’ from here for clarity), CMP does not automatically declare the XMLDAT as UTF-8 (“Unicode”). This is to be expected. XML doesn’t automatically mean UTF-8, and you can ‘declare’ an XML document to be encoded as pretty much anything. This is what I originally mentioned here:
http://www.tosecdev.org/index.php/forum/index.php?topic=191.msg2283#msg2283. This flexibility is kind of the point, and this encoding declaration is something you will see in XML based documents (like on the web, docx, ini’s etc). I do agree that having a simple toggle option in CMP DAT2DIR module would be useful however, but hey-ho….
- When creating an XMLDAT thru CMP, the “name”/”rom” with High-ACSII characters are created correct (from when I have tried, and Diaboł has tried - subject to the XMLDAT having the correct header).
- Diaboł is saying that the header is not created correct (with High-ASCII authors)? I don’t know about this since I didn’t realise anyone even used this part of the DAT2DIR. I always leave blank then open the new DAT in a text editor (Notepad++) and copy the existing header into the new, simply increasing the date counter.
When I get a chance I’ll create a test XMLDAT and test group of Low/High-ASCII files I think. Would be very useful if everyone tests it and gives feedback. Since pretty much everyone here (the ‘regulars’ I mean) are from different parts of the globe, this should be a very effect/broad test!
I know this all may be a bit painful/learning experience in the short term, but long term I really think it will benefit us.
This is all far from impossible of course…. MAME as been using XML for years and the latest MESS Software XMLDATs even include roms with Japansese Kanji!
We can get this licked too (for our uses)…..