Topic: Space character at end & non-english characters

Offline nomisholman

  • Newbie
  • *
  • Posts: 1
Space character at end & non-english characters
« on: April 21, 2011, 11:03:15 PM »
Hello, I've been going through the latest DAT files (2011-04-07) and have found a few things that need correcting.
I searched the forums, but these don't seem to be listed there.


These files have a space character at the end of the rom name.

Atari 2600 & VCS - Games (TOSEC-v2011-02-22_CM)
   name "ZZZ-UNK-Dark Mage Mod (19xx)(-)(PD)[ b][aka DM-FOOT] "
   name "ZZZ-UNK-Moon Patrol (1982) (Shock Vision-Brazil) "
   name "ZZZ-UNK-Space Shuttle - Journey Into Space (1983) (Activision) (PAL) "

Commodore C64 - Diskmags - [D64] (TOSEC-v2011-03-19_CM)
   name "Bloody Scene, The #1 (19xx)(-) "

IBM PC Compatibles - Applications - [IMA] (TOSEC-v2011-01-06_CM)
   name "Disklock v1.02 (1991)(Fifth Generation Systems)[sn 653-0006084] "
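
For anyone who wants to check other DATs for the same problem, here is a minimal Python sketch. It assumes the name is always written as a double-quoted field, as in the entries above; the function name and the sample text are only illustrative and not part of any TOSEC tool.

```python
import re

# Matches a quoted name field, e.g.  name "Some Title (19xx)(-) "
NAME_RE = re.compile(r'\bname\s+"([^"]*)"')

def names_with_trailing_space(dat_text):
    """Return every quoted name in dat_text that ends in whitespace."""
    return [n for n in NAME_RE.findall(dat_text) if n != n.rstrip()]

# Two entries taken from the lists in this post: one bad, one fine.
sample = '''
   name "Bloody Scene, The #1 (19xx)(-) "
   name "Atari Times Issue 11 (1997-09)(Croft Soft)"
'''

print(names_with_trailing_space(sample))
```

Unquoted names (which cannot carry a trailing space anyway) are ignored by this pattern.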


In the next set, the subfolders inside the set have a "." character at the end of their names, e.g. "ADVERTS2."

Atari ST - Online Magazines (TOSEC-v2009-09-01_CM)
   name "Atari Times Issue 11 (1997-09)(Croft Soft)"
   description "Atari Times Issue 11 (1997-09)(Croft Soft)"
   rom ( name "ATIMES.GIF" size 4042 crc 244c4b67 md5 f6149e24e838d6d169a0feb80beb3d79 )
   rom ( name "AT_11/ADVERTS/ADVERTS2./42BBS.GIF" size 3143 crc c68e9fe5 md5 bed0e271014501e6110f095a64810d9c )
   rom ( name "AT_11/ADVERTS/ADVERTS2./42BBS.HTM" size 1320 crc f7daf19e md5 97eb438e24392db0a01a27d3098f858d )
      ...... etc
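
A rough way to list such folders, sketched in Python. It assumes rom paths use "/" or "\" as separators, as they do in the entries quoted in this post; the helper name is made up for illustration.

```python
import re

def components_ending_in_dot(rom_name):
    """Return the folder components of a rom path that end with '.'."""
    parts = re.split(r'[\\/]', rom_name)
    # Every part except the last is a folder; "." and ".." themselves are skipped.
    return [p for p in parts[:-1] if p.endswith('.') and p.strip('.')]

print(components_ending_in_dot("AT_11/ADVERTS/ADVERTS2./42BBS.GIF"))
```

Only folder components are checked, so a file name that happens to end in a dot would not be reported by this sketch.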

      
I'm not sure if these are really problems, but I understand the naming convention is not to use non-English characters.

Texas Instruments TI-83 - Applications (TOSEC-v2011-03-24_CM)
   name "MOD (2003)(Martins, Ilidio)"
   rom ( name "mod\Links\Ilídio Home Page.url" size 122 crc 381b0c32 md5 80c8ec1318392e2663d9312b7ef87f00 sha1 1b762c0c27b2004d5c3fe722faa86370d32f2ae5 )

   name "NotePad v0.4 (2002)(Bergstrom, Erik)(beta)[req Ion]"
   rom ( name läsmig.txt size 697 crc 9308d6e7 md5 fd2e2a5c309a7e16f7e52907f725f5f8 sha1 44032fff58d7b6b66b9e887093b5c05a6351c768 )
   
   name "PicAsm v1.4 (2002)(Bergstrom, Erik)"
   rom ( name läsmig.txt size 599 crc 804e4d1d md5 5d53af98b099205fb59206227c5b5b21 sha1 a47342063804cfbaa1541851d15141e27c279e83 )

   name "ZText v1.0 (2002)(Bergstrom, Erik)"
   rom ( name Läsmig.txt size 1219 crc 89230ddf md5 7389d97371235b66443fe79472a4df82 sha1 b15ab1bd053293732751ecff79381577609026ec )

Texas Instruments TI-83 - Games (TOSEC-v2011-02-25_CM)
   name "Asteroid v2.2 (2004)(Bonomi, Hadrien)(TI-83-TI-83+)(fr)"
   rom ( name "Readme ù lire.txt" size 2290 crc afb32d06 md5 64ef9640c5afc1d20e47ad11784afe35 )

   name "Pokemon v1.6 (2000)(Peterson, Wade)"
   rom ( name "Pokémon v1.6.txt" size 4524 crc 058f7e6e md5 26f90e90c5915cdfc9e30389d9b2ea98 )


Note: läsmig is Swedish for readme!
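
Names like the ones above can be picked out with a simple check for anything outside 7-bit ASCII. This is only a sketch: it assumes the DAT text has already been decoded correctly into a Python string, and the function name is hypothetical.

```python
def non_ascii_chars(name):
    """Return the characters in name outside 7-bit ASCII, in order, without repeats."""
    seen = []
    for ch in name:
        if ord(ch) > 127 and ch not in seen:
            seen.append(ch)
    return seen

# "\u00e9" is é and "\u00e4" is ä, written as escapes so the example
# does not depend on how this page itself is encoded.
for name in ["Pok\u00e9mon v1.6.txt", "l\u00e4smig.txt", "ATIMES.GIF"]:
    print(ascii(name), ascii(non_ascii_chars(name)))
```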

I hope this helps.
Keep up the fantastic work.
nomisholman
« Last Edit: April 22, 2011, 01:36:32 AM by Cassiel »



Offline Cassiel

  • Administrator
  • Hero Member
  • *****
  • Posts: 1450
Re: Space character at end & non-english characters
« Reply #1 on: April 22, 2011, 01:42:29 AM »
These are some good (and obscure, I like obscure!) errors you've found.

Part 1 (spaces): good spot, we'll get these corrected.
Part 2 (periods): more complicated (since these are unusual, so-called 'multi-part' sets), although I suspect these are correct.
Part 3 (high ASCII): technically you are correct, these should only be low ASCII. This is a hot topic of discussion atm, and hopefully we will soon have a more modern solution for these DATs, including a list of accepted characters. Watch this space!  :)

Offline Aral

  • TOSEC Member
  • Sr. Member
  • ****
  • Posts: 409
Re: Space character at end & non-english characters
« Reply #2 on: July 10, 2011, 02:38:49 PM »

OK, next: the "Texas TI series" (T80, T81, T82, T83) has 67 pairs of zips that are, externally, byte-for-byte identical; only the zip name varies.
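
One way to find such pairs is to hash every zip as a whole file and group the names by digest. The sketch below is an illustration, not the method actually used for the count above; the function name and the flat-folder layout are assumptions.

```python
import hashlib
from collections import defaultdict
from pathlib import Path

def identical_groups(folder):
    """Group the .zip files directly inside folder by SHA-1 of their full contents."""
    by_digest = defaultdict(list)
    for path in sorted(Path(folder).glob("*.zip")):
        digest = hashlib.sha1(path.read_bytes()).hexdigest()
        by_digest[digest].append(path.name)
    # Keep only digests shared by two or more files.
    return [names for names in by_digest.values() if len(names) > 1]
```

Calling `identical_groups("some/folder")` returns a list of lists, one per set of identical zips. It compares the zip files themselves, so it says nothing about duplicates inside the archives.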

Let's not talk about the internal dupes... this series is messy and
really needs a fan to go through and spring clean it.

LOL

Offline PandMonium

  • Administrator
  • Hero Member
  • *****
  • Posts: 1315
Re: Space character at end & non-english characters
« Reply #3 on: July 11, 2011, 04:10:09 PM »
Hey oxyandy,

All help is welcome, so feel free to post/attach a list of dupes/triples and any other suggestions you might have :)

I've also just read the news about CMP + utf8 wip, if anyone is interested check it here.

Offline Cassiel

  • Administrator
  • Hero Member
  • *****
  • Posts: 1450
Re: Space character at end & non-english characters
« Reply #4 on: July 14, 2011, 10:26:42 AM »
Quote from: PandMonium
I've also just read the news about CMP + utf8 wip, if anyone is interested check it here.

Yup..... sometimes bribery/sexual favors/plain old bugging continuously does actually work....  :)

Offline TKaos

  • Global Moderator
  • Hero Member
  • *****
  • Posts: 533
Re: Space character at end & non-english characters
« Reply #5 on: July 14, 2011, 02:54:03 PM »
Hello there,

Please keep in mind that those DATs were made this year by Diabol; he had good reasons to keep all the folders and zips as they were.

So don't be surprised if he's a bit upset. I'd be kinda pissed if someone else touched DATs that I'm currently working on, though I don't know how it is with Diabol.

Of course I appreciate your work and interest in those DATs; this is just a quick note that it's better if the maintainer of the DATs gets a message, either here in a thread or by PM, so it can be discussed before the DATs get changed.

As for myself, I still think that Diabol should write here and let us know whether those changes are correct.

Offline Cassiel

  • Administrator
  • Hero Member
  • *****
  • Posts: 1450
Re: Space character at end & non-english characters
« Reply #6 on: August 11, 2011, 03:10:39 PM »
Regarding ClrMamePro development: Diaboł PM'ed me about this, which reminded me I hadn't posted anything publicly.

Roman has made great strides in modernising the way CMP treats characters and archives, and now has a near fully functioning Unicode/UTF-8 compliant app. He has also updated the internal zip lib and removed a number of obsolete options (like the ancient oem-ansi conversion option).

Moving forward, everything in CMP will be Unicode by default.
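
To illustrate why the code page matters for names like läsmig.txt, here is a small Python sketch. It does not show how CMP handles names internally; cp437 and cp1252 are simply assumed here as typical "OEM" and "ANSI" code pages.

```python
name = "l\u00e4smig.txt"  # "läsmig.txt", written with an escape for ä

oem_bytes = name.encode("cp437")      # OEM code page: ä is the single byte 0x84
misread = oem_bytes.decode("cp1252")  # the same bytes read as an "ANSI" code page
utf8_bytes = name.encode("utf-8")     # UTF-8: ä becomes the two bytes 0xC3 0xA4

print(oem_bytes)
print(ascii(misread))
print(utf8_bytes)
```

The same stored bytes give a different character depending on which code page is assumed, which is the ambiguity a Unicode-only approach is meant to avoid.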

Roman has been sharing WIP beta binaries with me to test, though sadly I've been too up to my eyes in work to help much with testing (other beta testers have, though). Hopefully this will change in the next few days as I find some time to dedicate to this, and to checking WIP DATs!

Full details here: http://www.emulab.it/forum/index.php?topic=514.0