I've a lot of work to end before leaving for a week so my answer will still be brief.
What is TOSEC and how is it useful to other people?An initial idea of what TOSEC is can be obtained by reading wikipedia entry and the small description in the site (
http://www.tosecdev.org/index.php/the-project ).
Basically TOSEC began as a software cataloging initiative to identify each software image (rom files) that floats around. To this end, setfiles (usually this means the zip (container) that contains the rom (or roms) file are renamed according to our naming convention (
http://www.tosecdev.org/index.php/tosec-naming-convention ). This naming convention consists of a set of rules to organize information such as software title, date, publisher and many other flags.
In addition to this, datfiles (the .dat you see) are created to make it possible to identify the files. We have hundreds of datfiles which normally follow the organization: <company name> <system name> <categories>{1+}. An example of this is for instance the datfile: "Commodore Amiga - Applications - Public Domain - [ADF]", which contains information about public domain applications, in the ADF file format for commodore amiga.
The dats contain entries about the set/rom files (file name and size) together with hashes (a string/code) that are used to identify each file (they can be seen as an "unique" code (of 8, 32 or 40 chars depending on the hash) is calculated from the file content.
Summing this all up, these dats containing the information we gather can be used with specific tools (called rom managers - examples are clrmame pro, romvault and romulus) to automatically identify and rename files. Imagine you have a folder full of roms you don't know exactly what they are, use the tool and the datfiles and the files are organized and renamed properly. This helps people knowing exactly which software do they own, for instance if the rom they have is broken (was dumped badly) - this means we don't catalog only perfect software but hacks and bad dumps that are common so anyone can identify them too.
How can someone contribute?Most of our members' work is focused in identifying and renaming roms. Since TOSEC already catalogs lots of it (especially in the main branch), we have been trying to improve the quality of data by reorganizing the dats structure and properly rename the existing sets, since many miss a lot of information. In addition to this, new sets are also added to the existent dats and and also "new" (doesn't necessarily means more recent) systems and dats are created. We have a few more ideas and details but this is the main idea.
So, when members have time to contribute, they go through their dats (each member works on a specific group of dats or systems) and reinspects the existing sets for errors and missing information (most of the time by launching the sets in emulators). The same is done with new roms that exist but have not been added to TOSEC. After this, new dats are generated and released after a while.
People without the time or experience may still contribute in several forms - you may see some examples around the forum, not everyone wants to rename things or has the time to make it frequently.
Some possible contributions are:
- submitting new / interesting files that are yet to be identified
- pointing out some errors that they found out in some datfiles / rom names (an example of this is amiga contributions at EAB forum and Kodoichi around here).
- fixing some dats that are not currently maintained or contributing with small fixes to some sets.
Just a small note about the two other branches:
- TOSEC-ISO is a side project with the goal of preserving / dumping optical media discs (CDs, DVDs, and so on), which then are added to dats (e.g. dats for Dreamcast discs). One of the main differences in disc based dats is that they contain only original media dumps.
- TOSEC-PIX is a similar project to TOSEC but instead of software, the goal is catalog scans of documents (manuals, magazines and so on). Aral is the guy behind it.
That's it, hopefully this sounded clear enough. Feel free to ask more.