I wrote a quick and dirty app that'll go through each XML grabbing the game name and category which I then output to a txt file
So now I have a txt file for every XML
Now is were the fun begins
I'm not really too sure what to do now... my initial thought was to merge all the txt files into 1 big list of games and categories, doing this would mean that even if a game wasn't covered in AllGames for 1 system it may been in their database for another system and if its the same game name it should be the same category, what do you think?
I could then write a tool similar to my FuzzyTextMatch to match up the names in the reference file with the names in the DB's
This seemed all well and good but the txt file, after removing duplicates, has 26000 entries
I'm now wondering if maybe I should write something to grab all the names from all the databases and run that through FuzzyTextMatch with the 1 big list of games and categories, I'm sure will take quite a long time but at least we wouldn't have to do it for every mdb.
I could divide up the results so that a few of us can check them and finally have another tool that will then take the confirmed results and make the changes to the MDBs for us?
How does this sound Nologic?
I haven't bothered uploaded the individual txt files as I don't see them being of any use to anyone so there's no point in wasting Tom's bandwidth but if you want them PM me and I'll send you them
Stu















