Welcome to Emulationworld

Forum Index | FAQ | New User | Login | Search

Make a New PostPrevious ThreadView All ThreadsNext ThreadShow in Flat Mode*

Subjecttext pattern recognition Reply to this message
Posted bynewsdee
Posted on10/06/04 06:37 PM

Some of you may know this... I'd like to write a program that will match up a description with a specific code. I'm trying to find out the kind of algorithms and techniques that I should research.

For example: I have a field with a description in uppercase, let's say "100 PAGE BOOK WITH RED COVER" and "3 1/2 INCH HARD DISK DRIVE UNIT". Now let's imagine that the first would match a category with code 12345 and the second a category with code 98765.

This is very simple in theory (switch/case and a bit of fuzzy logic to handle misspellings), but in practice I have a lot of codes to map to and I could have any kind of description (in a fixed-width field though). Now of course I don't expect there is any way to yield good results at every time, but I'm trying to figure out where to spend research time on (what area).

The goal of this is to parse information on a huge DB to a simplified one, so a search frontend only has to display categories. This could be done by hand but the number of entries make it extremely long, and I expect this would make the processing of this information much much faster (i.e. days => minutes).

Any ideas?

[download a life]

Entire Thread
Subject  Posted byPosted On
.text pattern recognition  newsdee10/06/04 06:37 PM
.*Re: text pattern recognition  Terry Bogard10/07/04 05:49 AM
..*Re: text pattern recognition  smf10/07/04 03:11 PM
...*thanks both!  newsdee10/07/04 04:07 PM
....*Re: thanks both!  smf10/08/04 03:13 AM
.....*Re: mapping codes  newsdee10/09/04 11:36 AM
......*Re: mapping codes  Videoman10/16/04 03:54 PM