Thursday, 13 November 2008

Text Data Mining vs. Information Retrieval / information, Access

Although only one of many factors, a driving force behind the growth of text mining has been the Web (Hastie, Tibshirani & Friedman, 2001). The growth of Internet commerce has created large repositories of documents, customer information, records and other information. The increasing need to record business and personal transactions and the growing volumes of logging within systems has provided an increasing amount of unstructured content. With the millions of new documents being produced each every year by the average company, knowledge discovery for forensic analysis is increasingly becoming reliant on text mining operations.

1. Hastie T., Tibshirani R. & Friedman J. (2001) “The Elements of Statistical Learning: Data Mining, Inference and Prediction”, Springer-Verlag

1 comment:

Manya Mayes said...

I consider criminal intelligence/forensic science to be extremely interesting and relevant text mining applications. Text mining and information retrieval techniques are also complementary and will be more closely aligned in the future. I agree with your comments. At SAS, we have seen a big rise in federal and state customers using text mining for criminal intelligence.

Manya Mayes