Designing a value based niche search engine using evolutionary strategies

Sourav Sengupta, Bernard J. Jansen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The advent of e-commerce and corporate intranets has led to the growth of organizational repositories containing large, fragmented, and unstructured document collections. Though it is difficult to retrieve relevant documents from such collections, it is relatively less cumbersome to define categories broadly classifying the information contained in the collection. Such categories lend value to the information contained in the collection. This research addresses the issue of improving retrieval accuracy of search engines that retrieve documents from organizational repositories using a value based approach. We test an evolutionary algorithm approach on a document collection. The precision of the search algorithm improved from 40% in generation 1 of the algorithm to nearly 90% in generation 10,000.

Original languageEnglish
Title of host publicationProceedings ITCC 2005 - International Conference on Information Technology
Subtitle of host publicationCoding and Computing
EditorsH. Selvaraj, P.K. Srimani
Pages800-805
Number of pages6
Publication statusPublished - 21 Sep 2005
EventITCC 2005 - International Conference on Information Technology: Coding and Computing - Las Vegas, NV, United States
Duration: 4 Apr 20056 Apr 2005

Publication series

NameInternational Conference on Information Technology: Coding and Computing, ITCC
Volume1

Conference

ConferenceITCC 2005 - International Conference on Information Technology: Coding and Computing
CountryUnited States
CityLas Vegas, NV
Period4/4/056/4/05

    Fingerprint

ASJC Scopus subject areas

  • Engineering(all)

Cite this

Sengupta, S., & Jansen, B. J. (2005). Designing a value based niche search engine using evolutionary strategies. In H. Selvaraj, & P. K. Srimani (Eds.), Proceedings ITCC 2005 - International Conference on Information Technology: Coding and Computing (pp. 800-805). (International Conference on Information Technology: Coding and Computing, ITCC; Vol. 1).