Designing a value based niche search engine using evolutionary strategies

Sourav Sengupta, Bernard Jansen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The advent of e-commerce and corporate intranets has led to the growth of organizational repositories containing large, fragmented, and unstructured document collections. Though it is difficult to retrieve relevant documents from such collections, it is relatively less cumbersome to define categories broadly classifying the information contained in the collection. Such categories lend value to the information contained in the collection. This research addresses the issue of improving retrieval accuracy of search engines that retrieve documents from organizational repositories using a value based approach. We test an evolutionary algorithm approach on a document collection. The precision of the search algorithm improved from 40% in generation 1 of the algorithm to nearly 90% in generation 10,000.

Original languageEnglish
Title of host publicationInternational Conference on Information Technology: Coding and Computing, ITCC
EditorsH. Selvaraj, P.K. Srimani
Pages800-805
Number of pages6
Volume1
Publication statusPublished - 2005
Externally publishedYes
EventITCC 2005 - International Conference on Information Technology: Coding and Computing - Las Vegas, NV
Duration: 4 Apr 20056 Apr 2005

Other

OtherITCC 2005 - International Conference on Information Technology: Coding and Computing
CityLas Vegas, NV
Period4/4/056/4/05

Fingerprint

Search engines
Intranets
Evolutionary algorithms

ASJC Scopus subject areas

  • Engineering(all)

Cite this

Sengupta, S., & Jansen, B. (2005). Designing a value based niche search engine using evolutionary strategies. In H. Selvaraj, & P. K. Srimani (Eds.), International Conference on Information Technology: Coding and Computing, ITCC (Vol. 1, pp. 800-805)

Designing a value based niche search engine using evolutionary strategies. / Sengupta, Sourav; Jansen, Bernard.

International Conference on Information Technology: Coding and Computing, ITCC. ed. / H. Selvaraj; P.K. Srimani. Vol. 1 2005. p. 800-805.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Sengupta, S & Jansen, B 2005, Designing a value based niche search engine using evolutionary strategies. in H Selvaraj & PK Srimani (eds), International Conference on Information Technology: Coding and Computing, ITCC. vol. 1, pp. 800-805, ITCC 2005 - International Conference on Information Technology: Coding and Computing, Las Vegas, NV, 4/4/05.
Sengupta S, Jansen B. Designing a value based niche search engine using evolutionary strategies. In Selvaraj H, Srimani PK, editors, International Conference on Information Technology: Coding and Computing, ITCC. Vol. 1. 2005. p. 800-805
Sengupta, Sourav ; Jansen, Bernard. / Designing a value based niche search engine using evolutionary strategies. International Conference on Information Technology: Coding and Computing, ITCC. editor / H. Selvaraj ; P.K. Srimani. Vol. 1 2005. pp. 800-805
@inproceedings{27143c72aade4303a1a4f3d19d0970b1,
title = "Designing a value based niche search engine using evolutionary strategies",
abstract = "The advent of e-commerce and corporate intranets has led to the growth of organizational repositories containing large, fragmented, and unstructured document collections. Though it is difficult to retrieve relevant documents from such collections, it is relatively less cumbersome to define categories broadly classifying the information contained in the collection. Such categories lend value to the information contained in the collection. This research addresses the issue of improving retrieval accuracy of search engines that retrieve documents from organizational repositories using a value based approach. We test an evolutionary algorithm approach on a document collection. The precision of the search algorithm improved from 40{\%} in generation 1 of the algorithm to nearly 90{\%} in generation 10,000.",
author = "Sourav Sengupta and Bernard Jansen",
year = "2005",
language = "English",
isbn = "0769523153",
volume = "1",
pages = "800--805",
editor = "H. Selvaraj and P.K. Srimani",
booktitle = "International Conference on Information Technology: Coding and Computing, ITCC",

}

TY - GEN

T1 - Designing a value based niche search engine using evolutionary strategies

AU - Sengupta, Sourav

AU - Jansen, Bernard

PY - 2005

Y1 - 2005

N2 - The advent of e-commerce and corporate intranets has led to the growth of organizational repositories containing large, fragmented, and unstructured document collections. Though it is difficult to retrieve relevant documents from such collections, it is relatively less cumbersome to define categories broadly classifying the information contained in the collection. Such categories lend value to the information contained in the collection. This research addresses the issue of improving retrieval accuracy of search engines that retrieve documents from organizational repositories using a value based approach. We test an evolutionary algorithm approach on a document collection. The precision of the search algorithm improved from 40% in generation 1 of the algorithm to nearly 90% in generation 10,000.

AB - The advent of e-commerce and corporate intranets has led to the growth of organizational repositories containing large, fragmented, and unstructured document collections. Though it is difficult to retrieve relevant documents from such collections, it is relatively less cumbersome to define categories broadly classifying the information contained in the collection. Such categories lend value to the information contained in the collection. This research addresses the issue of improving retrieval accuracy of search engines that retrieve documents from organizational repositories using a value based approach. We test an evolutionary algorithm approach on a document collection. The precision of the search algorithm improved from 40% in generation 1 of the algorithm to nearly 90% in generation 10,000.

UR - http://www.scopus.com/inward/record.url?scp=24744467371&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=24744467371&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:24744467371

SN - 0769523153

SN - 9780769523156

VL - 1

SP - 800

EP - 805

BT - International Conference on Information Technology: Coding and Computing, ITCC

A2 - Selvaraj, H.

A2 - Srimani, P.K.

ER -