Computer Science Department Faculty Publication Series

Evaluating the Performance of Distributed Architectures for Information Retrieval using a Variety of Workloads

Brendon Cahoon, University of Massachusetts - Amherst
Kathryn S. McKinley, University of Massachusetts - Amherst

Publication Date

1997

Abstract

Information explosion across the Internet and elsewhere offers access to an increasing number of document collections. In order for users to effectively access these collections, information retrieval (IR) systems must provide coordinated, concurrent, and distributed access. In this paper, we describe a fully functional distributed IR system based on the Inquery unified IR system. To refine this prototype, we implement a flexible simulation model which we use to present a series of experiments using a variety of workloads that measure system performance. We vary numerous system parameters such as the number of users, document collections, terms per query, query term frequency, think time, answers returned, and workload. Based on our initial results, we recommend simple changes to the prototype and evaluate the changes using the simulator. Because of the significant resource demands of information retrieval, it is not difficult to generate workloads that overwhelm system resources regardless of the architecture. However under some realistic workloads, we demonstrate system organizations for which response time gracefully degrades as the workload increases and performance scales with the number of processors. This scalable architecture includes a surprisingly small number of brokers through which a large number of clients and servers communicate.

Comments

This paper was harvested from CiteSeer

Recommended Citation

Cahoon, Brendon and McKinley, Kathryn S., "Evaluating the Performance of Distributed Architectures for Information Retrieval using a Variety of Workloads" (1997). Computer Science Department Faculty Publication Series. 50.
Retrieved from https://scholarworks.umass.edu/cs_faculty_pubs/50

Download

Included in

Computer Sciences Commons

COinS

ScholarWorks@UMass Amherst

Computer Science Department Faculty Publication Series

Evaluating the Performance of Distributed Architectures for Information Retrieval using a Variety of Workloads

Publication Date

Abstract

Comments

Recommended Citation

Included in

Browse

Author Corner

Links

ScholarWorks@UMass Amherst

Computer Science Department Faculty Publication Series

Evaluating the Performance of Distributed Architectures for Information Retrieval using a Variety of Workloads

Authors

Publication Date

Abstract

Comments

Recommended Citation

Included in

Share

Browse

Author Corner

Links