Off-campus UMass Amherst users: To download dissertations, please use the following link to log into our proxy server with your UMass Amherst user name and password.

Non-UMass Amherst users, please click the view more button below to purchase a copy of this dissertation from Proquest.

(Some titles may also be available free of charge in our Open Access Dissertation Collection, so please check there first.)

Solving the word mismatch problem through automatic text analysis

Jinxi Xu, University of Massachusetts Amherst

Abstract

Information Retrieval (IR) is concerned with locating documents that are relevant for a user's information need or query from a large collection of documents. A fundamental problem for information retrieval is word mismatch. A query is usually a short and incomplete description of the underlying information need. The users of IR systems and the authors of the documents often use different words to refer to the same concepts. This thesis addresses the word mismatch problem through automatic text analysis. We investigate two text analysis techniques, corpus analysis and local context analysis, and apply them in two domains of word mismatch, stemming and general query expansion. Experimental results show that these techniques can result in more effective retrieval.

Subject Area

Computer science|Information Systems

Recommended Citation

Xu, Jinxi, "Solving the word mismatch problem through automatic text analysis" (1997). Doctoral Dissertations Available from Proquest. AAI9737596.
https://scholarworks.umass.edu/dissertations/AAI9737596

Share

COinS