Doctoral Dissertations

Off-campus UMass Amherst users: To download campus access dissertations, please use the following link to log into our proxy server with your UMass Amherst user name and password.

Non-UMass Amherst users: Please talk to your librarian about requesting this dissertation through interlibrary loan.

Dissertations that have an embargo placed on them will not be available to anyone until the embargo expires.

Epistemological Databases for Probabilistic Knowledge Base Construction

Michael Louis Wick, University of Massachusetts AmherstFollow

Author ORCID Identifier

N/A

AccessType

Open Access Dissertation

Document Type

dissertation

Degree Name

Doctor of Philosophy (PhD)

Degree Program

Computer Science

Year Degree Awarded

2015

First Advisor

Andrew McCallum

Subject Categories

Artificial Intelligence and Robotics | Databases and Information Systems | Numerical Analysis and Scientific Computing | Other Computer Sciences

Abstract

Knowledge bases (KB) facilitate real world decision making by providing access to structured relational information that enables pattern discovery and semantic queries. Although there is a large amount of data available for populating a KB; the data must first be gathered and assembled. Traditionally, this integration is performed automatically by storing the output of an information extraction pipeline directly into a database as if this prediction were the ``truth.'' However, the resulting KB is often not reliable because (a) errors accumulate in the integration pipeline, and (b) they persist in the KB even after new information arrives that could rectify these errors. We envision a paradigm-shift in KB construction for addressing these concerns that we term an ``epistemological'' database. In epistemological databases the existence and properties of entities are not directly input into the DB; they are instead determined by inference on raw evidence input into the DB. This shift in thinking is important because it allows inference to revisit previous conclusions and retroactively correct errors as new evidence arrives. Evidence is abundant and in steady supply from web spiders, semantic web ontologies, external databases, and even groups of enthusiastic human editors. As this evidence continues to accumulate and inference continues to run in the background, the quality of the knowledge base continues to improve. In this dissertation we develop the machine learning components necessary to achieve epistemological knowledge base construction at scale with key contributions in modeling, inference and learning.

DOI

https://doi.org/10.7275/6461312.0

Recommended Citation

Wick, Michael Louis, "Epistemological Databases for Probabilistic Knowledge Base Construction" (2015). Doctoral Dissertations. 334.
https://doi.org/10.7275/6461312.0 https://scholarworks.umass.edu/dissertations_2/334

Download

Included in

Artificial Intelligence and Robotics Commons, Databases and Information Systems Commons, Numerical Analysis and Scientific Computing Commons, Other Computer Sciences Commons

COinS

ScholarWorks@UMass Amherst

Doctoral Dissertations

Epistemological Databases for Probabilistic Knowledge Base Construction

Author ORCID Identifier

AccessType

Document Type

Degree Name

Degree Program

Year Degree Awarded

First Advisor

Subject Categories

Abstract

DOI

Recommended Citation

Included in

Browse

Author Corner

Links

ScholarWorks@UMass Amherst

Doctoral Dissertations

Epistemological Databases for Probabilistic Knowledge Base Construction

Author

Author ORCID Identifier

AccessType

Document Type

Degree Name

Degree Program

Year Degree Awarded

First Advisor

Subject Categories

Abstract

DOI

Recommended Citation

Included in

Share

Browse

Author Corner

Links