Item Details

Print View

Determining Stopping Criteria in the Generation of Web-Derived Langua Ge Models

Monroe, Gary; Mikesell, David; French, James
Format
Report
Author
Monroe, Gary
Mikesell, David
French, James
Abstract
In this work, we present a small-scale evaluation of two query-based sampling techniques for building language models, using a database comprised of world-wide web documents. We propose a metric by which it is possible to determine when to cease sampling a given web database, and we compare this new metric to other metrics that have been used in previous work to determine the fidelity of sampled language models.
Language
English
Date Received
2012-10-29
Published
University of Virginia, Department of Computer Science, 2000
Published Date
2000
Rights
All rights reserved (no additional license for public reuse)
Collection
Libra Open Repository

Availability

Access Online