Natural Language Processing

I ‘ve just started reading this article . I think the base of searching is understanding the queries and the content in which you are looking for. The article gives an insight view to a Corpus-based aproach to semantic interpretation in NLP , it’s worth reading it.


Definition of a corpus:

from Corpus Linguistics
A collection of texts, spoken and/or written, which has been designed and compiled based on a set of clearly defined criteria.

CORPUS[13c: from Latin corpus body. The plural is usually corpora].

A collection of texts, especially if complete and self-contained: the corpus of Anglo-Saxon verse. Plural also corpuses. In linguistics and lexicography, a body of texts, utterances, or other specimens considered more or less representative of a language, and usually stored as an electronic database. Currently, computer corpora may store many millions of running words, whose features can be analysed by means of tagging (the addition of identifying and classifying tags to words and other formations) and the use of concordancing programs. Corpus linguistics studies data in any such corpus.


(The Oxford Companion to the English Language, ed. McArthur & McArthur, 1992)

References:
Links to anything & everything to do with the use of language corpora.

2 Responses to “Natural Language Processing”

  1. Anne Baker says:

    I found your blog on Google. I’ve bookmarked it and will watch out for your next blog post.

  2. I kind of disagree, but I do see your point.

Leave a Reply