Can Corpus Pattern Analysis Be Used in NLP?

Logo poskytovatele
Logo poskytovatele

Varování

Publikace nespadá pod Ekonomicko-správní fakultu, ale pod Fakultu informatiky. Oficiální stránka publikace je na webu muni.cz.
Název česky Může být Corpus Pattern Analysis použita v ZPJ?
Autoři

CINKOVÁ Silvie HOLUB Martin RYCHLÝ Pavel SMEJKALOVÁ Lenka ŠINDLEROVÁ Jana

Rok publikování 2010
Druh Článek ve sborníku
Konference Text, Speech and Dialogue, 2010
Fakulta / Pracoviště MU

Fakulta informatiky

Citace
www http://www.springerlink.com/index/15G111T3P76196R5.pdf
Doi http://dx.doi.org/10.1007/978-3-642-15760-8_10
Obor Jazykověda
Klíčová slova corpus; nlp; corpus pattern analysis
Popis Corpus Pattern Analysis (CPA), coined and implemented by Hanks as the Pattern Dictionary of English Verbs (PDEV), appears to be the only deliberate and consistent implementation of Sinclair's concept of Lexical Item. In his theoretical inquiries Hanks hypothesizes that the pattern repository produced by CPA can also support the word sense disambiguation task. Although more than 670 verb entries have already been compiled in PDEV, no systematic evaluation of this ambitious project has been reported yet. Assuming that the Sinclairian concept of the Lexical Item is correct, we started to closely examine PDEV with its possible NLP application in mind. Our experiments presented in this paper have been performed on a pilot sample of English verbs to provide a first reliable view on whether humans can agree in assigning PDEV patterns to verbs in a corpus. As a conclusion we suggest procedures for future development of PDEV.
Související projekty:

Používáte starou verzi internetového prohlížeče. Doporučujeme aktualizovat Váš prohlížeč na nejnovější verzi.