Skip to main navigation Skip to search Skip to main content

中文CCG樹庫的構建

Translated title of the contribution: Construction of Chinese CCGbank

Research output: Journal Publications and ReviewsRGC 22 - Publication in policy or professional journal

Abstract

Combinatory Categorial Grammar(CCG) is a type-driven lexicalized grammar formalism with a transparent interface between syntax and semantics,which is essential to in-depth text processing.To apply CCG to real texts,however,a large scale lexicon needs to be constructed as indispensable support,demanding a great deal of manpower and resources.An effective way to alleviate this problem is to transform an existing treebank into a CCGbank.This paper presents an approach to deriving a Chinese CCGbank from Tsinghua Chinese Treebank,with the aid of a number of predefined verb sub-categorization and Chinese sentence patterns.The resulted CCGbank includes 32 737 sentences,of over 350k word tokens.The effectiveness of this approach is confirmed by an evaluation with manually annotated references and a comparison with reported works on the construction of several CCGbanks.
Translated title of the contributionConstruction of Chinese CCGbank
Original languageChinese (Traditional)
Pages (from-to)3 - 8
Journal中文信息学报
Volume26
Issue number3
Publication statusPublished - 2012

Fingerprint

Dive into the research topics of 'Construction of Chinese CCGbank'. Together they form a unique fingerprint.

Cite this