曲靖师范学院学报 ›› 2017, Vol. 36 ›› Issue (5): 42-46.

• 语言学研究 • 上一篇    下一篇

中文专利文献名词性短语中的并列结构的标注和分析

刘小蝶   

  1. 北京联合大学 国际交流学院,北京 100101
  • 收稿日期:2017-05-23 出版日期:2017-09-26
  • 作者简介:刘小蝶,北京联合大学国际交流学院讲师,博士,主要从事汉语语言学及应用语言学研究。
  • 基金资助:
    国家语委“十二五”科研规划项目(YB125-124);国家高技术研究发展计划(863计划)(2012AA011104)。

Annotating and Analyzing of Coordination with Overt Conjunctions in Nominal Groups Based on Chinese Patent Literature

Liu Xiaodie   

  1. College of International Education, Beijing Union University, Beijing 100101, China
  • Received:2017-05-23 Published:2017-09-26

摘要: 在HNC理论的指导下,在30篇共3613句的中文专利文献基础上,从数量、层级、语义类型、语义特征、干扰特征、结构特征、外部环境和位置特征等八个维度对中文专利文献名词性短语中并列结构进行语料标注,进而分析并列结构的分类及其分布情况,并在此基础上考察并总结并列结构的语义特征、结构特征和外部词特征,目的是辅助设计自动识别汉语名词性短语并列结构的策略、语言学规则和算法。

关键词: 语言学, 中文专利文献, 并列结构, 语义块, 语义特征

Abstract: Under the guidance of HNC theory, coordination with overt conjunctions (COC) in 3613 sentences of 30 articles in the Chinese patent literature is annotated in the eight aspects, namely number, level, semantic type, semantic feature, interference, structural feature, contextual words and boundary position. This paper counts and analyzes the types and distribution of COC, investigates semantic similarity, structural similarities and contextual information.Its aim is to design the strategies, algorithms and linguistic rules for automatically recognizing COC in nominal groups of Chinese patent literature.

Key words: Linguistics, Chinese patent literature, COC, semantic chunks, semantic features

中图分类号: