The potentials and limitations of modelling concept concreteness in computational semantic lexicons with dictionary defi

  • PDF / 625,074 Bytes
  • 13 Pages / 439.37 x 666.142 pts Page_size
  • 79 Downloads / 148 Views

DOWNLOAD

REPORT


The potentials and limitations of modelling concept concreteness in computational semantic lexicons with dictionary definitions Oi Yee Kwong

Published online: 18 April 2013  Springer Science+Business Media Dordrecht 2013

Abstract This paper explores the feasibility of modelling concept concreteness perceived by humans and representing it in computational semantic lexicons, addressing an issue at the crossroads of computational linguistics, lexicography, and psycholinguistics. The inherent distinction between concrete words and abstract words in psychology has relied mostly on subjective human ratings. This practice is hardly scalable and does not consider the effect of polysemy. In view of this, we attempt to obtain a measure of concreteness from dictionary definitions comparable to human judgement, capitalising on conventional lexicographic assumptions and the regularities exhibited in the surface structures of sense definitions. The structural pattern of a definition is analysed and scored on a 7-point scale of concreteness ratings. The definition scores turned out to be quite effective for a dichotomous distinction between concrete and abstract concepts and more consistent with human ratings for the former. Beyond the two-way distinction, however, the results were more variable. The study has thus revealed the potentials and limitations of our approach, suggesting that different defining styles probably reflect the describability of concepts, and describability alone may not be sufficient for differentiating the degree of concreteness. The range of definition patterns has to be reconsidered, in combination with other inseparable factors constituting our perception of concreteness, for better modelling on a finer scale of concreteness distinction to enrich semantic lexicons for natural language processing. Keywords Concept concreteness  Dictionary definitions  Semantic lexicons  Polysemy  Computational lexicography

O. Y. Kwong (&) Department of Chinese, Translation and Linguistics, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong e-mail: [email protected]

123

1150

O. Y. Kwong

1 Introduction Evidence from lexical decision tasks (e.g., Bleasdale 1987; Kroll and Merves 1986) and studies on children’s spoken and reading vocabulary (e.g., Yore and Ollila 1985), amongst others, tends to suggest that concrete concepts are often easier to learn and process than abstract concepts. Psychologists have also put forth various plausible explanations, such as the dual-coding theory (Paivio 1986) and the context availability theory (Schwanenflugel 1991). The distinction between concrete and abstract concepts is apparently natural and inherent, and plays an important role in human lexical processing. By analogy, if this common-sense phenomenon can be properly modelled in computational semantic lexicons, it should benefit natural language processing, especially those sub-tasks which draw heavily on lexico-semantic information. For example, coupled with the availability of characteristic linguist