In this paper we describe the construction of a new Japanese lexical resource: the Hinoki treebank. The treebank is built from dictionary definition sentences and uses an HPSG-based Japanese grammar to encode syntactic and semantic information. We show how the treebank can be used to extract thesaurus information from definition sentences in a language-neutral way using Minimal Recursion Semantics (MRS).
Two sets of word familiarity ratings were collected in different years (1995 and 2002) and in different regions of Japan (Kanto and Kinki) for a large number of Japanese words, to examine the reliability of familiarity ratings. The correlation between the two sets was extremely high (r = .958, N = 10,515). This suggests that familiarity ratings, at least for ordinary words found in a dictionary, are highly reliable and are not greatly affected by differences in year or region.
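The reliability check described above amounts to a Pearson correlation computed over the words that appear in both rating sets. The following is a minimal sketch of that computation, not the authors' procedure: the function name `pearson_r`, the dictionary layout, and the five example words with their ratings are all hypothetical and are not taken from the study's data.

```python
import math


def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length, at least 2 items")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products and sums of squares of the deviations from the means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when one sequence is constant")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical familiarity ratings, keyed by word (illustrative values only).
ratings_1995 = {"inu": 6.5, "neko": 6.4, "tsukue": 5.9, "gainen": 4.2, "rinri": 3.8}
ratings_2002 = {"inu": 6.6, "neko": 6.3, "tsukue": 6.0, "gainen": 4.5, "rinri": 3.6}

# Only words rated in both sets can be compared.
shared = sorted(set(ratings_1995) & set(ratings_2002))
r = pearson_r([ratings_1995[w] for w in shared],
              [ratings_2002[w] for w in shared])
print(f"r = {r:.3f}, N = {len(shared)}")
```

Restricting the comparison to shared words matters in practice, because two rating sets collected years apart will rarely cover exactly the same vocabulary.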