Karen A. Hamill scite author profile

Karen A. Hamill

2 Publications

10 Citation Statements Received

4 Citation Statements Given


Publications

The use of titles for automatic document classification

Hamill, Zamora. 1980. J. Am. Soc. Inf. Sci.

An experimental computer program has been developed to classify documents according to the 80 sections and five major section groupings of Chemical Abstracts (CA). The program uses pattern recognition techniques supplemented by heuristics. During the "training" phase, words from preclassified documents are selected, and the probability of occurrence of each word in each section of CA is computed and stored in a reference dictionary. The "classification" phase matches each word of a document title against the dictionary and assigns a section number to the document using weights derived from the probabilities in the dictionary. Heuristic techniques are used to normalize word variants such as plurals, past tenses, and gerunds in both the training phase and the classification phase. The dictionary lookup technique is supplemented by the analysis of chemical nomenclature terms into their component word roots to influence the section to which the documents are assigned. Program performance and human consistency have been evaluated by comparing the program results against the published sections of CA and by conducting an experiment with people experienced in the assignment of documents to CA sections. The program assigned approximately 78% of the documents to the correct major section groupings of CA and 67% to the correct sections or cross-references at a rate of 100 documents per second.
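The two phases described in this abstract can be illustrated with a minimal sketch. It is not the authors' program: the suffix rules, the weighting (the relative frequency of a word across sections, summed over the title), the function names, and the section labels `"organic"` and `"polymers"` are all assumptions made for illustration, and the chemical nomenclature analysis is omitted.

```python
from collections import Counter, defaultdict

def normalize(word):
    """Crudely strip gerund, past-tense, and plural endings (assumed rules)."""
    w = word.lower()
    for suffix in ("ing", "ed", "s"):
        if w.endswith(suffix) and not w.endswith("ss") and len(w) - len(suffix) >= 3:
            return w[: -len(suffix)]
    return w

def tokenize(title):
    """Split a title on whitespace and keep normalized alphabetic tokens."""
    return [normalize(t) for t in title.split() if t.isalpha()]

def train(labeled_titles):
    """Training phase: build a dictionary of word -> {section: relative frequency}."""
    counts = defaultdict(Counter)
    for title, section in labeled_titles:
        for word in tokenize(title):
            counts[word][section] += 1
    dictionary = {}
    for word, per_section in counts.items():
        total = sum(per_section.values())
        dictionary[word] = {s: c / total for s, c in per_section.items()}
    return dictionary

def classify(title, dictionary):
    """Classification phase: sum the per-section weights of each title word."""
    scores = Counter()
    for word in tokenize(title):
        for section, weight in dictionary.get(word, {}).items():
            scores[section] += weight
    if not scores:
        return None  # no title word was found in the dictionary
    return scores.most_common(1)[0][0]

# Hypothetical preclassified titles; the labels are not real CA section names.
EXAMPLES = [
    ("Synthesis of substituted pyridines", "organic"),
    ("Polymerization of styrene monomers", "polymers"),
    ("Catalytic synthesis of pyridine derivatives", "organic"),
    ("Styrene polymers and copolymers", "polymers"),
]

model = train(EXAMPLES)
print(classify("Pyridine synthesis", model))  # organic
```

Because normalization runs in both phases, "pyridines" in a training title and "Pyridine" in a new title map to the same dictionary entry, which is the role the abstract gives to its heuristics.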

Chemical Abstracts Service Chemical Registry System. 10. Registration of substances from pre-1965 indexes of Chemical Abstracts

Hamill, Nelson, Stouw, et al. 1988. J. Chem. Inf. Comput. Sci.

The Chemical Abstracts Service Chemical Registry System, operating since 1965, uniquely identifies chemical substances on the basis of molecular structure. Chemical Abstracts Service is now registering chemical substances cited in indexes to Chemical Abstracts prior to 1965. This effort will result in several hundred thousand additional chemical structures, along with their names, being available for online searching in the Registry File. Both the newly registered substances and those already on file are being linked to their pre-1965 citations in Chemical Abstracts in a new file called CAOLD. In this effort the printed Formula Index entries are converted to computer-readable form by using optical character recognition with the data subsequently processed with existing computer programs.

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Part of the Research Solutions Family.