This paper presents a methodology to capture bibliographic data from the verso of the title pages of documents. A survey has been undertaken to identify the syntactic and semantic features of bibliographic elements on the verso of title pages. These features include the font size, line numbers and appearence of certain string of characters. Emphasis is given to the study of "cataloguing-in-publication" data. The results of the survey are used to develop heuristics which can help in developing a program to automatically identify the various bibliogaphic data elements. The back of the title pages are scanned and stored as HTML pages using optical recognition software. The heuristics are then applied on the HTML pages. Few samples of input and the output generated are presented. Finally, the problems related to OCR and the heuristics are enumerated.
The concept of 'social tagging' has gained popularity nowadays due to the emergence of web 2.0 technologies. Those technologies led to the practice of associating metadata with digital resources among users through collaboratively or socially for self-information retrieval. Many researchers have opined that social tags can enhance the use of library collections. The present study was predominantly carried out to compare social tags collected from the LibraryThing website with Library of Congress Subject Heading (LCSH) descriptors collected from the Library of Congress Online Catalogue applied for thousand book titles in the field of Economics. The study also aimed to know whether social tags can be applied in the library database or not. The findings elucidate that users mostly use descriptors (47.39 %) as tags than expert's usage of tags (12.77 %) as descriptors. Spearman's correlation suggests that 75 per cent chance where tags and descriptors can be used simultaneously in overlapping terms. The Jaccard similarity coefficient identifies that users and experts use different terminologies to annotate the books. Users and experts use at least one common keyword for major book titles (908). Users mostly sought title based keywords but experts use mostly subject-based terminologies. The study further clarifies that social tags may be incorporated into the library databases but cannot replace LCSHs. The accessibility and usage of documents especially in the field of economics may be enhanced once the notion of social tags is incorporated with the library OPAC.
In developing countries like India, taxpayers’ money is utilized for research and development. The researchers conduct their research using public money and publish their research papers in commercial journals. Firstly, the researcher uses Government funds for research. Secondly, government funds are also spent on subscribing to high-cost journals. Also, many Indian academic institutions can not subscribe to reputed commercial journals due to a lack of funds. In other words, research output generated using public money is not accessible to all. OA journals can solve this problem smoothly. In this study, researchers analyze the trends in Open Access publications and Closed access publications by India's top research institutes, IITs. Researchers found that IIT Hyderabad (26%) published the highest number of open-access publications. Old established IITs' open-access publication figures are lower than newly set-up IITs. However, there is an increase in Open Access publications by IITs over the last decade.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.