IR systems consist of phases like document preprocessing, indexing, query expansion, query matching, ranking etc. The document preprocessing phase is the most important phase to parse the document and collect keywords. Relevance of overall IR system improves if main topics of document are perfectly identified during this phase. It is a known fact that Topics are mostly phrase based. Existing phrase search methods like ngrams or positional indexes are quite complex and also suffer from problems of inaccuracy, requirement of large storage space etc. Moreover, IR system like digital library may consist of eBooks on one or more subjects. So for phrase collection, one may have to use appropriate ontology to retrieve phrases or topics. This paper presents a new approach called AuTopicGen (Automatic Topic Generator) that automatically collects most relevant topics of eBooks from its contents and indexes using rule based positional patterns approach. From the collected topics, we create topic hierarchy that can work as light weight ontology to improve overall performance of information retrieval system especially for phrase based queries and to assist user with query recommendation. Further this will be useful as topic maps, mind maps, to improve user interface to help user navigate through topics, for categorization, query expansion and ranking algorithms. We have successfully implemented the approach for topics collection practically on eBooks and presented in this paper.
The Internet web has become popular tool to assist human for their information needs from web server. Due to increasing number of users for web access day by day, there is a need to analyze behavior of such user, in order to monitor and improve performance and throughput of website. Web usage mining is one of the data mining applications which deal with web log files and extract useful information from web. There are different phases are for web usage mining: Data preprocessing, discover pattern and pattern analysis. Among them data preprocessing is the most crucial phase of web usage mining because without good quality of data it is difficult to identify pattern of users behavior. This paper provides reviews of different data preprocessing methods like data collection, data cleaning, User identification, session identification and path completion which will be useful for the community to select one or combination of available techniques in order to carry out efficient preprocessing in order to obtain reliable data mining outcome.
training experience than her general house jobs. When the midwives got into trouble she set off on her bicycle with emergency bag and lamp-many of the houses had no electricity-and applied the forceps or whatever was necessary. Both of them commonly had to act as anaesthetist and obstetrician in a routine that sounds deceptively simple. Don two pairs of gloves, go to the head end and give a light chloroform anaesthetic, rush to the other to do a forceps delivery. Most breeches were delivered vaginally in those days. Innovations preceded the NHS In Oxford we were in a unique position though, for in 1938, £2m was given to create the Nuffield department of obstetrics, together with the corresponding departments of medicine, surgery, and anaesthetics. Before taking up my post as first assistant to the new department I spent three months in Vienna studying under Professor Fraenkel. At this time, in marked contrast to England and America, colposcopy was a routine procedure in the teaching hospitals in Vienna, Leipzig, and Berlin, and indeed in much of Europe, and after the war we imported a colposcope and started a colposcopy service. Gradually we took over the gynaecology from the general surgeons and built up our department. We also took over responsibility for the maternity home and built up outpatient clinics in all the outlying hospitals in the Oxford region, with a back up consultancy service for general practitioner obstetricians working in cottage hospitals as well. I was on call all the time, and it was rare to have an unbroken night. But such was the spirit in the hospital that no one resented the hours they put in. One of the innovations with the most impact was the weekly departmental meetings we started to discuss difficult cases and clinical, research, and training policies. Registrars from as far afield as London, Bristol, and Birmingham came for, amazingly enough, at this time such meetings were actively discouraged because they were "not conducive to good departmental discipline." We also, in 1938 on the advice of Professor Chassar Moir, set up a flying squad for it was not uncommon for women with serious obstetric complications to be dead on arrival by the time the ambulance had reached hospital from villages 30-40 miles away.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.