Purpose – The aim of this research was to examine the use of the data island method for creating metadata records based on DCXML, MARCXML, and MODS with indexability and visibility of element tag names in web search engines. Design/methodology/approach – A total of 600 metadata records were developed in two groups (300 HTML-based records in an experimental group with special structure embedded in the < pre> tag of HTML based on the data island method, and 300 XML-based records as the control group with the normal structure). These records were analyzed through an experimental approach. The records of these two groups were published on two independent websites, and were submitted to Google and Bing search engines. Findings – Findings show that all the tag names of the metadata records created based on the data island method relating to the experimental group indexed by Google and Bing were visible in the search results. But the tag names in the control group's metadata records were not indexed by the search engines. Accordingly it is possible to index and retrieve the metadata records by their tag name in the search engines. But the records of the control group are accessible by the element values only. The research suggests some patterns to the metadata creators and the end users for better indexing and retrieval. Originality/value – The research used the data island method for creating the metadata records, and deals with the indexability and visibility of the metadata element tag names for the first time.
Purpose -The purpose of this research was to assess and compare the indexing and ranking of XML-based content objects containing MARCXML and XML-based Dublin Core (DCXML) metadata elements by general search engines (Google and Yahoo!), in a comparative analytical study. Design/methodology/approach -One hundred XML content objects in two groups were analyzed: those with MARCXML elements (50 records) and those with DCXML (50 records) published on two web sites (www.dcmixml.islamicdoc.org and www.marcxml.islamicdoc.org).The web sites were then introduced to the Google and Yahoo! search engines. Findings -The indexing of metadata records and the difference between their indexing and ranking were examined using descriptive statistics and a non-parametric Mann-Whitney U test. The findings show that the visibility of content objects was possible by all their metadata elements. There was no significant difference between two groups' indexing, but a difference was observed in terms of ranking. Practical implications -The findings of this research can help search engine designers in the optimum use of metadata elements to improve their indexing and ranking process with the aim of increasing availability. The findings can also help web content object providers in the proper and efficient use of metadata systems. Originality/value -This is the first research to examine the interoperability between XML-based metadata and web search engines, and compares the MARC format and DCMI in a research approach.
The aim of the study was to find out whether discarding XML declarations and changing file extensions (i.e. .xml) improve the indexability and visibility of DCXML, MARCXML and MODS element tag names in Web search engines. Two groups of metadata records were included in an experimental study: an experimental group composed of 300 XML-based records without XML declarations and with file extensions according to the name of related metadata standards, and a control group composed of 300 XML-based records with the normal XML structure. These were analysed through an experimental approach. The two sets of records were published on two separate websites and then the sites were introduced to Google and Yahoo!. Findings showed that Google and Yahoo! indexed and retrieved all the tag names relating to the experimental group. However they did not index the tag names in the control group's records. Based on the findings, some patterns are suggested to metadata creators and Web search engine developers.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.