Market basket analysis, which is a method of discovering co-occurrence relationships, is widely used for the purposes of marketing research and e-commerce, mainly by supermarkets and online stores. Moving beyond the traditional notion of a market basket understood as a fixed list of products, the technique can be applied for data mining in other fields of research which do not involve traditional transactions and purchases made by customers. The following article describes theoretical aspects of market basket analysis with an illustrative application based on data from the National Census of Population and Housing 2011 with respect to marital status. This is the first application of market basket analysis to census data to be conducted in Poland, in which attributes of the market basket have been replaced with respondents' demographic characteristics. This approach makes it possible to identify relationships between legal (de jure) marital status and actual (de facto) marital status, taking into account other basic socio-demographic variables available in large datasets. Using the R software to generate choropleth maps classified by province as a method of visualizing association rules, it was possible to conduct a spatial analysis of the phenomenon of interest.
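To make the idea of replacing products with demographic characteristics concrete, the sketch below computes the three standard association-rule measures (support, confidence, lift) for one rule. It is only an illustration under stated assumptions: the six records are invented toy data, not census results, and the item labels and the helper names `support` and `rule_metrics` are hypothetical. The article itself reports using R, so this plain-Python version is not the authors' implementation.

```python
# Toy, made-up records (NOT census data): each "basket" is the set of
# attribute=value items describing one hypothetical respondent.
records = [
    {"legal=married", "actual=married", "sex=F"},
    {"legal=married", "actual=married", "sex=M"},
    {"legal=single", "actual=cohabiting", "sex=F"},
    {"legal=divorced", "actual=cohabiting", "sex=M"},
    {"legal=single", "actual=single", "sex=M"},
    {"legal=married", "actual=separated", "sex=F"},
]


def support(itemset, baskets):
    """Share of baskets that contain every item in `itemset`."""
    itemset = frozenset(itemset)
    return sum(itemset <= basket for basket in baskets) / len(baskets)


def rule_metrics(antecedent, consequent, baskets):
    """Support, confidence and lift of the rule antecedent -> consequent."""
    s_a = support(antecedent, baskets)
    s_c = support(consequent, baskets)
    s_ac = support(set(antecedent) | set(consequent), baskets)
    # confidence = P(consequent | antecedent); lift compares it with P(consequent)
    confidence = s_ac / s_a if s_a else 0.0
    lift = confidence / s_c if s_c else 0.0
    return {"support": s_ac, "confidence": confidence, "lift": lift}


# Rule: legal marital status "married" -> actual marital status "married"
metrics = rule_metrics({"legal=married"}, {"actual=married"}, records)
print(metrics)
```

In this toy set, a lift above 1 for the rule means that the consequent occurs more often among records matching the antecedent than in the dataset as a whole, which is the sense in which such rules can relate de jure and de facto marital status.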
Abstract: Market basket analysis, which has mostly been used in marketing research, has not so far been applied in surveys conducted by the Central Statistical Office. By moving away from the traditional understanding of the "shopping basket" and replacing specific products with categories of selected socio-demographic characteristics of persons or households, the method can be applied to data from sample surveys or censuses in order to search for relevant rules and relationships. The main aim of the paper is to apply market basket analysis in the context of surveys conducted by official statistics in Poland. Using data from the most recent census (NSP 2011), the SAS and R software with the arules and arulesViz packages, and selected variables from that census, the authors indicate areas where market basket analysis is useful for identifying relevant rules in this dataset in the context of biological disability. Keywords: market basket analysis, National Census of Population and Housing 2011, biological disability. Summary: Commonly used as a tool in marketing studies, market basket analysis (MBA) has not been applied in surveys conducted by the Central Statistical Office so far. If one modifies
Surveys and censuses conducted by the Central Statistical Office in Poland are the main sources of information about disability for official statistics. Because sample sizes for relevant cross-classification domains are too small to employ classical estimation methods, results are usually published at a relatively high level of aggregation (at country or province level) or for very broadly defined domains. To meet the growing demand for detailed information about disability, the authors attempt to apply the methodology of small area estimation to estimate the percentage of disabled people, in the legal and biological sense, across districts (NUTS 4/LAU 1 units) of the province of Wielkopolska cross-classified by level of education. This methodological exercise is based on data from the 2011 census and employs selected techniques of indirect estimation. The estimates obtained in the study indicate the spatial variation of disability in the target domains with greater precision. It is worth noting that this level of aggregation has not been considered for official statistical outputs because the estimation errors of the direct estimator are unacceptably high.