IntroductionNowadays, the mobile phone is one of the fastest growing technologies in the developing world with global penetration rates reaching 90% [1]. This makes it a huge warehouse for customer's data. That is, every action taken by the customer (short message service (SMS), Call or Internet session) gets recorded within the telecom operator, in the so called (CDRs). There are many types of CDRs used mainly by telecom billing systems. CDR contains a lot of information, (type of event, who is involved in this event, datetime, cell identifier where this event has taken place). This raw data represents a valuable source for analyzing human and social behavior [2]. In the agricultural domain [3] mobile phone data is used to analyze mobility and seasonal activity patterns related to livelihood zones in Senegal, by creating mobility profiles for population and segmentation. While in energy domain [4] this data is used to analyze human activity, facilitate population growth estimation in rural areas and extrapolate electricity needs. In health sector [5,6] mobile phone data is used to study the relation between human mobility
AbstractIn the age of data driven solution, the customer demographic attributes, such as gender and age, play a core role that may enable companies to enhance the offers of their services and target the right customer in the right time and place. In the marketing campaign, the companies want to target the real user of the GSM (global system for mobile communications), not the line owner. Where sometimes they may not be the same. This work proposes a method that predicts users' gender and age based on their behavior, services and contract information. We used call detail records (CDRs), customer relationship management (CRM) and billing information as a data source to analyze telecom customer behavior, and applied different types of machine learning algorithms to provide marketing campaigns with more accurate information about customer demographic attributes. This model is built using reliable data set of 18,000 users provided by SyriaTel Telecom Company, for training and testing. The model applied by using big data technology and achieved 85.6% accuracy in terms of user gender prediction and 65.5% of user age prediction. The main contribution of this work is the improvement in the accuracy in terms of user gender prediction and user age prediction based on mobile phone data and end-to-end solution that approaches customer data from multiple aspects in the telecom domain. which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.