On-demand delivery is an emerging business in recent years where accurate indoor locations of Gig couriers play an important role in the order dispatch and delivery process. To cater to this need, WiFi-based indoor positioning methods have become an alternative method for on-demand delivery thanks to extensive WiFi deployment in the indoor environment. Existing WiFi-based indoor localization and positioning methods are not suitable for large-scale on-demand delivery scenarios due to high costs (e.g., high labor cost to collect fingerprints) and limited coverage due to limited labeled data. In this work, we explore (i) massive crowdsourced WiFi data collecting from wearable or mobile devices of couriers with little extra effort and (ii) natural manual reports data in the delivery process as two opportunities to perform merchant-level indoor positioning in a weak-supervised manner. Specifically, we proposed WePos, an end-to-end weak-supervised-based merchant-level positioning framework, which consists of the following three parts: (i) a Bidirectional Encoder Representations from Transformers (BERT) based pre-training module to learn latent embeddings of WiFi access points, (ii) a contrastive label self-generate module to produce pseudos for WiFi scanning lists by matching similarity embedding clustering results and couriers' reporting behaviors. (iii) a deep neural network-based classifier to fine-tune the whole training process and conduct online merchant-level position inference. To evaluate the performance of our system, we conduct extensive experiments in both a large-scale public crowdsourcing dataset with over 50 GB of WiFi signal records and a real-world WiFi crowdsourced dataset collected from Eleme, (i.e., one of the largest on-demand delivery platforms in China) in four multi-floor malls in Shanghai. Experimental results show that WePos outperforms state-of-the-art baselines in the merchant-level positioning performance, offer up to 91.4% in positioning accuracy.
An address, a textual description of a physical location, plays an important role in location-based services such as on-demand delivery and e-commerce. However, abnormal addresses (i.e., an address without detailed information representing a spatial location) have led to significant costs. In real-world settings like e-commerce, abnormal address detection is not trivial because it needs to be completed in real-time to support massive online queries. In this study, we design FastAddr, a fast abnormal address detection framework, which detects abnormal addresses among millions of addresses in a short time. By investigating and modeling the hierarchical structure of address data, we first design a novel contrastive address augmentation approach to generate training data via learning the entity transition probability matrix. We further design a lightweight multi-head attention model for learning compact address representation by modeling the address characteristics. We conduct a comprehensive three-phase evaluation. (i) We evaluate FastAddr on a real-world dataset and it yields the average F1 of 85.7% in 0.058 milliseconds, which outperforms the state-of-the-art models by 47.4% with similar detection time. (ii) An offline A/B test shows that FastAddr outperforms the previous deployed model significantly. (iii) We also conduct an online A/B test to compare FastAddr with the deployed model, which shows an improvement of F1 by more than 20%. Moreover, a real-world case study demonstrates both the efficiency and effectiveness of FastAddr.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.