Wei Li scite author profile

This paper proposes a new model of ethnic settlement, the ethnoburb. Ethnoburbs are suburban ethnic clusters of residential areas and business districts in large American metropolitan areas. They are multi-ethnic communities, in which one ethnic minority group has a significant concentration, but does not necessarily comprise a majority. The paper operationalises the ethnoburb model via an analysis of the demographic profiles and socioeconomic characteristics of the ethnoburban Chinese population in Los Angeles in order to understand the ethnoburb's role as a global economic outpost. This analysis also highlights social stratification by country of origin, and the micro-geographies of neighbourhood and workplace, features which reveal the ethnoburb's character as an urban mosaic.

show abstract

Fairness in Recommendation Ranking through Pairwise Comparisons

Beutel

et al. 2019

View full text Add to dashboard Cite

Recommender systems are one of the most pervasive applications of machine learning in industry, with many services using them to match users to products or information. As such it is important to ask: what are the possible fairness risks, how can we quantify them, and how should we address them?In this paper we offer a set of novel metrics for evaluating algorithmic fairness concerns in recommender systems. In particular we show how measuring fairness based on pairwise comparisons from randomized experiments provides a tractable means to reason about fairness in rankings from recommender systems. Building on this metric, we offer a new regularizer to encourage improving this metric during model training and thus improve fairness in the resulting rankings. We apply this pairwise regularization to a large-scale, production recommender system and show that we are able to significantly improve the system's pairwise fairness.

show abstract

Multi-Interest Network with Dynamic Routing for Recommendation at Tmall

Liu

et al. 2019

234

169

View full text Add to dashboard Cite

Industrial recommender systems usually consist of the matching stage and the ranking stage, in order to handle the billion-scale of users and items. The matching stage retrieves candidate items relevant to user interests, while the ranking stage sorts candidate items by user interests. Thus, the most critical ability is to model and represent user interests for either stage. Most of the existing deep learning-based models represent one user as a single vector which is insufficient to capture the varying nature of user's interests. In this paper, we approach this problem from a different view, to represent one user with multiple vectors encoding the different aspects of the user's interests. We propose the Multi-Interest Network with Dynamic routing (MIND) for dealing with user's diverse interests in the matching stage. Specifically, we design a multi-interest extractor layer based on capsule routing mechanism, which is applicable for clustering historical behaviors and extracting diverse interests. Furthermore, we develop a technique named label-aware attention to help learn a user representation with multiple vectors. Through extensive experiments on several public benchmarks and one largescale industrial dataset from Tmall, we demonstrate that MIND can achieve superior performance than state-of-the-art methods for recommendation. Currently, MIND has been deployed for handling major online traffic at the homepage on Mobile Tmall App.

show abstract

Recommending what video to watch next

et al. 2019

View full text Add to dashboard Cite

A Streaming On-Device End-To-End Model Surpassing Server-Side Conventional Model Quality and Latency

Sainath

et al. 2020

174

126

View full text Add to dashboard Cite

Thus far, end-to-end (E2E) models have not been shown to outperform state-of-the-art conventional models with respect to both quality, i.e., word error rate (WER), and latency, i.e., the time the hypothesis is finalized after the user stops speaking. In this paper, we develop a first-pass Recurrent Neural Network Transducer (RNN-T) model and a second-pass Listen, Attend, Spell (LAS) rescorer that surpasses a conventional model in both quality and latency. On the quality side, we incorporate a large number of utterances across varied domains [1] to increase acoustic diversity and the vocabulary seen by the model. We also train with accented English speech to make the model more robust to different pronunciations. In addition, given the increased amount of training data, we explore a varied learning rate schedule. On the latency front, we explore using the end-of-sentence decision emitted by the RNN-T model to close the microphone, and also introduce various optimizations to improve the speed of LAS rescoring. Overall, we find that RNN-T+LAS offers a better WER and latency tradeoff compared to a conventional model. For example, for the same latency, RNN-T+LAS obtains a 8% relative improvement in WER, while being more than 400-times smaller in model size.

show abstract

Sampling-bias-corrected neural modeling for large corpus item recommendations

Yang

Hong

et al. 2019

123

View full text Add to dashboard Cite

Distributions of suspended sediment concentration in the Yellow Sea and the East China Sea based on field surveys during the four seasons of 2011

Bian

Jiang

Quan

et al. 2013

Journal of Marine Systems

111

View full text Add to dashboard Cite

Two-Pass End-to-End Speech Recognition

C¹,

Pang²,

Rybach³

et al. 2019

103

View full text Add to dashboard Cite

The requirements for many applications of state-of-the-art speech recognition systems include not only low word error rate (WER) but also low latency. Specifically, for many use-cases, the system must be able to decode utterances in a streaming fashion and faster than real-time. Recently, a streaming recurrent neural network transducer (RNN-T) end-to-end (E2E) model has shown to be a good candidate for on-device speech recognition, with improved WER and latency metrics compared to conventional on-device models [1]. However, this model still lags behind a large state-of-the-art conventional model in quality [2]. On the other hand, a non-streaming E2E Listen, Attend and Spell (LAS) model has shown comparable quality to large conventional models [3]. This work aims to bring the quality of an E2E streaming model closer to that of a conventional system by incorporating a LAS network as a second-pass component, while still abiding by latency constraints. Our proposed two-pass model achieves a 17%-22% relative reduction in WER compared to RNN-T alone and increases latency by a small fraction over RNN-T.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Wei Li

Anatomy of a New Ethnic Settlement: The Chinese Ethnoburb in Los Angeles

Fairness in Recommendation Ranking through Pairwise Comparisons

Multi-Interest Network with Dynamic Routing for Recommendation at Tmall

Recommending what video to watch next

A Streaming On-Device End-To-End Model Surpassing Server-Side Conventional Model Quality and Latency

Sampling-bias-corrected neural modeling for large corpus item recommendations

Distributions of suspended sediment concentration in the Yellow Sea and the East China Sea based on field surveys during the four seasons of 2011

Two-Pass End-to-End Speech Recognition

Contact Info

Product

Resources

About