Proceedings of the 24th International Conference on World Wide Web 2015
DOI: 10.1145/2736277.2741643

Hierarchical Neural Language Models for Joint Representation of Streaming Documents and their Content

Abstract: We consider the problem of learning distributed representations for documents in data streams. The documents are represented as low-dimensional vectors and are jointly learned with distributed vector representations of word tokens using a hierarchical framework with two embedded neural language models. In particular, we exploit the context of documents in streams and use one of the language models to model the document sequences, and the other to model word sequences within them. The models learn continuous ve…
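The abstract's two-level design (one language model over the document stream, another over each document's words, sharing the document vectors) can be made concrete with a small sketch. The following is a minimal illustration only, not the authors' implementation: it uses a toy three-document stream, plain sigmoid scoring on positive pairs instead of the hierarchical softmax and negative sampling a real system would need, and made-up hyperparameters.

```python
# Minimal sketch of the joint objective described in the abstract.
# Assumptions: toy corpus, positive pairs only (no negative sampling),
# invented hyperparameters. Not the paper's actual training procedure.
import numpy as np

rng = np.random.default_rng(0)
dim, lr = 16, 0.05

# Toy document stream: each inner list is one document's word sequence.
stream = [["deep", "learning", "models"],
          ["learning", "word", "vectors"],
          ["streaming", "document", "vectors"]]

vocab = sorted({w for doc in stream for w in doc})
W = {w: rng.normal(0.0, 0.1, dim) for w in vocab}   # word vectors
D = rng.normal(0.0, 0.1, (len(stream), dim))        # document vectors

def pull_together(a, b):
    """One gradient step on log sigmoid(a.b) for a positive pair.
    (Negative sampling is omitted to keep the sketch short.)"""
    score = 1.0 / (1.0 + np.exp(-(a @ b)))
    g = lr * (1.0 - score)
    a_old = a.copy()
    a += g * b           # in-place updates propagate to D and W
    b += g * a_old

for epoch in range(50):
    for i, doc in enumerate(stream):
        # Model 1: document-sequence model -- a document predicts its
        # temporal neighbours in the stream.
        for j in (i - 1, i + 1):
            if 0 <= j < len(stream):
                pull_together(D[i], D[j])
        # Model 2: word-sequence model -- surrounding words and the
        # document's own vector predict each word.
        for t, w in enumerate(doc):
            for u in doc[max(0, t - 1):t + 2]:
                if u != w:
                    pull_together(W[w], W[u])
            pull_together(D[i], W[w])  # ties content to the document vector

# Documents sharing words should end up closer than unrelated ones.
sim = lambda a, b: float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))
print(sim(D[0], D[1]), sim(D[0], D[2]))
```

The point the sketch preserves is that each document vector receives gradients from both models, so it reflects both its neighbours in the stream and its own word content.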

Cited by 63 publications (54 citation statements)
References 16 publications
“…These powerful, efficient models have shown very promising results in capturing both semantic and syntactic relationships between words in large-scale text corpora, and have obtained state-of-the-art results on many NLP tasks. Recently, the concept of embedding has been extended to many applications, including sentence and paragraph representation [11], summarization [21], question answering [43], recommender systems [34], and so on.…”
Section: Embedding
Citation type: mentioning
confidence: 99%
“…Thirdly, slight differences in the styles and genres of music pieces are also reflected in the learned embeddings, which shows that the embeddings learned by MEM can effectively capture the characteristic features of the corresponding music pieces. For example, the last four music pieces (13–16) in Table 8 are all anime soundtracks, and they are more similar to each other than to the other pieces (1–12). In addition, the former two pieces (13, 14) are more similar to each other than the latter two pieces (15, 16) in Table 8.…”
Section: Illustrations of Selected Music Pieces' Embeddings
Citation type: mentioning
confidence: 99%
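The similarity comparisons in this excerpt typically amount to cosine similarity between embedding vectors. A tiny sketch follows; the random vectors stand in for the MEM embeddings of the excerpt's Table 8, which are not available here:

```python
# Cosine similarity between (stand-in) music-piece embeddings; random
# vectors replace the actual MEM embeddings, which we do not have.
import numpy as np

rng = np.random.default_rng(1)
emb = rng.normal(size=(16, 32))            # 16 pieces, 32-dim vectors

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# e.g., compare the anime soundtracks (pieces 13-16; zero-based 12-15)
print(cosine(emb[12], emb[13]))            # pieces 13 vs 14
print(cosine(emb[14], emb[15]))            # pieces 15 vs 16
```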
“…The related work is largely focused on the notion of word and text representations (as in (Djuric et al., 2015a; Le and Mikolov, 2014; Mikolov et al., 2013a)), which improve on previous work modeling lexical semantics with vector space models (Mikolov et al., 2013a). More recently, the concept of embeddings has been extended beyond words to a number of text segments, including phrases (Mikolov et al., 2013b), sentences and paragraphs (Le and Mikolov, 2014), and entities (Yang et al., 2014).…”
Section: Distributional Representation of Comments (C2V)
Citation type: mentioning
confidence: 99%
“…More recently, the concept of embeddings has been extended beyond words to a number of text segments, including phrases (Mikolov et al., 2013b), sentences and paragraphs (Le and Mikolov, 2014), and entities (Yang et al., 2014). In order to learn vector representations, we develop a comment-embedding approach akin to Le and Mikolov (2014), which differs from the one used in Djuric et al. (2015a) in that our representation does not model relationships between comments (e.g., temporal ones). Moreover, given its similarity to a prior state-of-the-art approach (Djuric et al., 2015b), this method can also serve as a strong baseline.…”
Section: Distributional Representation of Comments (C2V)
Citation type: mentioning
confidence: 99%
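As a concrete illustration of the paragraph-vector-style comment embeddings this excerpt describes, here is a hedged sketch using gensim's Doc2Vec (assuming gensim 4.x); the toy comments and all hyperparameter values are illustrative assumptions, not the cited paper's setup:

```python
# A sketch of paragraph-vector-style comment embeddings (Le and Mikolov,
# 2014). Assumes gensim 4.x; the corpus and hyperparameters are made up.
from gensim.models.doc2vec import Doc2Vec, TaggedDocument

comments = ["great explanation thanks",
            "this thread is off topic",
            "thanks very helpful answer"]
corpus = [TaggedDocument(words=c.split(), tags=[str(i)])
          for i, c in enumerate(comments)]

model = Doc2Vec(corpus, vector_size=50, window=2, min_count=1, epochs=40)

# Unlike Djuric et al. (2015a), nothing here models the temporal order of
# comments: each comment is embedded independently, matching the excerpt.
print(model.dv.most_similar("0"))   # comments closest to comment "0"
```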
“…This requires the aid of a dictionary and Chinese word segmentation technology; however, dictionaries are domain-specific and time-varying, the Chinese word segmentation process is complex, and its accuracy is not high. To achieve high classification performance, supervised document classification algorithms are adopted, such as decision trees, Naive Bayes, KNN (k-nearest neighbors), SVM (support vector machines), neural networks (Djuric et al., 2015; Patel et al., 2013), and genetic algorithms (Revathi, 2013). Because these methods rely on supervised classification, their effectiveness depends heavily on the quality of the manually annotated corpus, and the resulting classification models do not transfer well across domains. In this paper, targeting a corpus of food safety documents, we improve the classification algorithm.…”
Section: Introduction
Citation type: mentioning
confidence: 99%
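To make the supervised-classification setup this excerpt surveys concrete, here is a minimal scikit-learn sketch of one of the named baseline families (an SVM over TF-IDF features); the toy corpus and labels are invented for illustration and are not from the cited work:

```python
# A minimal supervised text classifier (TF-IDF features + linear SVM),
# one of the baseline families named in the excerpt. Toy data only.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

docs = ["contaminated milk recall issued",
        "restaurant passes hygiene inspection",
        "pesticide residue found in vegetables",
        "new cafe opens downtown"]
labels = ["unsafe", "safe", "unsafe", "safe"]

clf = make_pipeline(TfidfVectorizer(), LinearSVC())
clf.fit(docs, labels)
print(clf.predict(["residue detected in milk samples"]))  # likely 'unsafe'
```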