Paweł Cichosz scite author profile

Temporal di erence (TD) methods constitute a class of methods for learning predictions in multi-step prediction problems, parameterized by a recency factor . Currently the most important application of these methods is to temporal credit assignment in reinforcement learning. Well known reinforcement learning algorithms, such as AHC or Q-learning, may be viewed as instances of TD learning. This paper examines the issues of the e cient and general implementation of TD( ) for arbitrary , for use with reinforcement learning algorithms optimizing the discounted sum of rewards. The traditional approach, based on eligibility traces, is argued to su er from both ine ciency and lack of generality. The TTD (Truncated Temporal Di erences) procedure is proposed as an alternative, that indeed only approximates TD( ), but requires very little computation per action and can be used with arbitrary function representation methods. The idea from which it is derived is fairly simple and not new, but probably unexplored so far. Encouraging experimental results are presented, suggesting that using > 0 with the TTD procedure allows one to obtain a signi cant learning speedup at essentially the same cost as usual TD(0) learning.

show abstract

A Case Study in Text Mining of Discussion Forum Posts: Classification with Bag of Words and Global Vectors

Cichosz

2018

View full text Add to dashboard Cite

Despite the rapid growth of other types of social media, Internet discussion forums remain a highly popular communication channel and a useful source of text data for analyzing user interests and sentiments. Being suited to richer, deeper, and longer discussions than microblogging services, they particularly well reflect topics of long-term, persisting involvement and areas of specialized knowledge or experience. Discovering and characterizing such topics and areas by text mining algorithms is therefore an interesting and useful research direction. This work presents a case study in which selected classification algorithms are applied to posts from a Polish discussion forum devoted to psychoactive substances received from home-grown plants, such as hashish or marijuana. The utility of two different vector text representations is examined: the simple bag of words representation and the more refined embedded global vectors one. While the former is found to work well for the multinomial naive Bayes algorithm, the latter turns out more useful for other classification algorithms: logistic regression, SVMs, and random forests. The obtained results suggest that post-classification can be applied for measuring publication intensity of particular topics and, in the case of forums related to psychoactive substances, for monitoring the risk of drug-related crime.

show abstract

Fast and Efficient Reinforcement Learning with Truncated Temporal Differences

Cichosz

Mulawka

1995

View full text Add to dashboard Cite

An Analysis of Experience Replay in Temporal Difference Learning

Cichosz¹

1999

Cybernetics and Systems

View full text Add to dashboard Cite

Machine Learning Algorithms for Prediction of the Quality of Transmission in Optical Networks

Kozdrowski

Cichosz

Paziewski

et al. 2020

Entropy

View full text Add to dashboard Cite

Increasing demand in the backbone Dense Wavelength Division (DWDM) Multiplexing network traffic prompts an introduction of new solutions that allow increasing the transmission speed without significant increase of the service cost. In order to achieve this objective simpler and faster, DWDM network reconfiguration procedures are needed. A key problem that is intrinsically related to network reconfiguration is that of the quality of transmission assessment. Thus, in this contribution a Machine Learning (ML) based method for an assessment of the quality of transmission is proposed. The proposed ML methods use a database, which was created only on the basis of information that is available to a DWDM network operator via the DWDM network control plane. Several types of ML classifiers are proposed and their performance is tested and compared for two real DWDM network topologies. The results obtained are promising and motivate further research.

show abstract

Assessing the quality of classification models: Performance measures and evaluation procedures

Cichosz

2011

View full text Add to dashboard Cite

This article systematically reviews techniques used for the evaluation of classification models and provides guidelines for their proper application. This includes performance measures assessing the model’s performance on a particular dataset and evaluation procedures applying the former to appropriately selected data subsets to produce estimates of their expected values on new data. Their common purpose is to assess model generalization capabilities, which are crucial for judging the applicability and usefulness of both classification and any other data mining models. The review presented in this article is expected to be sufficiently in-depth and complete for most practical needs, while remaining clear and easy to follow with little prior knowledge. Issues that receive special attention include incorporating instance weights to performance measures, combining the same set of evaluation procedures with arbitrary performance measures, and avoiding pitfalls related to separating data subsets used for evaluation from those used for model creation. With the classification task unquestionably being one of the central data mining tasks and the vastly increasing number of data mining applications — not only in business, but also in engineering and research — this is expected to be interesting and useful for a wide audience. All presented techniques are accompanied by simple R language implementations and usage examples, which — whereas created to serve the illustration purpose mostly — can be actually used in practice.

show abstract

Real-time energy purchase optimization for a storage-integrated photovoltaic system by deep reinforcement learning

Kołodziejczyk

Żółtowska

Cichosz

2021

Control Engineering Practice

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Paweł Cichosz

Data Mining Algorithms

Truncating Temporal Differences: On the Efficient Implementation of TD(lambda) for Reinforcement Learning

A Case Study in Text Mining of Discussion Forum Posts: Classification with Bag of Words and Global Vectors

Fast and Efficient Reinforcement Learning with Truncated Temporal Differences

An Analysis of Experience Replay in Temporal Difference Learning

Machine Learning Algorithms for Prediction of the Quality of Transmission in Optical Networks

Assessing the quality of classification models: Performance measures and evaluation procedures

Real-time energy purchase optimization for a storage-integrated photovoltaic system by deep reinforcement learning

Contact Info

Product

Resources

About