Yaser Souri scite author profile

Visual attributes are great means of describing images or scenes, in a way both humans and computers understand. In order to establish a correspondence between images and to be able to compare the strength of each property between images, relative attributes were introduced. However, since their introduction, hand-crafted and engineered features were used to learn increasingly complex models for the problem of relative attributes. This limits the applicability of those methods for more realistic cases. We introduce a deep neural network architecture for the task of relative attribute prediction. A convolutional neural network (ConvNet) is adopted to learn the features by including an additional layer (ranking layer) that learns to rank the images based on these features. We adopt an appropriate ranking loss to train the whole network in an end-to-end fashion. Our proposed method outperforms the baseline and state-of-the-art methods in relative attribute prediction on various coarse and fine-grained datasets. Our qualitative results along with the visualization of the saliency maps show that the network is able to learn effective features for each specific attribute. Source code of the proposed method is available at https

show abstract

Fast Weakly Supervised Action Segmentation Using Mutual Consistency

Souri¹,

Fayyaz²,

Minciullo³

et al. 2022

IEEE Trans. Pattern Anal. Mach. Intell.

View full text Add to dashboard Cite

Fast Weakly Supervised Action Segmentation Using Mutual Consistency

Souri¹,

Fayyaz²,

Minciullo³

et al. 2019

Preprint

View full text Add to dashboard Cite

Action segmentation is the task of predicting the actions in each frame of a video. As obtaining the full annotation of videos for action segmentation is expensive, weakly supervised approaches that can learn only from transcripts are appealing. In this paper, we propose a novel, end-to-end approach for weakly supervised action segmentation based on a two-branch neural network. The two branches of our network predict two redundant but different representations for action segmentation. We propose a novel mutual consistency (MuCon) loss that enforces the consistency of the two redundant representations. Using MuCon loss together with a loss for transcript prediction, our proposed approach achieves performance statistically comparable to the state-of-the-art while being 14 times faster to train and 20 times faster during inference.

show abstract

Persian handwritten digit recognition by random forest and convolutional neural networks

Zamani

Souri

Rashidi

et al. 2015

View full text Add to dashboard Cite

Hierarchical Graph-Rnns for Action Detection of Multiple Activities

Biswas

Souri

Gall

2019

View full text Add to dashboard Cite

In this paper, we propose an approach that spatially localizes the activities in a video frame where each person can perform multiple activities at the same time. Our approach takes the temporal scene context as well as the relations of the actions of detected persons into account. While the temporal context is modeled by a temporal recurrent neural network (RNN), the relations of the actions are modeled by a graph RNN. Both networks are trained together and the proposed approach achieves state of the art results on the AVA dataset.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yaser Souri

Deep Relative Attributes

Fast Weakly Supervised Action Segmentation Using Mutual Consistency

Fast Weakly Supervised Action Segmentation Using Mutual Consistency

Persian handwritten digit recognition by random forest and convolutional neural networks

Hierarchical Graph-Rnns for Action Detection of Multiple Activities

Contact Info

Product

Resources

About