Bardia Doosti scite author profile

Mutual gaze detection, i.e., predicting whether or not two people are looking at each other, plays an important role in understanding human interactions. In this work, we focus on the task of image-based mutual gaze detection, and propose a simple and effective approach to boost the performance by using an auxiliary 3D gaze estimation task during the training phase. We achieve the performance boost without additional labeling cost by training the 3D gaze estimation branch using pseudo 3D gaze labels deduced from mutual gaze labels. By sharing the head image encoder between the 3D gaze estimation and the mutual gaze detection branches, we achieve better head features than learned by training the mutual gaze detection branch alone. Experimental results on three image datasets show that the proposed approach improves the detection performance significantly without additional annotations. This work also introduces a new image dataset that consists of 33.1K pairs of humans annotated with mutual gaze labels in 29.2K images.

show abstract

Observing Pianist Accuracy and Form with Computer Vision

Lee

Doosti

et al. 2019

View full text Add to dashboard Cite

HOPE-Net: A Graph-based Model for Hand-Object Pose Estimation

Doosti¹,

Naha²,

Mirbagheri³

et al. 2020

Preprint

View full text Add to dashboard Cite

Hand-object pose estimation (HOPE) aims to jointly detect the poses of both a hand and of a held object. In this paper, we propose a lightweight model called HOPE-Net which jointly estimates hand and object pose in 2D and 3D in real-time. Our network uses a cascade of two adaptive graph convolutional neural networks, one to estimate 2D coordinates of the hand joints and object corners, followed by another to convert 2D coordinates to 3D. Our experiments show that through end-to-end training of the full network, we achieve better accuracy for both the 2D and 3D coordinate estimation problems. The proposed 2D to 3D graph convolution-based model could be applied to other 3D landmark detection problems, where it is possible to first predict the 2D keypoints and then transform them to 3D.

show abstract

An Early Rico Retrospective: Three Years of Uses for a Mobile App Dataset

Deka

Doosti

Huang

et al. 2021

View full text Add to dashboard Cite

A Computational Method for Evaluating UI Patterns

Doosti¹,

Dong²,

Deka³

et al. 2018

Preprint

View full text Add to dashboard Cite

UI design languages, such as Google's Material Design, make applications both easier to develop and easier to learn by providing a set of standard UI components. Nonetheless, it is hard to assess the impact of design languages in the wild. Moreover, designers often get stranded by strong-opinionated debates around the merit of certain UI components, such as the Floating Action Button and the Navigation Drawer. To address these challenges, this short paper introduces a method for measuring the impact of design languages and informing design debates through analyzing a dataset consisting of view hierarchies, screenshots, and app metadata for more than 9,000 mobile apps. Our data analysis shows that use of Material Design is positively correlated to app ratings, and to some extent, also the number of installs. Furthermore, we show that use of UI components vary by app category, suggesting a more nuanced view needed in design debates.

show abstract

Boosting Image-based Mutual Gaze Detection using Pseudo 3D Gaze

Doosti¹,

Chen²,

Vemulapalli³

et al. 2020

Preprint

View full text Add to dashboard Cite

Mutual gaze detection, i.e., predicting whether or not two people are looking at each other, plays an important role in understanding human interactions. In this work, we focus on the task of image-based mutual gaze detection, and propose a simple and effective approach to boost the performance by using an auxiliary 3D gaze estimation task during training. We achieve the performance boost without additional labeling cost by training the 3D gaze estimation branch using pseudo 3D gaze labels deduced from mutual gaze labels. By sharing the head image encoder between the 3D gaze estimation and the mutual gaze detection branches, we achieve better head features than learned by training the mutual gaze detection branch alone. Experimental results on three image datasets show that the proposed approach improves the detection performance significantly without additional annotations. This work also introduces a new image dataset that consists of 33.1K pairs of humans annotated with mutual gaze labels in 29.2K images.

show abstract

A Deep Study into the History of Web Design

Doosti

Crandall

2017

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Bardia Doosti

HOPE-Net: A Graph-Based Model for Hand-Object Pose Estimation

Boosting Image-based Mutual Gaze Detection using Pseudo 3D Gaze

Observing Pianist Accuracy and Form with Computer Vision

HOPE-Net: A Graph-based Model for Hand-Object Pose Estimation

An Early Rico Retrospective: Three Years of Uses for a Mobile App Dataset

A Computational Method for Evaluating UI Patterns

Boosting Image-based Mutual Gaze Detection using Pseudo 3D Gaze

A Deep Study into the History of Web Design

Contact Info

Product

Resources

About