Abstract:Automatic understanding and analysis of groups has attracted increasing attention in the vision and multimedia communities in recent years. However, little attention has been paid to the automatic analysis of group membership-i.e., recognizing which group the individual in question is part of. This paper presents a novel two-phase Support Vector Machine (SVM) based specific recognition model that is learned using an optimized generic recognition model. We conduct a set of experiments using a database collected… Show more
“…In our previous work Mou et al (2017), the generic recognition model and the various specific recognition models were trained separately. Specifically, we first trained a generic recognition model, obtaining an optimal value of the parameter w 0 , and then we trained a set of specific recognition models based on the optimized generic recognition model.…”
Section: Proposed Frameworkmentioning
confidence: 99%
“…The video IDs are stated in parentheses and are used to refer to the videos in the rest of the paper; the corresponding conditions and video durations (in minutes) are also listed. tion model, that was proposed in our previous work Mou et al (2017), allows the group membership recognition across all different conditions. However, since group members may behave distinctly in different conditions (e.g., while watching horror movies vs. comedies), the performance of generic recognition model may be significantly limited.…”
Section: Introductionmentioning
confidence: 99%
“…Therefore, we propose a specific recognition model for each condition specifically, but we learn it on the top of the generic recognition model. This paper is an extended version of our previous work Mou et al (2017). In Mou et al (2017), we proposed a two-phase learning framework to solve the group membership recognition problem, where we first trained a generic recognition model using all videos across all conditions and, then optimized the specific recognition model for each specific condition based on the optimization results obtained from the generic recognition model.…”
Section: Introductionmentioning
confidence: 99%
“…This paper is an extended version of our previous work Mou et al (2017). In Mou et al (2017), we proposed a two-phase learning framework to solve the group membership recognition problem, where we first trained a generic recognition model using all videos across all conditions and, then optimized the specific recognition model for each specific condition based on the optimization results obtained from the generic recognition model. Different from the aforementioned paper, in this work we unify the generic recognition model and the specific recognition model under a single deep framework.…”
Section: Introductionmentioning
confidence: 99%
“…Furthermore, we conduct new experiments with a larger dataset. In the rest of the paper, we refer to the specific recognition model (as presented in our previous work Mou et al (2017)) as the two-phase Specific Recognition Model (SRM) and to the proposed Deep Specific Recognition Model as the DeepSRM.…”
Automatic understanding and analysis of groups has attracted increasing attention in the vision and multimedia communities in recent years. However, little attention has been paid to the automatic analysis of the non-verbal behaviors and how this can be utilized for analysis of group membership, i.e., recognizing which group each individual is part of. This paper presents a novel Support Vector Machine (SVM) based Deep Specific Recognition Model (DeepSRM) that is learned based on a generic recognition model. The generic recognition model refers to the model trained with data across different conditions, i.e., when people are watching movies of different types. Although the generic recognition model can provide a baseline for the recognition model trained for each specific condition, the different behaviors people exhibit in different conditions limit the recognition performance of the generic model. Therefore, the specific recognition model is proposed for each condition separately and built on top of the generic recognition model. A number of experiments are conducted using a database aiming to study group analysis while each group (i.e., four participants together) were watching a number of long movie segments. Our experimental results show that the proposed deep specific recognition model (44%) outperforms the generic recognition model (26%). The recognition of group membership also indicates that the non-verbal behaviors of individuals within a group share commonalities.
“…In our previous work Mou et al (2017), the generic recognition model and the various specific recognition models were trained separately. Specifically, we first trained a generic recognition model, obtaining an optimal value of the parameter w 0 , and then we trained a set of specific recognition models based on the optimized generic recognition model.…”
Section: Proposed Frameworkmentioning
confidence: 99%
“…The video IDs are stated in parentheses and are used to refer to the videos in the rest of the paper; the corresponding conditions and video durations (in minutes) are also listed. tion model, that was proposed in our previous work Mou et al (2017), allows the group membership recognition across all different conditions. However, since group members may behave distinctly in different conditions (e.g., while watching horror movies vs. comedies), the performance of generic recognition model may be significantly limited.…”
Section: Introductionmentioning
confidence: 99%
“…Therefore, we propose a specific recognition model for each condition specifically, but we learn it on the top of the generic recognition model. This paper is an extended version of our previous work Mou et al (2017). In Mou et al (2017), we proposed a two-phase learning framework to solve the group membership recognition problem, where we first trained a generic recognition model using all videos across all conditions and, then optimized the specific recognition model for each specific condition based on the optimization results obtained from the generic recognition model.…”
Section: Introductionmentioning
confidence: 99%
“…This paper is an extended version of our previous work Mou et al (2017). In Mou et al (2017), we proposed a two-phase learning framework to solve the group membership recognition problem, where we first trained a generic recognition model using all videos across all conditions and, then optimized the specific recognition model for each specific condition based on the optimization results obtained from the generic recognition model. Different from the aforementioned paper, in this work we unify the generic recognition model and the specific recognition model under a single deep framework.…”
Section: Introductionmentioning
confidence: 99%
“…Furthermore, we conduct new experiments with a larger dataset. In the rest of the paper, we refer to the specific recognition model (as presented in our previous work Mou et al (2017)) as the two-phase Specific Recognition Model (SRM) and to the proposed Deep Specific Recognition Model as the DeepSRM.…”
Automatic understanding and analysis of groups has attracted increasing attention in the vision and multimedia communities in recent years. However, little attention has been paid to the automatic analysis of the non-verbal behaviors and how this can be utilized for analysis of group membership, i.e., recognizing which group each individual is part of. This paper presents a novel Support Vector Machine (SVM) based Deep Specific Recognition Model (DeepSRM) that is learned based on a generic recognition model. The generic recognition model refers to the model trained with data across different conditions, i.e., when people are watching movies of different types. Although the generic recognition model can provide a baseline for the recognition model trained for each specific condition, the different behaviors people exhibit in different conditions limit the recognition performance of the generic model. Therefore, the specific recognition model is proposed for each condition separately and built on top of the generic recognition model. A number of experiments are conducted using a database aiming to study group analysis while each group (i.e., four participants together) were watching a number of long movie segments. Our experimental results show that the proposed deep specific recognition model (44%) outperforms the generic recognition model (26%). The recognition of group membership also indicates that the non-verbal behaviors of individuals within a group share commonalities.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.