Tomoaki Matsunami scite author profile

In this work, an interactive entertainment system which employs multiplehuman tracking from a single camera is presented. The proposed system robustly tracks people in an indoor environment and displays their predicted future footsteps in front of them in real-time. The system is composed of a video camera, a computer and a projector. There are three main modules: tracking, analysis and visualization. The tracking module extracts people as moving blobs by using an adaptive background subtraction algorithm. Then, the location and orientation of their next footsteps are predicted. The future footsteps are visualized by a high-paced continuous display of foot images in the predicted location to simulate the natural stepping of a person. To evaluate the performance, the proposed system was exhibited during a public art exhibition in an airport. People showed surprise, excitement, curiosity. They tried to control the display of the footsteps by making various movements.

show abstract

Pedestrian Attribute Analysis Using a Top-View Camera in a Public Space

Yamasaki

Matsunami

2012

View full text Add to dashboard Cite

Abstract. In this paper, we propose a method to analyze gender of the pedestrian and whether he or she has a baggage or not in a public space. The challenging part of this work is we only use top-view camera images to protect the pedestrians' privacy. We focused on temporal changes in their position, shape, and contours over the frames because their appearances do not provide much information. We extracted the pedestrians' features using their position, area, aspect ratio, histogram of oriented gradients (HoG), and Fourier descriptors. The temporal information was taken into consideration by employing Gaussian mixture models (GMM), GMM universal background model (GMM-UBM), and bag of features (BoF) model. The attributes were classified by using support vector machines (SVM). We conducted experiments using 60-minute video captured by a top-view camera attached at an airport. Experimental results show that the classification accuracy is 69% for the gender classification and 79% for baggage possession classification.Keywords: Human attributes, surveillance, gender classification, bag possession classification. IntroductionVisual surveillance has been one of the most active research areas in computer vision. Surveillance cameras have been installed in a lot of places in such as stations, airports, or on the streets for security purposes. Visual surveillance data are easy to analyze for humans. On the other hand, analyzing the data by computers requires a wide range of algorithms such as moving object detection, object classification, counting, tracking, behavior labeling, human identification, abnormal object/event detection, flux analysis, data fusion collected from multiple cameras, and so on. Understanding human attribute and behavior, in particular, is getting more attention not only for security reasons but for better services, marketing, and so on. If surveillance systems can recognize gender and age range of the passengers, digital 542 T. Yamasaki and T. Matsunami singnage dedicatedly designed for a particular target can be displayed. If systems detect children who are alone, they might be lost and looking for their parents. In addition, systems can alert person who is carrying a large suitcases widely spread behind him/her, which is dangerous and is becoming a significant safety issue in crowded airports and stations. For activity recognition, Chen and Hauptmann proposed MoSIFT [3]. MoSIFT was an extension of the Scale Invariant Feature Transform (SIFT) [4] features to the temporal domain and showed its superiority to Histogram of Oriented Gradients (HoG) [5] . Zhang et al. analyzed the optimal camera angle for the gender classification using SMV classifiers [10], in which only yaw angles were considered. In these approaches, however, the quality of the images was well-controlled: target objects were large enough, taken from the frontal-view, and so on. On the other hand, only top-view images taken by a surveillance camera is used in this work, which could protect the pedestrians' privacy. Another challengin...

show abstract

Detecting Resized JPEG Images by Analyzing High Frequency Elements in DCT Coefficients

Yamasaki

Matsunami

Aizawa

2010

View full text Add to dashboard Cite

Evaluation on Biometric Accuracy Estimation Using Generalized Pareto (GP) Distribution

Yamada

Matsunami

2021

View full text Add to dashboard Cite

Human Attribute Analysis Using a Top-View Camera Based on Two-Stage Classification

Yamasaki

Matsunami

Chen

2013

IEICE Trans. Inf. & Syst.

View full text Add to dashboard Cite

SUMMARYThis paper presents a technique that analyzes pedestrians' attributes such as gender and bag-possession status from surveillance video. One of the technically challenging issues is that we use only topview camera images to protect privacy. The shape features over the frames are extracted by bag-of-features (BoF) using histogram of oriented gradients (HoG) vectors. In order to enhance the classification accuracy, a two-staged classification framework is presented. Multiple classifiers are trained by changing the parameters in the first stage. The outputs from the first stage is further trained and classified in the second stage classifier. The experiments using 60-minute video captured at Haneda Airport, Japan, show that the accuracies for the gender classification and the bagpossession classification were 95.8% and 97.2%, respectively, which is a significant improvement from our previous work.

show abstract

Human attribute analysis using a top-view camera based on multi-stage classification

Yamasaki

Matsunami

2011

View full text Add to dashboard Cite

This paper proposes pedestrians' attribute analysis such as gender and whether they have bags with them based on multi-layer classification. One of the technically challenging issues is we use only top-view camera images to protect the privacy of the pedestrians. The shape features over the frames are extracted by bag-of-features (BoF) using histogram of oriented gradients (HoG) vectors with the optimized parameters. Then, multiple classifiers using support vector machine (SVM) were generated by changing the parameters for the feature generation. A set of classification results using the multiple classifiers is fed to the second stage classifier to obtain the final results. The experimental results using 60-minute video captured at Haneda Airport, Japan, show that the accuracies for the gender classification and the with/without baggage classification were 95.8% and 97.2%, respectively with low false positive/negative rates, which is a significant improvement from our previous work which yielded 68.5% and 78.8% of accuracy, respectively.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.