Pedestrian misalignment, which mainly arises from detector errors and pose variations, is a critical problem for a robust person re-identification (re-ID) system. With bad alignment, the background noise will significantly compromise the feature learning and and matching process. To address this problem, this paper introduces the pose invariant embedding (PIE) as a pedestrian descriptor. First, in order to align pedestrians to a standard pose, the PoseBox structure is introduced, which is generated through pose estimation followed by affine transformations. Second, to reduce the impact of pose estimation errors and information loss during PoseBox construction, we design a Pose-Box fusion (PBF) CNN architecture that takes the original image, the PoseBox, and the pose estimation confidence as input. The proposed PIE descriptor is thus defined as the fully connected layer of the PBF network for the retrieval task. Experiments are conducted on the Market-1501, CUHK03, and VIPeR datasets. We show that Pose-Box alone yields decent re-ID accuracy, and that when integrated in the PBF network, the learned PIE descriptor produces competitive performance compared with the stateof-the-art approaches.
Abstract-We present a system for converting a fully panoramic (360 • ) video into a normal field-of-view (NFOV) hyperlapse for an optimal viewing experience. Our system exploits visual saliency and semantics to non-uniformly sample in space and time for generating hyperlapses. In addition, users can optionally choose objects of interest for customizing the hyperlapses. We first stabilize an input 360 • video by smoothing the rotation between adjacent frames and then compute regions of interest and saliency scores. An initial hyperlapse is generated by optimizing the saliency and motion smoothness followed by the saliency-aware frame selection. We further smooth the result using an efficient 2D video stabilization approach that adaptively selects the motion model to generate the final hyperlapse. We validate the design of our system by showing results for a variety of scenes and comparing against the state-of-the-art method through a large-scale user study.
Figure S1 | Comparison of FLUX with the guidestar-assisted wavefront shaping technique in terms of the vulnerability to speckle decorrelation. a. Dynamic samples such as in vivo tissue lead to exponential light field decorrelation over time. b. In digital optical phase conjugation, the fastest guidestar-assisted wavefront shaping technique, a sequence of steps needs to be completed before the shaped light returns to tissue and excites a fluorescent target. Therefore, the field decorrelation during the runtime of the entire cycle determines the system performance. As such, wavefront shaping is not effective because of the significant field decorrelation shown in the figure. c. Signal flow of FLUX. Fluorescence measurement and ultrasound modulated light measurement take place simultaneously, after which the signals will not interact with the sample again, and the speckle decorrelation during data transfer and processing decouple from the system performance. Therefore, this method is much more immune to speckle decorrelation.
Parasitic infections constitute a major global public health issue. Existing screening methods that are based on manual microscopic examination often struggle to provide sufficient volumetric throughput and sensitivity to facilitate early diagnosis. Here, we demonstrate a motility-based label-free computational imaging platform to rapidly detect motile parasites in optically dense bodily fluids by utilizing the locomotion of the parasites as a specific biomarker and endogenous contrast mechanism. Based on this principle, a cost-effective and mobile instrument, which rapidly screens ~3.2 mL of fluid sample in three dimensions, was built to automatically detect and count motile microorganisms using their holographic time-lapse speckle patterns. We demonstrate the capabilities of our platform by detecting trypanosomes, which are motile protozoan parasites, with various species that cause deadly diseases affecting millions of people worldwide. Using a holographic speckle analysis algorithm combined with deep learning-based classification, we demonstrate sensitive and label-free detection of trypanosomes within spiked whole blood and artificial cerebrospinal fluid (CSF) samples, achieving a limit of detection of ten trypanosomes per mL of whole blood (~five-fold better than the current state-of-the-art parasitological method) and three trypanosomes per mL of CSF. We further demonstrate that this platform can be applied to detect other motile parasites by imaging Trichomonas vaginalis, the causative agent of trichomoniasis, which affects 275 million people worldwide. With its cost-effective, portable design and rapid screening time, this unique platform has the potential to be applied for sensitive and timely diagnosis of neglected tropical diseases caused by motile parasites and other parasitic infections in resource-limited regions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.