Acetylation and characterization of spruce (Picea abies) galactoglucomannans

We develop a robust multi-scale structure-aware neural network for human pose estimation. This method improves the recent deep conv-deconv hourglass models with four key improvements: (1) multiscale supervision to strengthen contextual feature learning in matching body keypoints by combining feature heatmaps across scales, (2) multiscale regression network at the end to globally optimize the structural matching of the multi-scale features, (3) structure-aware loss used in the intermediate supervision and at the regression to improve the matching of keypoints and respective neighbors to infer a higher-order matching configurations, and (4) a keypoint masking training scheme that can effectively fine-tune our network to robustly localize occluded keypoints via adjacent matches. Our method can effectively improve state-of-theart pose estimation methods that suffer from difficulties in scale varieties, occlusions, and complex multi-person scenarios. This multi-scale supervision tightly integrates with the regression network to effectively (i) localize keypoints using the ensemble of multi-scale features, and (ii) infer global pose configuration by maximizing structural consistencies across multiple keypoints and scales. The keypoint masking training enhances these advantages to focus learning on hard occlusion samples. Our method achieves the leading position in the MPII challenge leaderboard among the state-of-the-art methods.

show abstract

T_kML-AP: Adversarial Attacks to Top-k Multi-Label Learning

Wang³

et al. 2021

View full text Add to dashboard Cite

UA-DETRAC 2017: Report of AVSS2017 & IWT4S Challenge on Advanced Traffic Monitoring

Lyu

Chang

et al. 2017

View full text Add to dashboard Cite

Multi-Scale Structure-Aware Network for Human Pose Estimation

Ke¹,

Chang²,

Qi³

et al. 2018

Preprint

View full text Add to dashboard Cite

Global Contrast Enhancement Detection via Deep Multi-Path Network

Zhang

et al. 2018

View full text Add to dashboard Cite

Towards To-a-T Spatio-Temporal Focus for Skeleton-Based Action Recognition

Ke¹,

Peng²,

Lyu³

2022

AAAI

View full text Add to dashboard Cite

Graph Convolutional Networks (GCNs) have been widely used to model the high-order dynamic dependencies for skeleton-based action recognition. Most existing approaches do not explicitly embed the high-order spatio-temporal importance to joints’ spatial connection topology and intensity, and they do not have direct objectives on their attention module to jointly learn when and where to focus on in the action sequence. To address these problems, we propose the To-a-T Spatio-Temporal Focus (STF), a skeleton-based action recognition framework that utilizes the spatio-temporal gradient to focus on relevant spatio-temporal features. We first propose the STF modules with learnable gradient-enforced and instance-dependent adjacency matrices to model the high-order spatio-temporal dynamics. Second, we propose three loss terms defined on the gradient-based spatio-temporal focus to explicitly guide the classifier when and where to look at, distinguish confusing classes, and optimize the stacked STF modules. STF outperforms the state-of-the-art methods on the NTU RGB+D 60, NTU RGB+D 120, and Kinetics Skeleton 400 datasets in all 15 settings over different views, subjects, setups, and input modalities, and STF also shows better accuracy on scarce data and dataset shifting settings.

show abstract

Fusing Global and Local Features for Generalized AI-Synthesized Image Detection

Jiang¹,

Jia²,

Ke³

et al. 2022

Preprint

View full text Add to dashboard Cite

With the development of the Generative Adversarial Networks (GANs) and DeepFakes, AI-synthesized images are now of such high quality that humans can hardly distinguish them from real images. It is imperative for media forensics to develop detectors to expose them accurately. Existing detection methods have shown high performance in generated images detection, but they tend to generalize poorly in the real-world scenarios, where the synthetic images are usually generated with unseen models using unknown source data. In this work, we emphasize the importance of combining information from the whole image and informative patches in improving the generalization ability of AI-synthesized image detection. Specifically, we design a two-branch model to combine global spatial information from the whole image and local informative features from multiple patches selected by a novel patch selection module. Multi-head attention mechanism is further utilized to fuse the global and local features. We collect a highly diverse dataset synthesized by 19 models with various objects and resolutions to evaluate our model. Experimental results demonstrate the high accuracy and good generalization ability of our method in detecting generated images.

show abstract

DeepFake-o-meter: An Open Platform for DeepFake Detection

Zhang

Sun

et al. 2021

View full text Add to dashboard Cite

In recent years, the advent of deep learning-based techniques and the significant reduction in the cost of computation resulted in the feasibility of creating realistic videos of human faces, commonly known as DeepFakes. The availability of open-source tools to create DeepFakes poses as a threat to the trustworthiness of the online media. In this work, we develop an open-source online platform, known as DeepFake-o-meter, that integrates state-of-the-art DeepFake detection methods and provide a convenient interface for the users. We describe the design and function of DeepFake-o-meter in this work.

show abstract

12 3

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Lipeng Ke

Multi-Scale Structure-Aware Network for Human Pose Estimation

T_kML-AP: Adversarial Attacks to Top-k Multi-Label Learning

UA-DETRAC 2017: Report of AVSS2017 & IWT4S Challenge on Advanced Traffic Monitoring

Multi-Scale Structure-Aware Network for Human Pose Estimation

Global Contrast Enhancement Detection via Deep Multi-Path Network

Towards To-a-T Spatio-Temporal Focus for Skeleton-Based Action Recognition

Fusing Global and Local Features for Generalized AI-Synthesized Image Detection

DeepFake-o-meter: An Open Platform for DeepFake Detection

Contact Info

Product

Resources

About

Lipeng Ke

Multi-Scale Structure-Aware Network for Human Pose Estimation

TkML-AP: Adversarial Attacks to Top-k Multi-Label Learning

UA-DETRAC 2017: Report of AVSS2017 & IWT4S Challenge on Advanced Traffic Monitoring

Multi-Scale Structure-Aware Network for Human Pose Estimation

Global Contrast Enhancement Detection via Deep Multi-Path Network

Towards To-a-T Spatio-Temporal Focus for Skeleton-Based Action Recognition

Fusing Global and Local Features for Generalized AI-Synthesized Image Detection

DeepFake-o-meter: An Open Platform for DeepFake Detection

Contact Info

Product

Resources

About

T_kML-AP: Adversarial Attacks to Top-k Multi-Label Learning