Capsule-based Object Tracking with Natural Language Specification

Ma, Ding; Wu, Xiangqian

doi:10.1145/3474085.3475349

Cited by 6 publications

(3 citation statements)

References 45 publications

(73 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In recent years, the two-stream framework [2], [4], [5], [8] has emerged as a dominant VL tracking paradigm (see Fig. 1(a)).…”

Section: A Vision-language Trackingmentioning

confidence: 99%

“…In the past few years, two-stream VL trackers [2], [4], [5], [8], which extract visual features and language features separately and then perform feature interaction in a fusion model (as shown in Fig 1(a)), have emerged as a domain framework and obtained significant progresses. For instance, Feng et al [4] proposed a Siamese natural language region proposal network for multi-stage feature extraction, and then applied an aggregation module to dynamically combine predictions from both visual and language modalities.…”

Section: Introductionmentioning

confidence: 99%

“…Firstly, the separation of feature extraction and integration prevents the model from performing early multi-modal feature interaction, resulting in limited objectbackground discriminative power [10], [11]. Although some works have attempted to design complicated [8] or multistage [4], [5] fusion models to enhance the associations between modalities, the lack of mutual interaction remains an insurmountable gap. More seriously, heavy fusion models increase the number of parameters, leading to significant computational inefficiency.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

School of Mechanical Engineering, Shanghai Jiao Tong University, 200240, Shanghai, China

Zhuang

Huang

et al. 2019

Mathematical Biosciences and Engineering

View full text Add to dashboard Cite

The crystallization kinetics and melting behavior of nylon 10,10 in neat nylon 10,10 and in nylon 10,10 -montmorillonite (MMT) nanocomposites were systematically investigated by differential scanning calorimetry. The crystallization kinetics results show that the addition of MMT facilitated the crystallization of nylon 10,10 as a heterophase nucleating agent; however, when the content of MMT was high, the physical hindrance of MMT layers to the motion of nylon 10,10 chains retarded the crystallization of nylon 10,10, which was also confirmed by polarized optical microscopy. However, both nylon 10,10 and nylon 10,10 -MMT nanocomposites exhibited multiple melting be-havior under isothermal and nonisothermal crystallization conditions. The temperature of the lower melting peak (peak I) was independent of MMT content and almost remained constant; however, the temperature of the highest melting peak (peak II) decreased with increasing MMT content due to the physical hindrance of MMT layers to the motion of nylon 10,10 chains.

show abstract

“…In recent years, the two-stream framework [2], [4], [5], [8] has emerged as a dominant VL tracking paradigm (see Fig. 1(a)).…”

Section: A Vision-language Trackingmentioning

confidence: 99%