Existing semantic segmentation models heavily rely on dense pixelwise annotations. To reduce the annotation pressure, we focus on a challenging task named zero-shot semantic segmentation, which aims to segment unseen objects with zero annotations. This task can be accomplished by transferring knowledge across categories via semantic word embeddings. In this paper, we propose a novel context-aware feature generation method for zero-shot segmentation named CaGNet. In particular, with the observation that a pixel-wise feature highly depends on its contextual information, we insert a contextual module in a segmentation network to capture the pixel-wise contextual information, which guides the process of generating more diverse and context-aware features from semantic word embeddings. Our method achieves state-of-the-art results on three benchmark datasets for zero-shot segmentation. CCS CONCEPTS • Computing methodologies → Image segmentation.
In this paper, a novel strategy based on a metasurface composed of simple and compact unit cells to achieve ultra-high-speed trigonometric operations under specific input values is theoretically and experimentally demonstrated. An electromagnetic wave (EM)-based optical diffractive neural network with only one hidden layer is physically built to perform four trigonometric operations (sine, cosine, tangent, and cotangent functions). Under the unique composite input mode strategy, the designed optical trigonometric operator responds to incident light source modes that represent different trigonometric operations and input values (within one period), and generates correct and clear calculated results in the output layer. Such a wave-based operation is implemented with specific input values, and the proposed concept work may offer breakthrough inspiration to achieve integrable optical computing devices and photonic signal processors with ultra-fast running speeds.
Web search is an essential way for human to obtain information, but it's still a great challenge for machines to understand the contents of web pages. In this paper, we introduce the task of web-based structural reading comprehension. Given a web page and a question about it, the task is to find an answer from the web page. This task requires a system not only to understand the semantics of texts but also the structure of the web page. Moreover, we proposed WebSRC, a novel Web-based Structural Reading Comprehension dataset. WebSRC consists of 0.44M question-answer pairs, which are collected from 6.5K web pages with corresponding HTML source code, screenshots, and metadata. Each question in WebSRC requires a certain structural understanding of a web page to answer, and the answer is either a text span on the web page or yes/no. We evaluate various strong baselines on our dataset to show the difficulty of our task. We also investigate the usefulness of structural information and visual features. Our dataset and task are publicly available at https://speechlab-sjtu. github.io/WebSRC/.
In this paper, we design and demonstrate a compact logic operator based on a single-layer metasurface at microwave frequency. By mapping the nodes in the trained fully connected neural network (FCNN) to the specific unit cells with phase control function of the metasurface, a logic operator with only one hidden layer is physically realized. When the incident wave illuminates specific operating regions of the metasurface, corresponding unit cells are activated and can scatter the incident wave to two designated zones containing logical information in the output layer. The proposed metasurface logic operator is experimentally verified to achieve three basic logic operations (NOT, OR, and AND) under different input signals. Our design shows great application potential in compact optical systems, low-power consumption information transmission, and ultrafast wave-based full signal processing.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.