Han Zhang scite author profile

Synthesizing high-quality images from text descriptions is a challenging problem in computer vision and has many practical applications. Samples generated by existing textto-image approaches can roughly reflect the meaning of the given descriptions, but they fail to contain necessary details and vivid object parts. In this paper, we propose Stacked Generative Adversarial Networks (StackGAN) to generate 256×256 photo-realistic images conditioned on text descriptions. We decompose the hard problem into more manageable sub-problems through a sketch-refinement process. The Stage-I GAN sketches the primitive shape and colors of the object based on the given text description, yielding Stage-I low-resolution images. The Stage-II GAN takes Stage-I results and text descriptions as inputs, and generates high-resolution images with photo-realistic details. It is able to rectify defects in Stage-I results and add compelling details with the refinement process. To improve the diversity of the synthesized images and stabilize the training of the conditional-GAN, we introduce a novel Conditioning Augmentation technique that encourages smoothness in the latent conditioning manifold. Extensive experiments and comparisons with state-of-the-arts on benchmark datasets demonstrate that the proposed method achieves significant improvements on generating photo-realistic images conditioned on text descriptions.

show abstract

Atomic‐Layer Graphene as a Saturable Absorber for Ultrafast Pulsed Lasers

Bao

Zhang

Wang

et al. 2009

Adv Funct Materials

2,343

1,523

View full text Add to dashboard Cite

The optical conductance of monolayer graphene is defined solely by the fine structure constant, α = e 2 /ћc (where e is the electron charge, ћ is Dirac's constant and c is the speed of light). The absorbance has been predicted to be independent of frequency. In principle, the interband optical absorption in zero-gap graphene could be saturated readily under strong excitation due to Pauli blocking. Here, we demonstrate the use of atomic layer graphene as saturable absorber in a mode-locked fiber laser for the generation of ultrashort soliton pulses (756 fs) at the telecommunication band. The modulation depth can be tuned in a wide range from 66.5% to 6.2% by varying the thickness of graphene. Our results suggest that ultrathin graphene films are potentially useful as optical elements in fiber lasers. Graphene as a laser mode locker can have many merits such as lower saturation intensity, ultrafast recovery time, tunable modulation depth and wideband tuneability.

show abstract

AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks

et al. 2018

View full text Add to dashboard Cite

In this paper, we propose an Attentional Generative Adversarial Network (AttnGAN) that allows attention-driven, multi-stage refinement for fine-grained text-to-image generation. With a novel attentional generative network, the At-tnGAN can synthesize fine-grained details at different subregions of the image by paying attentions to the relevant words in the natural language description. In addition, a deep attentional multimodal similarity model is proposed to compute a fine-grained image-text matching loss for training the generator. The proposed AttnGAN significantly outperforms the previous state of the art, boosting the best reported inception score by 14.14% on the CUB dataset and 170.25% on the more challenging COCO dataset. A detailed analysis is also performed by visualizing the attention layers of the AttnGAN. It for the first time shows that the layered attentional GAN is able to automatically select the condition at the word level for generating different parts of the image.

show abstract

dbCAN2: a meta server for automated carbohydrate-active enzyme annotation

et al. 2018

View full text Add to dashboard Cite

Complex carbohydrates of plants are the main food sources of animals and microbes, and serve as promising renewable feedstock for biofuel and biomaterial production. Carbohydrate active enzymes (CAZymes) are the most important enzymes for complex carbohydrate metabolism. With an increasing number of plant and plant-associated microbial genomes and metagenomes being sequenced, there is an urgent need of automatic tools for genomic data mining of CAZymes. We developed the dbCAN web server in 2012 to provide a public service for automated CAZyme annotation for newly sequenced genomes. Here, dbCAN2 (http://cys.bios.niu.edu/dbCAN2) is presented as an updated meta server, which integrates three state-of-the-art tools for CAZome (all CAZymes of a genome) annotation: (i) HMMER search against the dbCAN HMM (hidden Markov model) database; (ii) DIAMOND search against the CAZy pre-annotated CAZyme sequence database and (iii) Hotpep search against the conserved CAZyme short peptide database. Combining the three outputs and removing CAZymes found by only one tool can significantly improve the CAZome annotation accuracy. In addition, dbCAN2 now also accepts nucleotide sequence submission, and offers the service to predict physically linked CAZyme gene clusters (CGCs), which will be a very useful online tool for identifying putative polysaccharide utilization loci (PULs) in microbial genomes or metagenomes.

show abstract

Finer resolution observation and monitoring of global land cover: first mapping results with Landsat TM and ETM+ data

Gong

Wang

et al. 2012

International Journal of Remote Sensing

1,276

772

View full text Add to dashboard Cite

P. Gong et al. land-cover classification system as well as the International Geosphere-Biosphere Programme (IGBP) system. Using the four classification algorithms, we obtained the initial set of global land-cover maps. The SVM produced the highest overall classification accuracy (OCA) of 64.9% assessed with our test samples, with RF (59.8%), J4.8 (57.9%), and MLC (53.9%) ranked from the second to the fourth. We also estimated the OCAs using a subset of our test samples (8629) each of which represented a homogeneous area greater than 500 m × 500 m. Using this subset, we found the OCA for the SVM to be 71.5%. As a consistent source for estimating the coverage of global land-cover types in the world, estimation from the test samples shows that only 6.90% of the world is planted for agricultural production. The total area of cropland is 11.51% if unplanted croplands are included. The forests, grasslands, and shrublands cover 28.35%, 13.37%, and 11.49% of the world, respectively. The impervious surface covers only 0.66% of the world. Inland waterbodies, barren lands, and snow and ice cover 3.56%, 16.51%, and 12.81% of the world, respectively.

show abstract

Ultrasmall Black Phosphorus Quantum Dots: Synthesis and Use as Photothermal Agents

et al. 2015

View full text Add to dashboard Cite

Black phosphorus quantum dots (BPQDs) were synthesized using a liquid exfoliation method that combined probe sonication and bath sonication. With a lateral size of approximately 2.6 nm and a thickness of about 1.5 nm, the ultrasmall BPQDs exhibited an excellent NIR photothermal performance with a large extinction coefficient of 14.8 L g(-1) cm(-1) at 808 nm, a photothermal conversion efficiency of 28.4%, as well as good photostability. After PEG conjugation, the BPQDs showed enhanced stability in physiological medium, and there was no observable toxicity to different types of cells. NIR photoexcitation of the BPQDs in the presence of C6 and MCF7 cancer cells led to significant cell death, suggesting that the nanoparticles have large potential as photothermal agents.

show abstract

StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks

Zhang

et al. 2019

IEEE Trans. Pattern Anal. Mach. Intell.

824

724

View full text Add to dashboard Cite

Although Generative Adversarial Networks (GANs) have shown remarkable success in various tasks, they still face challenges in generating high quality images. In this paper, we propose Stacked Generative Adversarial Networks (StackGANs) aimed at generating high-resolution photo-realistic images. First, we propose a two-stage generative adversarial network architecture, StackGAN-v1, for text-to-image synthesis. The Stage-I GAN sketches the primitive shape and colors of a scene based on a given text description, yielding low-resolution images. The Stage-II GAN takes Stage-I results and the text description as inputs, and generates high-resolution images with photo-realistic details. Second, an advanced multi-stage generative adversarial network architecture, StackGAN-v2, is proposed for both conditional and unconditional generative tasks. Our StackGAN-v2 consists of multiple generators and multiple discriminators arranged in a tree-like structure; images at multiple scales corresponding to the same scene are generated from different branches of the tree. StackGAN-v2 shows more stable training behavior than StackGAN-v1 by jointly approximating multiple distributions. Extensive experiments demonstrate that the proposed stacked generative adversarial networks significantly outperform other state-of-the-art methods in generating photo-realistic images.

show abstract

From Black Phosphorus to Phosphorene: Basic Solvent Exfoliation, Evolution of Raman Scattering, and Applications to Ultrafast Photonics

Guo

Zhang

et al. 2015

Adv Funct Materials

878

608

View full text Add to dashboard Cite

Although phosphorene has attracted much attention in electronics and optoelectronics as a new type of two‐dimensional material, in‐depth investigations and applications have been limited by the current synthesis techniques. Herein, a basic N‐methyl‐2‐pyrrolidone (NMP) liquid exfoliation method is described to produce phosphorene with excellent water stability, controllable size and layer number, as well as in high yield. Phosphorene samples composed of one to four layers exhibit layer‐dependent Raman scattering characteristics thus providing a fast and efficient means for the in situ determination of the thickness (layer number) of phosphorene. The linear and nonlinear ultrafast absorption behavior of the as‐exfoliated phosphorene is investigated systematically by UV–vis–NIR absorption and Z‐scan measurements. By taking advantage of their unique nonlinear absorption, ultrashort pulse generation applicable to optical saturable absorbers is demonstrated. In addition to a unique fabrication technique, our work also reveals the large potential of phosphorene in ultrafast photonics.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Han Zhang

StackGAN: Text to Photo-Realistic Image Synthesis with Stacked Generative Adversarial Networks

Atomic‐Layer Graphene as a Saturable Absorber for Ultrafast Pulsed Lasers

AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks

dbCAN2: a meta server for automated carbohydrate-active enzyme annotation

Finer resolution observation and monitoring of global land cover: first mapping results with Landsat TM and ETM+ data

Ultrasmall Black Phosphorus Quantum Dots: Synthesis and Use as Photothermal Agents

StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks

From Black Phosphorus to Phosphorene: Basic Solvent Exfoliation, Evolution of Raman Scattering, and Applications to Ultrafast Photonics

Contact Info

Product

Resources

About