Gordon Euhyun Moon scite author profile

Sparsity, which occurs in both scientific applications and Deep Learning (DL) models, has been a key optimization target in recent ASIC accelerators because of the potential memory and compute savings. These applications use data stored in a variety of compression formats. We demonstrate that both the compactness of different compression formats and the compute efficiency of the algorithms they enable vary with tensor dimensions and the amount of sparsity. Since DL and scientific workloads span all sparsity regions, there can be numerous format combinations for optimizing memory and compute efficiency. Unfortunately, many proposed accelerators operate on only one or two fixed format combinations. This work proposes hardware extensions to accelerators for supporting numerous format combinations seamlessly and demonstrates ∼4× speedup over performing format conversions in software.
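
As a rough illustration of the claim that format compactness depends on tensor dimensions and sparsity, the sketch below estimates the storage footprint of a matrix under four common layouts. The choice of formats (dense, COO, CSR, bitmap), the 4-byte value and index widths, and the function names `storage_bytes` and `most_compact` are assumptions made for this example; they are not taken from the paper.

```python
from math import ceil


def storage_bytes(rows, cols, nnz, value_bytes=4, index_bytes=4):
    """Estimate storage in bytes for a rows x cols matrix with nnz nonzeros.

    Value and index widths are illustrative assumptions.
    """
    return {
        # Dense: every element is stored, with no metadata.
        "dense": rows * cols * value_bytes,
        # COO: one value plus a (row, col) index pair per nonzero.
        "coo": nnz * (value_bytes + 2 * index_bytes),
        # CSR: one value and one column index per nonzero,
        # plus rows + 1 row pointers.
        "csr": nnz * (value_bytes + index_bytes) + (rows + 1) * index_bytes,
        # Bitmap: one value per nonzero plus one presence bit per element.
        "bitmap": nnz * value_bytes + ceil(rows * cols / 8),
    }


def most_compact(rows, cols, nnz):
    """Return the name of the format with the smallest estimated footprint."""
    sizes = storage_bytes(rows, cols, nnz)
    return min(sizes, key=sizes.get)


if __name__ == "__main__":
    # Sweep the number of nonzeros in a 1000 x 1000 matrix.
    for nnz in (1_000, 10_000, 500_000, 1_000_000):
        print(nnz, most_compact(1000, 1000, nnz))
```

Under these assumptions, the most compact layout for a 1000 × 1000 matrix moves from COO at 0.1% density to CSR at 1%, bitmap at 50%, and dense at 100%. Changing the shape also changes the result, since the CSR row-pointer array grows with the number of rows.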


Parallel Data-Local Training for Optimizing Word2Vec Embeddings for Word and Graph Embeddings

Moon, Newman-Griffis, Kim, et al. 2019


Gordon Euhyun Moon

ALO-NMF: Accelerated Locality-Optimized Non-negative Matrix Factorization

A Large-Scale Study in Predictability of Daily Activities and Places

A Layer-Parallel Approach for Training Deep Neural Networks

Extending Sparse Tensor Accelerators to Support Multiple Compression Formats

Parallel Data-Local Training for Optimizing Word2Vec Embeddings for Word and Graph Embeddings
