Alexey Artemov scite author profile

We introduce ABC-Dataset, a collection of one million Computer-Aided Design (CAD) models for research of geometric deep learning methods and applications. Each model is a collection of explicitly parametrized curves and surfaces, providing ground truth for differential quantities, patch segmentation, geometric feature detection, and shape reconstruction. Sampling the parametric descriptions of surfaces and curves allows generating data in different formats and resolutions, enabling fair comparisons for a wide range of geometric learning algorithms. As a use case for our dataset, we perform a large-scale benchmark for estimation of surface normals, comparing existing data driven methods and evaluating their performance against both the ground truth and traditional normal estimation methods.

show abstract

Perceptual Deep Depth Super-Resolution

Voynov

Artemov

Egiazarian

et al. 2019

View full text Add to dashboard Cite

RGBD images, combining high-resolution color and lower-resolution depth from various types of depth sensors, are increasingly common. One can significantly improve the resolution of depth maps by taking advantage of color information; deep learning methods make combining color and depth information particularly easy.However, fusing these two sources of data may lead to a variety of artifacts. If depth maps are used to reconstruct 3D shapes, e.g., for virtual reality applications, the visual quality of upsampled images is particularly important.The main idea of our approach is to measure the quality of depth map upsampling using renderings of resulting 3D surfaces. We demonstrate that a simple visual appearancebased loss, when used with either a trained CNN or simply a deep prior, yields significantly improved 3D shapes, as measured by a number of existing perceptual metrics. We compare this approach with a number of existing optimization and learning-based techniques.

show abstract

Latent Video Transformer

Rakhimov¹,

Volkhonskiy²,

Artemov³

et al. 2020

Preprint

View full text Add to dashboard Cite

The video generation task can be formulated as a prediction of future video frames given some past frames. Recent generative models for videos face the problem of high computational requirements. Some models require up to 512 Tensor Processing Units for parallel training. In this work, we address this problem via modeling the dynamics in a latent space. After the transformation of frames into the latent space, our model predicts latent representation for the next frames in an autoregressive manner. We demonstrate the performance of our approach on BAIR Robot Pushing and Kinetics-600 datasets. The approach tends to reduce requirements to 8 Graphical Processing Units for training the models while maintaining comparable generation quality.

show abstract

Deep Vectorization of Technical Drawings

Egiazarian

Voynov

Artemov

et al. 2020

View full text Add to dashboard Cite

We present a new method for vectorization of technical line drawings, such as floor plans, architectural drawings, and 2D CAD images. Our method includes (1) a deep learning-based cleaning stage to eliminate the background and imperfections in the image and fill in missing parts, (2) a transformer-based network to estimate vector primitives, and (3) optimization procedure to obtain the final primitive configurations. We train the networks on synthetic data, renderings of vector line drawings, and manually vectorized scans of line drawings. Our method quantitatively and qualitatively outperforms a number of existing techniques on a collection of representative technical drawings.

show abstract

Latent Video Transformer

Rakhimov

Volkhonskiy

Artemov

et al. 2021

View full text Add to dashboard Cite

Monocular 3D Object Detection via Geometric Reasoning on Keypoints

Barabanau¹,

Artemov²,

Burnaev³

et al. 2019

Preprint

View full text Add to dashboard Cite

Monocular 3D Object Detection via Geometric Reasoning on Keypoints

Barabanau

Artemov

Burnaev

et al. 2020

View full text Add to dashboard Cite

Voxelwise 3D Convolutional and Recurrent Neural Networks for Epilepsy and Depression Diagnostics from Structural and Functional MRI Data

Pominova

Artemov

Sharaev

et al. 2018

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Alexey Artemov

ABC: A Big CAD Model Dataset for Geometric Deep Learning

Perceptual Deep Depth Super-Resolution

Latent Video Transformer

Deep Vectorization of Technical Drawings

Latent Video Transformer

Monocular 3D Object Detection via Geometric Reasoning on Keypoints

Monocular 3D Object Detection via Geometric Reasoning on Keypoints

Voxelwise 3D Convolutional and Recurrent Neural Networks for Epilepsy and Depression Diagnostics from Structural and Functional MRI Data

Contact Info

Product

Resources

About