Oleksii Sidorov scite author profile

Deep learning algorithms have demonstrated state-ofthe-art performance in various tasks of image restoration. This was made possible through the ability of CNNs to learn from large exemplar sets. However, the latter becomes an issue for hyperspectral image processing where datasets commonly consist of just a few images. In this work, we propose a new approach to denoising, inpainting, and superresolution of hyperspectral image data using intrinsic properties of a CNN without any training. The performance of the given algorithm is shown to be comparable to the performance of trained networks, while its application is not restricted by the availability of training data. This work is an extension of original "deep prior" algorithm to hyperspectral imaging domain and 3D-convolutional networks.

show abstract

TextCaps: A Dataset for Image Captioning with Reading Comprehension

Sidorov

Rohrbach

et al. 2020

105

112

View full text Add to dashboard Cite

Conditional GANs for Multi-Illuminant Color Constancy: Revolution or yet Another Approach?

Sidorov

2019

View full text Add to dashboard Cite

Changing the Image Memorability: From Basic Photo Editing to GANs

Sidorov

2019

View full text Add to dashboard Cite

Memorability is considered to be an important characteristic of visual content, whereas for advertisement and educational purposes it is often crucial. Despite numerous studies on understanding and predicting image memorability, there are almost no achievements in memorability modification. In this work, we study two approaches to image editing -GAN and classical image processing -and show their impact on memorability.The visual features which influence memorability directly stay unknown till now, hence it is impossible to control it manually. As a solution, we let GAN learn it deeply using labeled data, and then use it for conditional generation of new images. By analogy with algorithms which edit facial attributes, we consider memorability as yet another attribute and operate with it in the same way. Obtained data is also interesting for analysis, simply because there are no real-world examples of successful change of image memorability while preserving its other attributes. We believe this may give many new answers to the question "what makes an image memorable?"Apart from that we also study the influence of conventional photo-editing tools (Photoshop, Instagram, etc.) used daily by a wide audience on memorability. In this case, we start from real practical methods and study it using statistics and recent advances in memorability prediction. Photographers, designers, and advertisers will benefit from the results of this study directly. Figure 1: Modification of memorability using the proposed algorithm. All the results were generated without any human intervention."What" and "how" to change were learned by the model from experimental data.

show abstract

Artificial Color Constancy via GoogLeNet with Angular Loss Function

Sidorov

2020

Applied Artificial Intelligence

View full text Add to dashboard Cite

Color Constancy is the ability of the human visual system to perceive colors unchanged independently of the illumination. Giving a machine this feature will be beneficial in many fields where chromatic information is used. Particularly, it significantly improves scene understanding and object recognition.In this paper, we propose transfer learning-based algorithm, which has two main features: accuracy higher than many state-of-the-art algorithms and simplicity of implementation. Despite the fact that GoogLeNet was used in the experiments, given approach may be applied to any CNN. Additionally, we discuss design of a new loss function oriented specifically to this problem, and propose a few the most suitable options.

show abstract

Are All the Frames Equally Important?

Sidorov

Pedersen

Kim³

et al. 2020

View full text Add to dashboard Cite

In this work, we address the problem of measuring and predicting temporal video saliency -a metric which defines the importance of a video frame for human attention. Unlike the conventional spatial saliency which defines the location of the salient regions within a frame (as it is done for still images), temporal saliency considers importance of a frame as a whole and may not exist apart from context. The proposed interface is an interactive cursor-based algorithm for collecting experimental data about temporal saliency. We collect the first human responses and perform their analysis. As a result, we show that qualitatively, the produced scores have very explicit meaning of the semantic changes in a frame, while quantitatively being highly correlated between all the observers. Apart from that, we show that the proposed tool can simultaneously collect fixations similar to the ones produced by eye-tracker in a more affordable way. Further, this approach may be used for creation of first temporal saliency datasets which will allow training computational predictive algorithms. The proposed interface does not rely on any special equipment, which allows to run it remotely and cover a wide audience.

show abstract

Craquelure as a Graph: Application of Image Processing and Graph Neural Networks to the Description of Fracture Patterns

Sidorov

Hardeberg

2019

View full text Add to dashboard Cite

Novel Approach to Uniformization of a Color Space via Generic Deep Learning-Based Transformation

Sidorov

2018

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Oleksii Sidorov

Deep Hyperspectral Prior: Single-Image Denoising, Inpainting, Super-Resolution

TextCaps: A Dataset for Image Captioning with Reading Comprehension

Conditional GANs for Multi-Illuminant Color Constancy: Revolution or yet Another Approach?

Changing the Image Memorability: From Basic Photo Editing to GANs

Artificial Color Constancy via GoogLeNet with Angular Loss Function

Are All the Frames Equally Important?

Craquelure as a Graph: Application of Image Processing and Graph Neural Networks to the Description of Fracture Patterns

Novel Approach to Uniformization of a Color Space via Generic Deep Learning-Based Transformation

Contact Info

Product

Resources

About