K. Chaudhury scite author profile

K. Chaudhury

5 Publications

84 Citation Statements Received

88 Citation Statements Given

Affiliations

Google (United States), University of Kentucky, Adobe Systems (United States)

Publications

Auto-rectification of user photos

Chaudhury

DiVerdi

Ioffe

2014

The image auto-rectification project at Google aims to create a pleasanter version of user photos by correcting the small, involuntary camera rotations (roll/pitch/yaw) that often occur in non-professional photographs. Our system takes the image closer to the fronto-parallel view by performing an affine rectification on the image that restores parallelism of lines that are parallel in the fronto-parallel image view. This partially corrects perspective distortions, but falls short of full metric rectification, which also restores angles between lines. On the other hand, the 2D homography for our rectification can be computed from only two (as opposed to three) estimated vanishing points, allowing us to fire upon many more images. A new RANSAC-based approach to vanishing point estimation has been developed. The main strength of our vanishing point detector is that it is line-less, thereby avoiding the hard, binary (line/no-line) upstream decisions that cause traditional algorithms to ignore much supporting evidence and/or admit noisy evidence for vanishing points. A robust RANSAC-based technique for detecting horizon lines in an image is also proposed for analyzing correctness of the estimated rectification. We post-multiply our affine rectification homography with a 2D rotation which aligns the closer vanishing point with the image Y axis.
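
As a rough illustration of the affine rectification described in this abstract, the sketch below uses the standard textbook construction rather than the authors' own implementation. The vanishing line is taken as the cross product of the two vanishing points in homogeneous coordinates, and a homography whose last row is that line sends it to infinity, so lines that met at either vanishing point become parallel. The function names and sample coordinates are hypothetical, and the final 2D rotation mentioned in the abstract is omitted.

```python
def cross(a, b):
    """Cross product of two homogeneous 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def affine_rectification(vp1, vp2):
    """Build a 3x3 homography that maps the line through vp1 and vp2 to infinity.

    vp1 and vp2 are vanishing points in homogeneous image coordinates (x, y, w).
    Assumption: the vanishing line does not pass through the image origin,
    otherwise the resulting matrix would not be invertible.
    """
    line = cross(vp1, vp2)  # vanishing line l = vp1 x vp2
    if abs(line[2]) < 1e-12:
        raise ValueError("vanishing line passes through the origin")
    l1, l2 = line[0] / line[2], line[1] / line[2]  # scale so the third entry is 1
    return [[1.0, 0.0, 0.0],
            [0.0, 1.0, 0.0],
            [l1, l2, 1.0]]


def apply_homography(H, p):
    """Apply a 3x3 homography to a homogeneous point."""
    return tuple(sum(H[i][j] * p[j] for j in range(3)) for i in range(3))


if __name__ == "__main__":
    # Hypothetical vanishing points, chosen only for illustration.
    vp_a = (100.0, 0.0, 1.0)
    vp_b = (0.0, 200.0, 1.0)
    H = affine_rectification(vp_a, vp_b)
    # After rectification both vanishing points have a (near) zero third
    # coordinate, i.e. they lie at infinity.
    print(apply_homography(H, vp_a))
    print(apply_homography(H, vp_b))
```

Only two vanishing points are needed here because they already determine the vanishing line; recovering angles as well (metric rectification) would need additional constraints.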

Deep Learning based Large Scale Visual Recommendation and Search for E-Commerce

Shankar,

Narumanchi,

Ananya

et al. 2017

Preprint

In this paper, we present a unified end-to-end approach to build a large scale Visual Search and Recommendation system for e-commerce. Previous works have targeted these problems in isolation. We believe a more effective and elegant solution could be obtained by tackling them together. We propose a unified Deep Convolutional Neural Network architecture, called VisNet, to learn embeddings to capture the notion of visual similarity, across several semantic granularities. We demonstrate the superiority of our approach for the task of image retrieval, by comparing against the state-of-the-art on the Exact Street2Shop [14] dataset. We then share the design decisions and trade-offs made while deploying the model to power Visual Recommendations across a catalog of 50M products, supporting 2K queries a second at Flipkart, India's largest e-commerce company. The deployment of our solution has yielded a significant business impact, as measured by the conversion-rate.

ARMDN: Associative and Recurrent Mixture Density Networks for eRetail Demand Forecasting

Mukherjee,

Shankar,

Ghosh

et al. 2018

Preprint

Accurate demand forecasts can help on-line retail organizations better plan their supply-chain processes. The challenge, however, is the large number of associative factors that result in large, non-stationary shifts in demand, which traditional time series and regression approaches fail to model. In this paper, we propose a Neural Network architecture called AR-MDN, that simultaneously models associative factors, time-series trends and the variance in the demand. We first identify several causal features and use a combination of feature embeddings, MLP and LSTM to represent them. We then model the output density as a learned mixture of Gaussian distributions. The AR-MDN can be trained end-to-end without the need for additional supervision. We experiment on a dataset of a year's worth of data over tens of thousands of products from Flipkart. The proposed architecture yields a significant improvement in forecasting accuracy when compared with existing alternatives.
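
To make the phrase "output density as a learned mixture of Gaussian distributions" concrete, here is a minimal sketch of the negative log-likelihood that a mixture density network typically minimizes for a single scalar target. It is a generic formulation, not the AR-MDN code: the function name, the argument layout, and the example numbers are assumptions, and in a real model the weights, means, and standard deviations would be produced by the network rather than passed in by hand.

```python
import math


def mixture_nll(y, weights, means, stds):
    """Negative log-likelihood of y under a 1-D Gaussian mixture.

    Assumptions: weights are strictly positive and sum to 1 (e.g. a softmax
    output), and stds are strictly positive.
    """
    log_terms = []
    for w, mu, sigma in zip(weights, means, stds):
        # log of the Gaussian density N(y; mu, sigma^2)
        log_pdf = (-0.5 * math.log(2.0 * math.pi)
                   - math.log(sigma)
                   - 0.5 * ((y - mu) / sigma) ** 2)
        log_terms.append(math.log(w) + log_pdf)
    # log-sum-exp for numerical stability
    m = max(log_terms)
    log_likelihood = m + math.log(sum(math.exp(t - m) for t in log_terms))
    return -log_likelihood


if __name__ == "__main__":
    # Hypothetical two-component forecast for one product on one day.
    print(mixture_nll(12.0, [0.7, 0.3], [10.0, 25.0], [3.0, 8.0]))
```

Because the loss depends on the predicted spread as well as the predicted mean, a model trained this way can express how uncertain a forecast is, which matches the abstract's claim of modelling the variance in demand.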

Google Newspaper Search – Image Processing and Analysis Pipeline

Chaudhury

Jain

Thirthala

et al. 2009

A trajectory-based computational model for optical flow estimation

Chaudhury

Mehrotra

1995

IEEE Trans. Robot. Automat.

A new computational model for optical flow estimation is proposed. The proposed model utilizes trajectory information present in a multiframe spatio-temporal volume. Optical flow estimation is formulated as an optimization problem. The solution to this optimization problem yields a velocity field corresponding to the smoothest and shortest trajectories of constant intensity points within the spatio-temporal volume. The approach is motivated by principles of inertia of motion and least action in physics and vision psychology. An analogy between a trajectory and a "thin wire" is discussed. A simple mechanism for handling trajectory discontinuities is also incorporated. The optimization problem is solved by stochastic relaxation techniques. Some experimental results and performance comparisons with two existing optical flow estimation techniques are presented to demonstrate the effectiveness of the proposed approach.
