I am a Senior Applied Scientist at Amazon, working on Amazon Go, building technology for cashierless shopping. I led the research effort to launch Amazon Go's Just Walk Out technology in third-party stores. Our team built a computer-vision-based system to generate, maintain and update, in real time, a complete map of product placement in a physical store (also known as a store planogram), which was a prerequisite for launching the Just Walk Out technology in third-party stores. This system currently runs in over a hundred third-party stores worldwide and is based on deep learning models for 3D object detection and large-scale product identification.
Previously, I was a Senior Computer Scientist at SRI International, where I worked on real-time human behavior recognition and interaction modeling using multimodal sensors, and on using image analytics to enhance social-media-based security applications.
I completed my PhD at the University of Maryland, College Park, under the guidance of Prof. Larry Davis. In my thesis, I addressed the problem of efficiently retrieving images from large databases based on complex and multi-modal queries. Before that, I completed my B.Tech. at the Indian Institute of Technology, Bombay.
Google Scholar Citations
- LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation
IEEE International Conference on Computer Vision, (ICCV) 2023, (Oral Presentation)
Koutilya PNVR, Bharat Singh, Pallabi Ghosh, Behjat Siddiquie and David Jacobs
[paper][code][www]
- Energy-Based Learning for Scene Graph Generation
IEEE Conference on Computer Vision and Pattern Recognition, (CVPR) 2021, (nominated for the best paper award)
Mohammed Suhail, Abhay Mittal, Behjat Siddiquie, Chris Broaddus, Jayan Eledath, Gerard Medioni and Leonid Sigal
[paper][code][video]
- Deep Multimodal Fusion: A Hybrid Approach
International Journal of Computer Vision, (IJCV) 2018
Mohamed Amer, Tim Shields, Behjat Siddiquie, Amir Tamrakar, Ajay Divakaran and Sek Chai
[link]
- Multi-Modal Image Retrieval for Complex Queries using Small Codes
ACM International Conference on Multimedia Retrieval, (ICMR) 2014
Behjat Siddiquie, Brandyn White, Abhishek Sharma and Larry S. Davis
[pdf]
[poster]
[Supplementary Material]
- Emotion Detection in Speech using Deep Networks
IEEE International Conference on Acoustics, Speech, and Signal Processing, (ICASSP) 2014
Mohamed Amer, Behjat Siddiquie, Colleen Richey and Ajay Divakaran
[pdf]
[poster]
- Large-Scale Vehicle Detection, Indexing, and Search in Urban Surveillance Videos
IEEE Transactions on Multimedia, 2012
Rogerio Feris, Behjat Siddiquie, James Petterson, Yun Zhai, Ankur Datta, Lisa Brown and Sharath Pankanti
[link]
- Image Ranking and Retrieval based on Multi-Attribute Queries
IEEE Conference on Computer Vision and Pattern Recognition, (CVPR) 2011, (Oral Presentation)
Behjat Siddiquie, Rogerio S. Feris and Larry S. Davis
[pdf]
[slides]
[talk]
- Beyond Active Noun Tagging: Modeling Contextual Interactions for Multi-Class Active Learning
IEEE Conference on Computer Vision and Pattern Recognition, (CVPR) 2010, (Oral Presentation)
Behjat Siddiquie and Abhinav Gupta
[pdf]
[slides]
[poster]
[talk]
- Incremental Multiple Kernel Learning for Object Recognition
IEEE International Conference on Computer Vision, (ICCV) 2009
Aniruddha Kembhavi, Behjat Siddiquie, Roland Miezianko, Scott McCloskey and Larry S. Davis
[pdf]
[poster]
[video]