Behjat Siddiquie

Amazon, Seattle, WA
behjats at gmail dot com
CV
I am a Senior Applied Scientist at Amazon, working on Amazon Go, building technology for cashierless shopping. I led the research effort to launch Amazon Go's Just Walk Out technology in third party stores. Our team built a computer vision based system to generate, maintain and update, in real-time, a complete map of product placement in a physical store (also known as a store planogram), which was a pre-requisite to launching the Just Walk Out technology in third party stores. This system currently runs in over a hundred third party stores worldwide and is based on deep learning models for 3D object detection and large-scale product identification.

Previously, I was a Senior Computer Scientist at SRI International, where I worked on real-time human behavior recognition and interaction modeling using multimodal sensors, and on using image analytics to enhance social media based security applications.

I completed my PhD from the University of Maryland, College Park, under the guidance of Prof. Larry Davis. In my thesis, I addressed the problem of efficiently retrieving images from large databases based on complex and multi-modal queries. Before that, I completed my BTech. from the Indian Institute of Technology, Bombay.

Selected Publications (Complete List)

Google Scholar Citations
  • LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation
    IEEE International Conference on Computer Vision, (ICCV) 2023, (Oral Presentation)
    Koutilya PNVR, Bharat Singh, Pallabi Ghosh, Behjat Siddiquie and David Jacobs
    [paper][code][www]

  • Energy-Based Learning for Scene Graph Generation
    IEEE Conference on Computer Vision and Pattern Recognition, (CVPR) 2021, (nominated for the best paper award)
    Mohammed Suhail, Abhay Mittal, Behjat Siddiquie, Chris Broaddus, Jayan Eledath, Gerard Medioni and Leonid Sigal
    [paper][code][video]

  • Deep Multimodal Fusion: A Hybrid Approach
    International Journal of Computer Vision, (IJCV) 2018
    Mohamed Amer, Tim Shields, Behjat Siddiquie, Amir Tamrakar, Ajay Divakaran and Sek Chai
    [link]

  • Multi-Modal Image Retrieval for Complex Queries using Small Codes
    ACM International Conference on Multimedia Retrieval, (ICMR) 2014
    Behjat Siddiquie, Brandyn White, Abhishek Sharma and Larry S. Davis
    [pdf] [poster] [Supplementary Material]

  • Emotion Detection in Speech using Deep Networks
    IEEE International Conference on Acoustics, Speech, and Signal Processing, (ICASSP) 2014
    Mohamed Amer, Behjat Siddiquie, Colleen Richey and Ajay Divakaran
    [pdf] [poster]

  • Large-Scale Vehicle Detection, Indexing, and Search in Urban Surveillance Videos
    IEEE Transactions on Multimedia, 2012
    Rogerio Feris, Behjat Siddiquie, James Petterson, Yun Zhai, Ankur Datta, Lisa Brown and Sharath Pankanti
    [link]

  • Image Ranking and Retrieval based on Multi-Attribute Queries
    IEEE Conference on Computer Vision and Pattern Recognition, (CVPR) 2011, (Oral Presentation)
    Behjat Siddiquie, Rogerio S. Feris and Larry S. Davis
    [pdf] [slides] [talk]

  • Beyond Active Noun Tagging: Modeling Contextual Interactions for Multi-Class Active Learning
    IEEE Conference on Computer Vision and Pattern Recognition, (CVPR) 2010, (Oral Presentation)
    Behjat Siddiquie and Abhinav Gupta
    [pdf] [slides] [poster] [talk]

  • Incremental Multiple Kernel Learning for Object Recognition
    IEEE International Conference on Computer Vision, (ICCV) 2009
    Aniruddha Kembhavi, Behjat Siddiquie, Roland Miezianko, Scott McCloskey and Larry S. Davis
    [pdf] [poster] [video]