mondalanindya/README.md

Hi, I'm Anindya Mondal 👋

About Me

I am a PhD Researcher at the Centre for Vision, Speech and Signal Processing (CVSSP) and the Surrey Institute for People-Centred AI, University of Surrey, Guildford, UK.

My research focuses on building robust vision systems by integrating multi-modal signals, with emphasis on object counting, multi-label action recognition, event-based vision, and text-to-image generation using vision-language and foundation models.

  • πŸ“ Guildford, UK
  • πŸŽ“ Teaching Assistant β€” Applied ML (EEEM068) & Advanced CV and Deep Learning (EEEM071), University of Surrey
  • πŸ’Ό Open to collaborations, research & industry opportunities
  • πŸ“« anindyam[dot]jan[at]gmail[dot]com

Publications

| Paper | Venue | Links |
|---|---|---|
| CountLoop: Training-Free High-Instance Image Generation via Iterative Agent Guidance | Preprint 2025 | arXiv, Code, Project |
| OmniCount: Multi-label Object Counting with Semantic-Geometric Priors | AAAI 2025 | arXiv, Code |
| MSQNet: Actor-agnostic Multi-label Action Recognition with Multi-modal Query | ICCVW 2023 | arXiv, Code |
| TimeGNN: Time-varying Signals Recovery via Graph Neural Networks | ICASSP 2023 | arXiv |
| Recovery of Missing Sensor Data by Reconstructing Time-varying Graph Signals | EUSIPCO 2022 | Code |
| Moving Object Detection for Event-based Vision using Graph Spectral Clustering | ICCVW 2021 | arXiv, Code |

Awards & Recognition

  • πŸ† AAAI 2025 Conference cum Travel Grant β€” Philadelphia, USA
  • πŸ† ICCV 2023 Conference Grant β€” Paris, France
  • πŸŽ“ University of Surrey Postgraduate Studentship (UK)
  • πŸ… Uplink Research Internship Award β€” ACM SIGKDD India Chapter

Research Interests

Computer Vision · Object Counting · Action Recognition · Text-to-Image Generation · Event-based Vision · Graph Signal Processing · Vision-Language Models · Foundation Models


Tech Stack

Python · PyTorch · TensorFlow · OpenCV · Jupyter · Git · Linux


Connect with Me

Email · LinkedIn · Twitter · Website · arXiv · DBLP


⭐ If you find my work useful, consider starring the repositories!

Pinned

  1. MSQNet (Public)

    Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]

    Python · 24 stars · 1 fork

  2. ICCVW2021_GSCEventMOD (Public archive)

    Moving Object Detection for Event-based Vision using Graph Spectral Clustering (Python implementation)

    Jupyter Notebook · 27 stars · 6 forks

  3. OmniCount (Public)

    [AAAI 2025] Official code for "OmniCount: Multi-label Object Counting with Semantic-Geometric Priors"

    Python · 21 stars · 2 forks