ActiveVisionPortal

About

This project develops an open portal for goal-directed vision research.

Datasets

COCO-Search18 dataset is available at https://sites.google.com/view/cocosearch/home.

Environment Setup

The environment dependencies are specified in environment.yaml. To set up the environment using conda, run:

  conda env create -f environment.yaml
  conda activate ActiveVision

Alternatively, for Compute Canada users, see the detailed section below.

Running the Framework

The main entry point is main.py, which provides a unified interface for training and evaluating different gaze prediction models.

Example Commands

List all available models:

  python main.py --list_models

Train a model:

  python main.py --model <model_name> --train --dataset datasets/COCO-Search18

Evaluate a model:

  python main.py --model <model_name> --eval --dataset datasets/COCO-Search18

Show model-specific help:

  python main.py --model <model_name> --help_model

You may also specify the dataset directory via --dataset, e.g., --dataset datasets/COCO-Search18.

Required Files

Due to storage constraints, some required files are hosted on Compute Canada. Please place them in exactly the same directory structure as referenced in the code. For example:

ActiveVisionPortal/
├── datasets/
│   └── COCO-Search18/
│   └── ...
├── models/
│   └── model 1/
│       └── checkpoint(s)/
│       └── data/
│       └── pretrained_models(if applicable)/
│       └── model/
│       └── entry.py
│       └── ...
│   └── model 2/
│   └── ...

Setting Up on Compute Canada

Step 1: Prepare Project Directory

Please place the entire project directory in a location of your choice on Compute Canada, which matches the directory structure shown in Required Files.

For example, you can use: ~/scratch/ActiveVisionPortal/

Step 2: Create Environment

Run the provided setup script:

  bash setup_env.sh

Compile Custom CUDA Operators:

  cd ~/scratch/ActiveVisionPortal/models/HAT/model/pixel_decoder/ops/
  dos2unix make.sh

Allocate a GPU node:

  salloc --gres=gpu:1 --mem=16G --cpus-per-task=4 --time=01:00:00

Inside the GPU shell:

  sh make.sh
  exit

Step 3: Running

Navigate to your project root (e.g., where main.py is located):

  cd ~/scratch/ActiveVisionPortal

Then follow the instructions in the Running the Framework section below to train or evaluate a model.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
models		models
.gitignore		.gitignore
CLIP_main.py		CLIP_main.py
LICENSE		LICENSE
README.md		README.md
environment.yaml		environment.yaml
main.py		main.py
model_registry.py		model_registry.py
run_vlm_parallel.sh		run_vlm_parallel.sh
setup_env.sh		setup_env.sh
vlm_parallel.py		vlm_parallel.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ActiveVisionPortal

About

Datasets

Environment Setup

Running the Framework

Example Commands

Required Files

Setting Up on Compute Canada

Step 1: Prepare Project Directory

Step 2: Create Environment

Step 3: Running

About

Uh oh!

Releases

Packages

Languages

License

lc542/ActiveVisionPortal

Folders and files

Latest commit

History

Repository files navigation

ActiveVisionPortal

About

Datasets

Environment Setup

Running the Framework

Example Commands

Required Files

Setting Up on Compute Canada

Step 1: Prepare Project Directory

Step 2: Create Environment

Step 3: Running

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages