Multi-View Foundation Models

This repo is the official implementation of Multi-View Foundation Models, by Leo Segre*, Or Hirschorn*, and Shai Avidan.

Introduction

We introduce a novel framework that transforms existing 2D Foundation Models (like DINO, SAM, and CLIP) into Multi-View Foundation Models. Current 2D models process images independently, leading to inconsistent feature representations for the same 3D point viewed from multiple camera angles.
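To make the inconsistency concrete, one simple way to measure it is the cosine similarity between the features a model assigns to the same 3D point in two different views. The sketch below is illustrative only: the feature values are toy numbers, and `cosine_similarity` is a hypothetical helper, not a function from this repo or necessarily the metric used in the paper.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Toy (made-up) features of one 3D point as seen from two camera views.
feat_view_a = [1.0, 0.0, 1.0]
feat_view_b = [1.0, 1.0, 0.0]

# A value near 1.0 means the two views agree; lower values mean the
# per-view features drift apart for the same underlying point.
consistency = cosine_similarity(feat_view_a, feat_view_b)
print(f"cross-view consistency: {consistency:.2f}")
```

A multi-view-consistent model would be expected to push this score toward 1.0 for corresponding points.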

Setup/Install

We recommend using Anaconda or Miniconda. To set up the environment, follow the instructions below.

Create environment

conda create --name multi_view_foundation_models -y python=3.10
conda activate multi_view_foundation_models
pip install torch==2.8.0 torchvision torchaudio --index-url https://download.pytorch.org/whl/cu128
pip install -r requirements.txt
pip install -e .
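After installation, a quick sanity check can confirm that the interpreter and the PyTorch packages are visible inside the activated environment. The snippet below is a minimal sketch using only the standard library; `check_environment` is a hypothetical helper and is not shipped with this repo.

```python
import importlib.util
import sys


def check_environment(packages=("torch", "torchvision", "torchaudio")):
    """Report the Python version and whether each package can be located."""
    report = {"python": f"{sys.version_info.major}.{sys.version_info.minor}"}
    for name in packages:
        # find_spec returns None when a top-level package is not installed.
        report[name] = importlib.util.find_spec(name) is not None
    return report
```

Calling `check_environment()` in the new environment should report Python 3.10 and `True` for each package if the steps above succeeded.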

Demo

To run the demo on a sample scene (Pikachu), use the command below. It will download the Pikachu scene and the pretrained DINOv2_reg model and run a visual correspondence comparison.

python demo.py

Data

Download the Generalization dataset from this link.

ScanNet++: each new user must submit an application for access to ScanNet++ through its official website. Under the ScanNet++ Terms of Use, we can share the preprocessed data only with people who have also signed the Terms of Use and been granted access to ScanNet++. Once the ScanNet++ team approves your application, forward the approval email to leosegre@mail.tau.ac.il and we will share our preprocessed data with you directly.

Extract Features

To extract features from images using a specific foundation model, run the following command. For example, for DINOv2:

python test/extract_features.py --exp_name dinov2_reg --colmap_path {path/to/data/root/dir} --exp_directory experiments --scene pikachu --load_pretrained

Training

Run the training script for the relevant model. For example, for DINOv2:

python train/train_dino.py --exp_name {exp_name} --colmap_path {path/to/data/root/dir} --exp_directory {path/to/exp/dir} --config_name dinov2_reg.yaml

If you don't have the camera parameters, use the regular training script with the no-plucker config (the dataloader will automatically use dummy poses):

python train/train_dino.py --exp_name {exp_name} --colmap_path {path/to/data/root/dir} --exp_directory {path/to/exp/dir} --config_name dino_v2_reg_no_plucker.yaml
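The choice between the two configs can be wrapped in a small launcher. The sketch below only assembles the command line from the flags and config names shown above; `build_train_command` and the example argument values are hypothetical and are not part of this repo.

```python
import shlex


def build_train_command(exp_name, colmap_path, exp_directory, has_camera_params=True):
    """Assemble the train_dino.py invocation as an argument list."""
    # Config names are copied from the two commands above.
    if has_camera_params:
        config_name = "dinov2_reg.yaml"
    else:
        config_name = "dino_v2_reg_no_plucker.yaml"
    return [
        "python", "train/train_dino.py",
        "--exp_name", exp_name,
        "--colmap_path", colmap_path,
        "--exp_directory", exp_directory,
        "--config_name", config_name,
    ]


# Placeholder values; substitute your own experiment name and paths.
cmd = build_train_command("my_run", "data/root", "experiments", has_camera_params=False)
print(shlex.join(cmd))
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` from the repository root to launch training.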

Testing

python test/test_3d.py --exp_directory {exp_dir} --exp_name {exp_name} --colmap_path {path/to/data/root/dir} --results_dir {path/to/results/dir} --compare_to_base --fit3d

To test our pretrained models, use the command below. You can change the model type by setting exp_name to one of {dinov2_reg, dinov2_reg_no_plucker, dinov3, clip, sam}.

python test/test_3d.py --load_pretrained --exp_directory {exp_dir} --exp_name dinov2_reg --colmap_path {path/to/data/root/dir} --results_dir {path/to/results/dir} --compare_to_base --fit3d

If you don't have the camera parameters, use the standard test script with the no-plucker experiment name:

python test/test_3d.py --load_pretrained --exp_directory {exp_dir} --exp_name dinov2_reg_no_plucker --colmap_path {path/to/data/root/dir} --results_dir {path/to/results/dir} --compare_to_base --fit3d
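To evaluate every pretrained model in one pass, the test command can be generated per experiment name. This is a sketch under assumptions: the flags and experiment names come from the commands above, while `build_test_command`, the placeholder paths, and the per-model results directory layout are hypothetical.

```python
import shlex

# Experiment names listed above for the pretrained models.
PRETRAINED_EXP_NAMES = ["dinov2_reg", "dinov2_reg_no_plucker", "dinov3", "clip", "sam"]


def build_test_command(exp_name, exp_directory, colmap_path, results_dir):
    """Assemble the test_3d.py invocation for one pretrained model."""
    return [
        "python", "test/test_3d.py", "--load_pretrained",
        "--exp_directory", exp_directory,
        "--exp_name", exp_name,
        "--colmap_path", colmap_path,
        "--results_dir", results_dir,
        "--compare_to_base", "--fit3d",
    ]


# Assumption: one results sub-directory per model keeps outputs separate.
commands = [
    build_test_command(name, "experiments", "data/root", f"results/{name}")
    for name in PRETRAINED_EXP_NAMES
]
for cmd in commands:
    print(shlex.join(cmd))
```

Each printed line can be run as-is from the repository root, or each list can be passed to `subprocess.run(cmd, check=True)`.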

BibTeX

If you find our models useful, please consider citing our paper!

@article{MultiViewFoundationModels2025,
      title={Multi-View Foundation Models}, 
      author={Leo Segre and Or Hirschorn and Shai Avidan},
      year={2025},
      eprint={2512.15708},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2512.15708}, 
}

