A2: Semantic Scene Understanding for 3D Representation

[**Wesley Haverkort**](mailto:w.j.haverkort@student.rug.nl)

Sync slides

https://docs.google.com/presentation/d/1ioU0Bz3pH50mXnqPpxjEqHX3ZRrUq6BBKUTdcfEVIbw/edit?usp=drive_link

Progress

Last updated: Jan 30 2026

Overview

This project focuses on semantic scene understanding and 3D representation using RGB-D data.

Instead of tracking individual objects, the system processes the entire image and assigns a semantic label to each pixel. These labels are then combined with depth data to construct a semantically enriched 3D representation of the scene.

The pipeline is designed to support:

Semantic segmentation (2D)
Semantic point cloud generation (3D)
Future integration with Gaussian Splatting for multi-view scene reconstruction

Setup

[ROS2 Humble](https://docs.ros.org/en/humble/Installation.html) A LTS ROS2 Distribution for Ubuntu Jammy (22.04).

Install [PyTorch](https://pytorch.org/get-started/locally/). If you are using a Jetson Orin Nano either install PyTorch using NVIDIA's guides or install the following wheels:

	pip install https://pypi.jetson-ai-lab.io/jp6/cu126/+f/62a/1beee9f2f1470/torch-2.8.0-cp310-cp310-linux_aarch64.whl#sha256=62a1beee9f2f147076a974d2942c90060c12771c94740830327cae705b2595fc

Install torchvision:

	pip install https://pypi.jetson-ai-lab.io/jp6/cu126/+f/907/c4c1933789645/torchvision-0.23.0-cp310-cp310-linux_aarch64.whl#sha256=907c4c1933789645ebb20dd9181d40f8647978e6bd30086ae7b01febb937d2d1

[realsense-ros](https://github.com/realsenseai/realsense-ros) A ROS wrapper for Intel® RealSense™ cameras.

Usage

Run the following in the working directory:

source /opt/ros/humble/setup.bash
colcon build --packages-select localizer
source install/setup.bash
ros2 run localizer config_ui

Setup your parameters and click start. Click somewhere on the image to start tracking that object (It can only track things that are trained in YOLO).

Remote viewing

To remotely view your screen use nomachine by starting the nxserver service and connecting to it using the nomachine app on your mobile device.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
rviz		rviz
src/localizer		src/localizer
.gitignore		.gitignore
README.md		README.md
Starting form MSc Internship CS.pdf		Starting form MSc Internship CS.pdf
start_camera.bash		start_camera.bash
verify_marker.py		verify_marker.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A2: Semantic Scene Understanding for 3D Representation

Sync slides

Progress

Overview

Setup

Usage

Remote viewing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

A2: Semantic Scene Understanding for 3D Representation

Sync slides

Progress

Overview

Setup

Usage

Remote viewing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages