Know which exact frames are included in your samples

## 🚀 Feature
For traceability and reproducibility of a dataset, it is essential to know which exact frames are included in each definitive clip created by the `FrameVideo.get_clip()` method. This information is available in the current implementation, but it is not passed downstream.

## Motivation
Models are deemed to be transparent by upcoming legislation such as the [AI Act in Europe](https://artificialintelligenceact.eu/high-level-summary/) and [in the US](https://www.nextgov.com/artificial-intelligence/2024/01/bill-sets-transparency-standards-ai-models-including-use-copyrighted-material/393052/). Traceability and reproducibility are key factors for AI models in terms of transparency. Knowing which exact frames are included in your train/test sample clips is vital for traceability and reproducibility.

## Pitch
`FrameVideo.get_clip()` returns the "frame_indices" in its output dictionary. However, this information is not processed by `LabeledVideoDataset.__next__()`. The solution is as simple as adding the "frame_indices" key and corresponding value to `sample_dict`, returned by `LabeledVideoDataset.__next__()`:

From [LabeledVideoDataset, L215](https://github.com/facebookresearch/pytorchvideo/blob/1fadaef40dd393ca09680f55582399f4679fc9b7/pytorchvideo/data/labeled_video_dataset.py#L215)
``` 
            frames = self._loaded_clip["video"]
            audio_samples = self._loaded_clip["audio"]
            frame_indices = self._loaded_clip["frame_indices"]
            sample_dict = {
                "video": frames,
                "video_name": video.name,
                "video_index": video_index,
                "frame_indices": frame_indices,
                "clip_index": clip_index,
                "aug_index": aug_index,
                **info_dict,
                **({"audio": audio_samples} if audio_samples is not None else {}),
            }

```

Alternatively, the frame_indices information could be added to the `info_dict`.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Know which exact frames are included in your samples #260

🚀 Feature

Motivation

Pitch

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Know which exact frames are included in your samples #260

Description

🚀 Feature

Motivation

Pitch

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions