torchvision 0.9.0

Breaking changes

Split coco_detection_dataset() into coco_detection_dataset() (detection only) and new coco_segmentation_dataset() (instance segmentation) reducing memory usage by ~50% (@Chandraveersingh1717, #280).
coco_classes() is now aligned with the 90 sparse pytorch COCO classses, in order to match pretrained model predictions. (@318).
Renamed $categories to $classes for consistency (character vector of class names; old attribute is deprecated with a warning) (#300).

New features

Added resolution function for COCO Imagenet_21k and Pascal_VOC classes and labels (#284).
Added article showcasing model_fcn_resnet50() with visualization utilities draw_segmentation_masks() and vision_make_grid() (@DerrickUnleashed, #281).
Added collection dataset catalog with search_collection(), get_collection_catalog(), and list_collection_datasets() functions for discovering and exploring collections (#271, @ANAMASGARD).
Added target_transform_coco_masks() and target_transform_trimap_masks() transformation functions for explicit segmentation mask generation (@ANAMASGARD).
Added support for connectivity argument for drawing lines between keypoints in draw_keypoints() (@DerrickUnleashed, #303).

New models

Added model_maskrcnn_resnet50_fpn() and model_maskrcnn_resnet50_fpn_v2() for instance segmentation (#278, @ANAMASGARD).
Added model_convnext_*_detection() for object detection, with * within tiny/small/base (#262, @ANAMASGARD).
Added model_convnext_*_fcn() and model_convnext_*_upernet() for semantic segmentation, with * within tiny/small/base (#265, @ANAMASGARD).

New datasets

Added vggface2_dataset() for loading the VGGFace2 dataset (@DerrickUnleashed, #238).
Added moth dataset to rf100_biology_collection() and currency and wine_label to rf100_document_collection() (#274).

Bug fixes and improvements

model_maskrcnn_*() and model_fasterrcnn_*() now match the pytorch implementation (#318).
transform_ now correctly manage batched 4D torch tensors and 4D arrays (#313).
mnist_datataset() and derivatives now correctly return item x() values with a 1-channel dimension (@Chandraveersingh1717, #307).
transform_affine(), transform_rotate() and random derivatives now use interpolation and fill parameter in favor of resampleand fillcolor (@Chandraveersingh1717, #299).
Fixed draw_keypoints() documentation and error message (@srishtiii28, #296).
Standardized dataset messages: download messages now include split information, success messages show image count and class count for consistency.
fix model_fasterrcnn_* did not provide boxes output normalized to image size, did not manage batches, fix performance of the roi_align() function (#284).
fix rf100 collection bounding-box now consider the correct native COCO format being 'xywh' (#272).
Remove .getbatch method from MNIST as it is providing inconsistent tensor dimensions with .getitem due to non-vectorized transform_ operations (#264).
Added article for draw_keypoints() (@DerrickUnleashed, #303).
Fix typos and align model documentation for model_deeplabv3_* and model_convnext_*_detection() to ensure consistency (@DerrickUnleashed #302).
Fixed draw_keypoints() documentation: corrected connectivity parameter type, obsolete note, colors description default (@srishtiii28 #296).

torchvision 0.8.0

New datasets

Added lfw_people_dataset() and lfw_pairs_dataset() for loading Labelled Faces in the Wild (LFW) datasets (@DerrickUnleashed, #203).
Added places365_dataset() for loading the Places365 dataset (@koshtiakanksha, #196).
Added pascal_segmentation_dataset(), and pascal_detection_dataset() for loading the Pascal Visual Object Classes datasets (@DerrickUnleashed, #209).
Added whoi_plankton_dataset(), whoi_small_plankton_dataset(), and whoi_small_coral_dataset() (@cregouby, #236).
Added rf100_document_collection(), rf100_medical_collection(), rf100_biology_collection(), rf100_damage_collection(), rf100_infrared_collection(), and rf100_underwater_collection(). These are collections of datasets from Roboflow 100 under the same theme, for a total of 35 datasets (@koshtiakanksha, @cregouby, #239).
Added rf100_peixos_segmentation_dataset(). (@koshtiakanksha, @cregouby, #250).

New models

Added model_maxvit() for MaxViT: Multi-Axis Vision Transformer (#229, @koshtiakanksha).
Added model_facenet_pnet(), model_facenet_rnet(), and model_facenet_onet() for Facenet MTCNN face detection models. (@DerrickUnleashed, #227)
Added model_mtcnn() and model_inception_resnet_v1() models for face detection and recognition. (@DerrickUnleashed, #217)
Added model_mobilenet_v3_large() and model_mobilenet_v3_small() models for efficient image classification. (@DerrickUnleashed, #237)
Added 8 of the model_convnext_() family models for image classification, thanks to @horlar1 contribution. (@cregouby, #251)
Added 2 model_fasterrcnn_resnet50_() models and 2 model_fasterrcnn_mobilenet_v3_large_() for object detection. (@koshtiakanksha, #251)

New features

Added imagenet_label() and imagenet_classes() for ImageNet classes resolution (#229, @koshtiakanksha).
base_loader() now accept URLs (@cregouby, #246).
draw_segmentation_masks() now accepts semantic segmentation models torch_float() output. (@cregouby #247)
MNIST datasets and Roboflow 100 collections now have a .getbatch attached method (@cregouby #255)

Bug fixes and improvements

Switch pre 0.5.0 models to their /v2/ URL in torch-cdn.mlverse.org. (#215)
Models are now separated in the documentation by tasks between classification, object detection, and semantic segmentation models (@cregouby, #247)
Breaking Change : Refactoring of coco_* dataset family now provides each item$x being an image array (for consistency with other datasets). You can use transform = transform_to_tensor to restore the previous x output to be a torch_tensor().
transform_ are now documented into 3 different categories: unitary transformations, random transformations and combining transformations. (@cregouby, #250)
Deprecation : emnist_dataset is deprecated in favor of emnist_collection() (@cregouby, #260).

torchvision 0.7.0

New datasets

Added fashion_mnist_dataset() for loading the Fashion-MNIST dataset (@koshtiakanksha, #148).
Added eurosat_dataset(), eurosat_all_bands_dataset(), and eurosat100_dataset() for loading RGB, all-band, and small-subset variants of the EuroSAT dataset (@cregouby, #126).
Added qmnist_dataset() for loading the QMNIST dataset (@DerrickUnleashed, #153).
Added emnist_dataset() for loading the EMNIST dataset (@DerrickUnleashed, #152).
Added fgvc_aircraft_dataset() for loading the FGVC-Aircraft dataset (@DerrickUnleashed, #156).
Added coco_detection_dataset() and coco_caption_dataset() for loading the MS COCO detection and captions datasets (@koshtiakanksha, #161, #172).
Added caltech101_dataset() and caltech256_dataset() for loading the Caltech 101 and 256 datasets (@DerrickUnleashed, #158).
Added fer_dataset() for loading the FER-2013 dataset (@DerrickUnleashed, #154).
Added flowers102_dataset() for loading the Flowers102 dataset (@DerrickUnleashed, #157).
Added flickr8k_dataset() and flickr30k_dataset() for loading the Flickr8k and Flickr30k datasets (@DerrickUnleashed, #159).
Added oxfordiiitpet_dataset(), oxfordiiitpet_binary_dataset(), and oxfordiiitpet_segmentation_dataset() for loading the Oxford-IIIT Pet datasets (@DerrickUnleashed, #162).
Added rf100_document_collection(), rf100_underwater_collection(), rf100_medical_collection(), rf100_biology_collection(), and rf100_peixos_segmentation_dataset() for loading Roboflow 100 datasets (@koshtiakanksha, #239).

New models

Added EfficientNet model family (B0–B7) – scalable CNNs for image classification. (#166, @koshtiakanksha)
Added EfficientNetV2 model family (V2-S/M/L) – improved EfficientNet models for faster training. (#166, @koshtiakanksha)
Added model_vit_b_16(), model_vit_b_32(), model_vit_l_16(), model_vit_l_32(), and model_vit_h_14() for loading Vision Transformer models (@DerrickUnleashed, #202).

New features

tensor_image_display() and tensor_image_browse() now accept all tensor_image dtypes (@cregouby, #115).
draw_bounding_boxes() and draw_segmentation_masks() now accept image_with_bounding_box and image_with_segmentation_mask inputs which are the default items class for respectively detection datasets and segmentation datasets (@koshtiakanksha, #175).
fgvc_aircraft_dataset() gains support for annotation_level = "all" (@DerrickUnleashed, #168).
folder_dataset() now supports TIFF image formats (@cregouby, #169).
New nms() and batched_nms() functions provide Non-Maximum Suppression utilities. Added box_convert() to convert between bounding box formats (@Athospd, #40).

Minor bug fixes and improvements

transform_rotation() now correctly uses width × height for image size instead of width × width (@cregouby, #114).
Clarified documentation for transform_affine() to reduce confusion with transform_random_affine() (@cregouby, #116).
Added French translations for message outputs (@cregouby, #112).

torchvision 0.6.0

Remove again dependency on zip::unzip added in version 0.4.0. (#89)
Improve performance on tinyimagenet-alexnet example (#90, @statist-bhfz)
Updated URL of downloaded resources to use the new torch CDN. (#109)

torchvision 0.5.1

Remove usage of torch_lstsq that was removed in torch v0.10.0

torchvision 0.5.0

Bugs fixed in transform_adjust_hue() and transform_linear_transformation() (#72, #73, @sebffischer)
add draw_bounding_boxes() , draw_segmentation_masks() and draw_keypoints() on top of image tensors, and add a convenience tensor_image_browse() and tensor_image_display() functions to visualize image tensors respectively in browser or in X11 device (#80, @cregouby)
Added the InceptionV3 model. (#82)

torchvision 0.4.1

Implemented MobileNetV2 (#60)
Improved vignettes so they use nnf_cross_entropy for numerical stability. (#61)
Implement the full list of ResNet model family (#66, @cregouby)
Improved how datasets and models are downloaded by using a large timeout by default and downloading to temporary file to avoid hard to debug errors when the files are corrupt. (#67)

torchvision 0.4.0

Added a dependency on zip to zip::unzip the tinyimagenet dataset.
Removed all usages of torch::enumerate() from docs and tests in favor of coro::loop() (#57)
Fixed non-namespaced calls to torch. (#58)

torchvision 0.3.0

Use a self hosted version of the MNIST dataset to avoid frequent download failures. (#48)
Fix torch_arange calls after breaking change in torch. (#47)
Fix bug in transform_resize when passing size with length 1. (#49)

torchvision 0.2.0

Fixed bugs in transform_rotate. (#31)
Fixed bugs in transform_random_affine and transform_affine (#32)
Added VGG model (#35)

torchvision 0.1.0

Added a NEWS.md file to track changes to the package.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

torchvision 0.9.0

Breaking changes

New features

New models

New datasets

Bug fixes and improvements

torchvision 0.8.0

New datasets

New models

New features

Bug fixes and improvements

torchvision 0.7.0

New datasets

New models

New features

Minor bug fixes and improvements

torchvision 0.6.0

torchvision 0.5.1

torchvision 0.5.0

torchvision 0.4.1

torchvision 0.4.0

torchvision 0.3.0

torchvision 0.2.0

torchvision 0.1.0

FilesExpand file tree

NEWS.md

Latest commit

History

NEWS.md

File metadata and controls

torchvision 0.9.0

Breaking changes

New features

New models

New datasets

Bug fixes and improvements

torchvision 0.8.0

New datasets

New models

New features

Bug fixes and improvements

torchvision 0.7.0

New datasets

New models

New features

Minor bug fixes and improvements

torchvision 0.6.0

torchvision 0.5.1

torchvision 0.5.0

torchvision 0.4.1

torchvision 0.4.0

torchvision 0.3.0

torchvision 0.2.0

torchvision 0.1.0