torchvision 0.9.0

Breaking changes

  • Split coco_detection_dataset() into coco_detection_dataset() (detection only) and the new coco_segmentation_dataset() (instance segmentation), reducing memory usage by ~50% (@Chandraveersingh1717, #280).
  • coco_classes() is now aligned with the 90 sparse PyTorch COCO classes, in order to match pretrained model predictions (#318).
  • Renamed $categories to $classes for consistency (character vector of class names; the old attribute is deprecated with a warning) (#300).

New features

  • Added resolution functions for COCO, Imagenet_21k, and Pascal_VOC classes and labels (#284).
  • Added article showcasing model_fcn_resnet50() with visualization utilities draw_segmentation_masks() and vision_make_grid() (@DerrickUnleashed, #281).
  • Added collection dataset catalog with search_collection(), get_collection_catalog(), and list_collection_datasets() functions for discovering and exploring collections (#271, @ANAMASGARD).
  • Added target_transform_coco_masks() and target_transform_trimap_masks() transformation functions for explicit segmentation mask generation (@ANAMASGARD).
  • Added support for connectivity argument for drawing lines between keypoints in draw_keypoints() (@DerrickUnleashed, #303).

New models

  • Added model_maskrcnn_resnet50_fpn() and model_maskrcnn_resnet50_fpn_v2() for instance segmentation (#278, @ANAMASGARD).
  • Added model_convnext_*_detection() for object detection, with * within tiny/small/base (#262, @ANAMASGARD).
  • Added model_convnext_*_fcn() and model_convnext_*_upernet() for semantic segmentation, with * within tiny/small/base (#265, @ANAMASGARD).

New datasets

  • Added vggface2_dataset() for loading the VGGFace2 dataset (@DerrickUnleashed, #238).
  • Added moth dataset to rf100_biology_collection() and currency and wine_label to rf100_document_collection() (#274).

Bug fixes and improvements

  • model_maskrcnn_*() and model_fasterrcnn_*() now match the PyTorch implementation (#318).
  • transform_*() functions now correctly handle batched 4D torch tensors and 4D arrays (#313).
  • mnist_dataset() and derivatives now correctly return item x() values with a 1-channel dimension (@Chandraveersingh1717, #307).
  • transform_affine(), transform_rotate() and their random derivatives now use the interpolation and fill parameters in place of resample and fillcolor (@Chandraveersingh1717, #299).
  • Fixed draw_keypoints() documentation and error message (@srishtiii28, #296).
  • Standardized dataset messages: download messages now include split information, success messages show image count and class count for consistency.
  • Fixed model_fasterrcnn_*(): box outputs are now normalized to image size and batches are handled; also improved the performance of the roi_align() function (#284).
  • Fixed rf100 collection bounding boxes, which are now read in the correct native COCO format, 'xywh' (#272).
  • Removed the .getbatch method from MNIST, as it returned tensor dimensions inconsistent with .getitem because transform_*() operations are not vectorized (#264).
  • Added article for draw_keypoints() (@DerrickUnleashed, #303).
  • Fix typos and align model documentation for model_deeplabv3_* and model_convnext_*_detection() to ensure consistency (@DerrickUnleashed #302).
  • Fixed draw_keypoints() documentation: corrected connectivity parameter type, obsolete note, colors description default (@srishtiii28 #296).
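
The 'xywh' format named in the rf100 fix above stores each box as its top-left corner plus a width and a height, whereas corner coordinates ('xyxy') are often more convenient for drawing or overlap computations. The following is a minimal base R sketch of that conversion, assuming one box per matrix row; xywh_to_xyxy() is a hypothetical helper written for illustration, not a function of this package.

```r
# Hypothetical helper (not part of torchvision): convert boxes stored as
# (x_min, y_min, width, height) into (x_min, y_min, x_max, y_max).
xywh_to_xyxy <- function(boxes) {
  stopifnot(is.matrix(boxes), ncol(boxes) == 4)
  cbind(
    xmin = boxes[, 1],
    ymin = boxes[, 2],
    xmax = boxes[, 1] + boxes[, 3],  # x_min + width
    ymax = boxes[, 2] + boxes[, 4]   # y_min + height
  )
}

# Two example boxes in 'xywh' format, one per row.
boxes_xywh <- rbind(c(10, 20, 30, 40), c(0, 0, 5, 5))
boxes_xyxy <- xywh_to_xyxy(boxes_xywh)
print(boxes_xyxy)
```

For tensors, the box_convert() function listed under torchvision 0.7.0 is the package's own utility for converting between bounding box formats.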

torchvision 0.8.0

New datasets

  • Added lfw_people_dataset() and lfw_pairs_dataset() for loading the Labeled Faces in the Wild (LFW) datasets (@DerrickUnleashed, #203).
  • Added places365_dataset() for loading the Places365 dataset (@koshtiakanksha, #196).
  • Added pascal_segmentation_dataset() and pascal_detection_dataset() for loading the Pascal Visual Object Classes datasets (@DerrickUnleashed, #209).
  • Added whoi_plankton_dataset(), whoi_small_plankton_dataset(), and whoi_small_coral_dataset() (@cregouby, #236).
  • Added rf100_document_collection(), rf100_medical_collection(), rf100_biology_collection(), rf100_damage_collection(), rf100_infrared_collection(), and rf100_underwater_collection(). These are collections of datasets from Roboflow 100 under the same theme, for a total of 35 datasets (@koshtiakanksha, @cregouby, #239).
  • Added rf100_peixos_segmentation_dataset(). (@koshtiakanksha, @cregouby, #250).

New models

  • Added model_maxvit() for MaxViT: Multi-Axis Vision Transformer (#229, @koshtiakanksha).
  • Added model_facenet_pnet(), model_facenet_rnet(), and model_facenet_onet() for Facenet MTCNN face detection models. (@DerrickUnleashed, #227)
  • Added model_mtcnn() and model_inception_resnet_v1() models for face detection and recognition. (@DerrickUnleashed, #217)
  • Added model_mobilenet_v3_large() and model_mobilenet_v3_small() models for efficient image classification. (@DerrickUnleashed, #237)
  • Added 8 models of the model_convnext_*() family for image classification, thanks to a contribution from @horlar1. (@cregouby, #251)
  • Added 2 model_fasterrcnn_resnet50_*() models and 2 model_fasterrcnn_mobilenet_v3_large_*() models for object detection. (@koshtiakanksha, #251)

New features

  • Added imagenet_label() and imagenet_classes() for ImageNet classes resolution (#229, @koshtiakanksha).
  • base_loader() now accepts URLs (@cregouby, #246).
  • draw_segmentation_masks() now accepts the torch_float() output of semantic segmentation models. (@cregouby #247)
  • MNIST datasets and Roboflow 100 collections now have an attached .getbatch method (@cregouby #255)

Bug fixes and improvements

  • Switched pre-0.5.0 models to their /v2/ URL on torch-cdn.mlverse.org. (#215)
  • Models are now separated in the documentation by tasks between classification, object detection, and semantic segmentation models (@cregouby, #247)
  • Breaking change: the refactored coco_* dataset family now returns each item$x as an image array (for consistency with other datasets). You can use transform = transform_to_tensor to restore the previous behavior, where x is a torch_tensor().
  • transform_*() functions are now documented in 3 different categories: unitary transformations, random transformations and combining transformations. (@cregouby, #250)
  • Deprecation: emnist_dataset() is deprecated in favor of emnist_collection() (@cregouby, #260).

torchvision 0.7.0

New datasets

  • Added fashion_mnist_dataset() for loading the Fashion-MNIST dataset (@koshtiakanksha, #148).
  • Added eurosat_dataset(), eurosat_all_bands_dataset(), and eurosat100_dataset() for loading RGB, all-band, and small-subset variants of the EuroSAT dataset (@cregouby, #126).
  • Added qmnist_dataset() for loading the QMNIST dataset (@DerrickUnleashed, #153).
  • Added emnist_dataset() for loading the EMNIST dataset (@DerrickUnleashed, #152).
  • Added fgvc_aircraft_dataset() for loading the FGVC-Aircraft dataset (@DerrickUnleashed, #156).
  • Added coco_detection_dataset() and coco_caption_dataset() for loading the MS COCO detection and captions datasets (@koshtiakanksha, #161, #172).
  • Added caltech101_dataset() and caltech256_dataset() for loading the Caltech 101 and 256 datasets (@DerrickUnleashed, #158).
  • Added fer_dataset() for loading the FER-2013 dataset (@DerrickUnleashed, #154).
  • Added flowers102_dataset() for loading the Flowers102 dataset (@DerrickUnleashed, #157).
  • Added flickr8k_dataset() and flickr30k_dataset() for loading the Flickr8k and Flickr30k datasets (@DerrickUnleashed, #159).
  • Added oxfordiiitpet_dataset(), oxfordiiitpet_binary_dataset(), and oxfordiiitpet_segmentation_dataset() for loading the Oxford-IIIT Pet datasets (@DerrickUnleashed, #162).
  • Added rf100_document_collection(), rf100_underwater_collection(), rf100_medical_collection(), rf100_biology_collection(), and rf100_peixos_segmentation_dataset() for loading Roboflow 100 datasets (@koshtiakanksha, #239).

New models

  • Added EfficientNet model family (B0–B7) – scalable CNNs for image classification. (#166, @koshtiakanksha)
  • Added EfficientNetV2 model family (V2-S/M/L) – improved EfficientNet models for faster training. (#166, @koshtiakanksha)
  • Added model_vit_b_16(), model_vit_b_32(), model_vit_l_16(), model_vit_l_32(), and model_vit_h_14() for loading Vision Transformer models (@DerrickUnleashed, #202).

New features

  • tensor_image_display() and tensor_image_browse() now accept all tensor_image dtypes (@cregouby, #115).
  • draw_bounding_boxes() and draw_segmentation_masks() now accept image_with_bounding_box and image_with_segmentation_mask inputs which are the default items class for respectively detection datasets and segmentation datasets (@koshtiakanksha, #175).
  • fgvc_aircraft_dataset() gains support for annotation_level = "all" (@DerrickUnleashed, #168).
  • folder_dataset() now supports TIFF image formats (@cregouby, #169).
  • New nms() and batched_nms() functions provide Non-Maximum Suppression utilities. Added box_convert() to convert between bounding box formats (@Athospd, #40).
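
As background for the Non-Maximum Suppression entry above: NMS keeps the highest-scoring box, drops every remaining box whose overlap (IoU) with it exceeds a threshold, and repeats on what is left. The following is a minimal base R sketch of that greedy procedure on plain matrices in (xmin, ymin, xmax, ymax) format. Both box_iou_one() and greedy_nms() are hypothetical helpers for illustration only; they are not the package's nms() implementation, and they assume boxes with strictly positive area.

```r
# IoU between one box and each row of a matrix of boxes,
# all given as (xmin, ymin, xmax, ymax).
box_iou_one <- function(box, boxes) {
  inter_w <- pmax(0, pmin(box[3], boxes[, 3]) - pmax(box[1], boxes[, 1]))
  inter_h <- pmax(0, pmin(box[4], boxes[, 4]) - pmax(box[2], boxes[, 2]))
  inter <- inter_w * inter_h
  area <- (box[3] - box[1]) * (box[4] - box[2])
  areas <- (boxes[, 3] - boxes[, 1]) * (boxes[, 4] - boxes[, 2])
  inter / (area + areas - inter)
}

# Hypothetical helper illustrating greedy NMS. Returns the row indices
# of the kept boxes, ordered by decreasing score.
greedy_nms <- function(boxes, scores, iou_threshold) {
  order_idx <- order(scores, decreasing = TRUE)
  keep <- integer(0)
  while (length(order_idx) > 0) {
    best <- order_idx[1]
    keep <- c(keep, best)
    rest <- order_idx[-1]
    if (length(rest) == 0) break
    ious <- box_iou_one(boxes[best, ], boxes[rest, , drop = FALSE])
    # Keep only the candidates that do not overlap the selected box too much.
    order_idx <- rest[ious <= iou_threshold]
  }
  keep
}

boxes <- rbind(
  c(0, 0, 10, 10),    # highest score
  c(1, 1, 11, 11),    # overlaps heavily with the first box
  c(20, 20, 30, 30)   # disjoint from the others
)
scores <- c(0.9, 0.8, 0.7)
kept <- greedy_nms(boxes, scores, iou_threshold = 0.5)
print(kept)
```

In this example the second box overlaps the first with an IoU of 81/119 (about 0.68), so it is suppressed, while the disjoint third box is kept.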

Minor bug fixes and improvements

  • transform_rotation() now correctly uses width × height for image size instead of width × width (@cregouby, #114).
  • Clarified documentation for transform_affine() to reduce confusion with transform_random_affine() (@cregouby, #116).
  • Added French translations for message outputs (@cregouby, #112).

torchvision 0.6.0

  • Remove again dependency on zip::unzip added in version 0.4.0. (#89)
  • Improve performance on tinyimagenet-alexnet example (#90, @statist-bhfz)
  • Updated URL of downloaded resources to use the new torch CDN. (#109)

torchvision 0.5.1

  • Remove usage of torch_lstsq that was removed in torch v0.10.0

torchvision 0.5.0

  • Bugs fixed in transform_adjust_hue() and transform_linear_transformation() (#72, #73, @sebffischer)
  • Added draw_bounding_boxes(), draw_segmentation_masks() and draw_keypoints() to draw on top of image tensors, and added the convenience functions tensor_image_browse() and tensor_image_display() to visualize image tensors in a browser or in an X11 device, respectively (#80, @cregouby)
  • Added the InceptionV3 model. (#82)

torchvision 0.4.1

  • Implemented MobileNetV2 (#60)
  • Improved vignettes so they use nnf_cross_entropy for numerical stability. (#61)
  • Implemented the full ResNet model family (#66, @cregouby)
  • Improved how datasets and models are downloaded by using a large timeout by default and downloading to a temporary file to avoid hard-to-debug errors when the files are corrupt. (#67)

torchvision 0.4.0

  • Added a dependency on zip to zip::unzip the tinyimagenet dataset.
  • Removed all usages of torch::enumerate() from docs and tests in favor of coro::loop() (#57)
  • Fixed non-namespaced calls to torch. (#58)

torchvision 0.3.0

  • Use a self hosted version of the MNIST dataset to avoid frequent download failures. (#48)
  • Fix torch_arange calls after breaking change in torch. (#47)
  • Fix bug in transform_resize when passing size with length 1. (#49)

torchvision 0.2.0

  • Fixed bugs in transform_rotate. (#31)
  • Fixed bugs in transform_random_affine and transform_affine (#32)
  • Added VGG model (#35)

torchvision 0.1.0

  • Added a NEWS.md file to track changes to the package.