Research-ready and production-friendly neural network pruning for PyTorch: transparent methods, reproducible baselines, and deployment metrics for compressing models for real-world use.
machine-learning acceleration computer-vision deep-learning reproducible-research optimization latency torch pytorch energy-efficiency model-compression inference-optimization edge-ai structured-pruning green-ai unstructured-pruning neural-network-pruning efficient-ai low-latency-inference
Updated Jan 25, 2026 - Python