Hi mllm team,
I'm conducting NPU-aware algorithm optimization research using Qualcomm Device Cloud (QDC) with Snapdragon 8 Gen 3. Due to the lack of Hexagon SDK access and compilation environment constraints on the cloud platform, I'm unable to compile the QNN backend from source.
Could you please provide:
- Pre-compiled QNN backend binaries for Android ARM64
- Or guidance on using QNN AOT compilation to avoid device-side operator package compilation
My setup:
- Device: QDC Snapdragon 8 Gen 3
- QNN SDK: v2.14 (device firmware V73)
- Goal: Run NPU-aware optimization experiments (token pruning, quantization, KV cache)
Models are already downloaded. I can work with any recent mllm version that's compatible.
Thank you!
Hi mllm team,
I'm conducting NPU-aware algorithm optimization research using Qualcomm Device Cloud (QDC) with Snapdragon 8 Gen 3. Due to the lack of Hexagon SDK access and compilation environment constraints on the cloud platform, I'm unable to compile the QNN backend from source.
Could you please provide:
My setup:
Models are already downloaded. I can work with any recent mllm version that's compatible.
Thank you!