Hello,
I'm trying to reproduce the following two model files from the v1-backup:
qwen1.5-1.8b-chat-rot_q4_0.mllm
qwen1.5-1.8b-chat-rot_qnn.mllm
I strictly followed the step-by-step instructions in:
However, the generated qwen1.5-1.8b-chat-rot_q4_0.mllm file is 1.0 GB, which does not match the size of the official version hosted on ModelScope:
🔗 mllmTeam/Qwen1.5-1.8B-Chat on ModelScope
- My generated file:
qwen1.5-1.8b-chat-rot_q4_0.mllm (1.0 GB)
- Official ModelScope file:
qwen1.5-1.8b-chat-rot_q4_0.mllm (3.17 GB)
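For reference, this is roughly how I compared my output against the official release (a minimal sketch using standard tools; the filename is just the one above, and the checksum step assumes you have a local copy of the ModelScope file to compare against):

```shell
#!/bin/sh
# Compare a locally generated .mllm file against an official download.
# Usage: ./check.sh [path-to-file]; defaults to the q4_0 file discussed above.
file=${1:-qwen1.5-1.8b-chat-rot_q4_0.mllm}
if [ -f "$file" ]; then
  du -h "$file"     # human-readable size, to compare with 3.17 GB
  md5sum "$file"    # checksum, to compare with the ModelScope copy
else
  echo "missing: $file"
fi
```

A checksum mismatch alone would be expected from a re-run conversion, but such a large size gap suggests a different precision or a skipped step.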
Are there additional quantization or export steps I might have missed? How did you obtain the official model files?
Thank you for your help!