Qwen/Qwen3-VL-4B-Thinking-FP8

#1451
by SkyMind

Let's do https://huggingface.co/Qwen/Qwen3-VL-4B-Thinking instead, as we can't convert an already quantized model into a GGUF.
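
For reference, the usual flow is to convert the full-precision checkpoint to GGUF first and then quantize from that GGUF with llama.cpp's own tooling. A minimal sketch, where all file and directory names are placeholders:

```bash
# Sketch of the standard GGUF workflow; paths/filenames are placeholders.

# Download the full-precision model (not the FP8 variant).
huggingface-cli download Qwen/Qwen3-VL-4B-Thinking --local-dir ./Qwen3-VL-4B-Thinking

# Convert to a full-precision GGUF first...
python llama.cpp/convert_hf_to_gguf.py ./Qwen3-VL-4B-Thinking \
    --outfile qwen3-vl-4b-thinking-f16.gguf --outtype f16

# ...then quantize from that GGUF, e.g. to Q4_K_M.
llama.cpp/build/bin/llama-quantize \
    qwen3-vl-4b-thinking-f16.gguf qwen3-vl-4b-thinking-q4_k_m.gguf Q4_K_M
```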

Unfortunately, Qwen3VLForConditionalGeneration is not currently supported by llama.cpp, and I don't currently see any pull requests working on adding support for it.

https://github.com/ggml-org/llama.cpp/issues/16207
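
As a rough heuristic (not an official procedure): the conversion script registers the HF architecture names it can handle, so an architecture that doesn't appear in it isn't supported yet:

```bash
# Rough support check: convert_hf_to_gguf.py maps HF architecture names
# to GGUF model classes, so an unsupported architecture won't show up.
grep -n "Qwen3VLForConditionalGeneration" llama.cpp/convert_hf_to_gguf.py

# Running the converter on an unsupported model fails early with an
# "architecture not supported" style error instead of producing a GGUF.
```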

Awesome, thanks for linking this. I must have missed it, since work on it started three weeks ago already. I will follow it and do this model as soon as support is merged.

https://github.com/ggml-org/llama.cpp/pull/16780 has been merged.
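
With support merged, conversion should work on a current build. A hedged sketch, assuming a configured llama.cpp checkout and that Qwen3-VL follows the usual llama.cpp vision-model flow where the projector is exported separately (the --mmproj usage here is an assumption based on other VL models; filenames are placeholders):

```bash
# Sketch, assuming a llama.cpp checkout updated past PR #16780
# with a build directory already configured via cmake.
git -C llama.cpp pull
cmake --build llama.cpp/build --config Release

# Vision models ship as two GGUF parts: the language model and the
# multimodal projector. Exporting the projector via --mmproj is an
# assumption based on how other llama.cpp vision models are converted.
python llama.cpp/convert_hf_to_gguf.py ./Qwen3-VL-4B-Thinking \
    --outfile mmproj-qwen3-vl-4b-f16.gguf --mmproj

# Quick smoke test with the multimodal CLI (paths are placeholders).
llama.cpp/build/bin/llama-mtmd-cli \
    -m qwen3-vl-4b-thinking-q4_k_m.gguf \
    --mmproj mmproj-qwen3-vl-4b-f16.gguf \
    --image test.png -p "Describe this image."
```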
