Qwen3 4B Instruct 2507 - Gemini 3 Pro Preview (No Reasoning) Distill

This model was trained on a dataset of Gemini 3 Pro Preview responses generated with high reasoning effort.

The reasoning summaries were then stripped from the dataset, and the model was fine-tuned on the final answers only.
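The filtering step can be sketched as follows. This is a minimal illustration, not the actual preprocessing script: the dataset schema is not documented here, so the field names `prompt`, `reasoning_summary`, and `answer`, as well as both function names, are hypothetical.

```python
from typing import Any


def strip_reasoning(record: dict[str, Any]) -> dict[str, str]:
    """Drop the reasoning summary and keep only the prompt and final answer.

    The field names used here are hypothetical placeholders, not the real schema.
    """
    return {
        "prompt": record["prompt"],
        "completion": record["answer"],
    }


def build_sft_dataset(records: list[dict[str, Any]]) -> list[dict[str, str]]:
    """Apply strip_reasoning to every record, skipping records with an empty answer."""
    return [strip_reasoning(r) for r in records if r.get("answer")]
```

The resulting prompt/completion pairs contain no reasoning text, so a model fine-tuned on them learns to answer directly.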

This Qwen3 model was trained 2x faster with Unsloth and Hugging Face's TRL library.

Ollama

An Ollama Modelfile is included for easy deployment.
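As a rough sketch of how the Modelfile might be used, the snippet below wraps the standard `ollama create <name> -f <Modelfile>` command. The local model name `qwen3-gemini-distill`, the default path `Modelfile`, and the helper function names are assumptions for illustration; adjust them to match the downloaded files.

```python
import subprocess


def build_create_command(model_name: str, modelfile_path: str = "Modelfile") -> list[str]:
    """Return the argument list for `ollama create`, registering a model from a Modelfile."""
    return ["ollama", "create", model_name, "-f", modelfile_path]


def create_model(model_name: str, modelfile_path: str = "Modelfile") -> None:
    """Run `ollama create`; raises CalledProcessError if the command fails.

    Requires the Ollama CLI to be installed and on PATH.
    """
    subprocess.run(build_create_command(model_name, modelfile_path), check=True)
```

After calling `create_model("qwen3-gemini-distill")` (a hypothetical name), the model can be started from a terminal with `ollama run qwen3-gemini-distill`.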

GGUF

Model size: 4B params

Architecture: qwen3

Quantizations: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit, 16-bit
