Qwen3 4B Instruct 2507 - Gemini 3 Pro Preview (No Reasoning) Distill

This model was trained on a Gemini 3 Pro Preview dataset with a high reasoning effort.

The reasoning summaries were then formatted out of the dataset and the model was finetuned on the final answers only.

  • 🧬 Datasets:

    • TeichAI/gemini-3-pro-preview-high-reasoning-1000x
  • 🏗 Base Model:

    • unsloth/Qwen3-4B-Instruct-2507
  • ⚡ Use cases:

    • Coding
    • Science
  • ∑ Stats (Dataset)

    • Costs: $ 32.7 (USD)
    • Total tokens (input + output): 2.73 M

This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.

Ollama

An Ollama Modelfile is included for easy deployment.

Downloads last month
2,580
GGUF
Model size
4B params
Architecture
qwen3
Hardware compatibility
Log In to view the estimation

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for TeichAI/Qwen3-4B-Instruct-2507-Gemini-3-Pro-Preview-Distill-GGUF

Dataset used to train TeichAI/Qwen3-4B-Instruct-2507-Gemini-3-Pro-Preview-Distill-GGUF