How to use from
llama.cpp
Install from brew
brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf kawasumi/Tema_Q-R-4B-GGUF:Q4_K_M
# Run inference directly in the terminal:
llama-cli -hf kawasumi/Tema_Q-R-4B-GGUF:Q4_K_M
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf kawasumi/Tema_Q-R-4B-GGUF:Q4_K_M
# Run inference directly in the terminal:
llama-cli -hf kawasumi/Tema_Q-R-4B-GGUF:Q4_K_M
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf kawasumi/Tema_Q-R-4B-GGUF:Q4_K_M
# Run inference directly in the terminal:
./llama-cli -hf kawasumi/Tema_Q-R-4B-GGUF:Q4_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf kawasumi/Tema_Q-R-4B-GGUF:Q4_K_M
# Run inference directly in the terminal:
./build/bin/llama-cli -hf kawasumi/Tema_Q-R-4B-GGUF:Q4_K_M
Use Docker
docker model run hf.co/kawasumi/Tema_Q-R-4B-GGUF:Q4_K_M
Quick Links

๐Ÿš€ Tema_Q-R-4B

๐Ÿ”ฅ ใƒขใƒ‡ใƒซๆฆ‚่ฆ

Tema_Q-R-4B๏ผˆๅคฉ้ฆฌๆฑ‚๏ผ‰ ใฏใ€GoogleใŒ้–‹็™บใ—ใŸ้ซ˜ๆ€ง่ƒฝใชใ‚ชใƒผใƒ—ใƒณใƒขใƒ‡ใƒซ Gemma 3 4B ใ‚’ๅŸบ็›คใซใ—ใŸใ€ๆ—ฅๆœฌ่ชžใ€่‹ฑ่ชžใ€ไธญๅ›ฝ่ชžๅ‘ใ‘ใฎๆ”น่‰ฏ็‰ˆๅคง่ฆๆจก่จ€่ชžใƒขใƒ‡ใƒซ๏ผˆLLM๏ผ‰ใงใ™ใ€‚

้€šๅธธใฎGemma 3ใงใฏๅ›ž็ญ”ใŒ้›ฃใ—ใ„ใƒ—ใƒญใƒณใƒ—ใƒˆใซๅฏพใ—ใฆใ‚‚ใ€ใ‚ˆใ‚Š่‡ช็”ฑใงๆœ‰็”จใชๅฟœ็ญ”ใ‚’็”Ÿๆˆใงใใ‚‹ใ‚ˆใ†่จญ่จˆใ•ใ‚Œใฆใ„ใพใ™ใ€‚ใ‚ฏใƒชใ‚จใ‚คใƒ†ใ‚ฃใƒ–ใชๅŸท็ญ†ใ€่ค‡้›‘ใชใƒ—ใƒญใ‚ฐใƒฉใƒŸใƒณใ‚ฐใ‚ฟใ‚นใ‚ฏใ€ใƒ‡ใ‚ฃใƒผใƒ—ใช็Ÿฅ่ญ˜ๆŽขๆฑ‚ใชใฉใ€ใ‚ใ‚‰ใ‚†ใ‚‹ๅˆ†้‡ŽใงAIใฎๅฏ่ƒฝๆ€งใ‚’ๆœ€ๅคง้™ใซๅผ•ใๅ‡บใ—ใŸใ„ใƒฆใƒผใ‚ถใƒผใซๆœ€้ฉใงใ™ใ€‚

Tema_Q-R-4B is an improved Large Language Model (LLM) tailored for Japanese, English, and Chinese, built upon Gemma 3 4B, a high-performance open model developed by Google.

It is designed to generate more flexible and useful responses, even for prompts that the standard Gemma 3 might find challenging to answer. It is ideal for users who wish to maximize the potential of AI in all fields, including creative writing, complex programming tasks, and deep knowledge exploration.

้ …็›ฎ ่ฉณ็ดฐ
ใƒ™ใƒผใ‚นใƒขใƒ‡ใƒซ Google Gemma 3 4B
ใƒขใƒ‡ใƒซๅ Tema_Q-R-4B
ๅฏพๅฟœ่จ€่ชž ๆ—ฅๆœฌ่ชž (JA), ่‹ฑ่ชž (EN), ไธญๅ›ฝ่ชž (ZH)
ใƒขใƒ‡ใƒซใ‚ตใ‚คใ‚บ 4 Billion Parameters
ใƒฉใ‚คใ‚ปใƒณใ‚น Gemma 3ใฎใƒฉใ‚คใ‚ปใƒณใ‚นใซๆบ–ๆ‹ 
้–‹็™บ KY, TY, HY, KK

๐Ÿ›ก๏ธ ่ฒฌไปปใ‚ใ‚‹AIๅˆฉ็”จใจๅญฆ็ฟ’ใƒ‡ใƒผใ‚ฟใฎๅฎ‰ๅ…จๆ€ง

โš ๏ธ ่ฒฌไปปใ‚ใ‚‹ๅˆฉ็”จใฎๅพนๅบ•

  • ใƒฆใƒผใ‚ถใƒผใฎ่ฒฌไปป: ใƒขใƒ‡ใƒซใฎๅˆฉ็”จ่€…ใฏใ€็”Ÿๆˆใ•ใ‚ŒใŸใ‚ณใƒณใƒ†ใƒณใƒ„ใŒใ€้ฉ็”จใ•ใ‚Œใ‚‹ๆณ•ๅพ‹ใ€่ฆๅˆถใ€ใŠใ‚ˆใณHugging Faceใฎๅˆฉ็”จ่ฆ็ด„/ใ‚ณใƒณใƒ†ใƒณใƒ„ใƒใƒชใ‚ทใƒผใซๆบ–ๆ‹ ใ™ใ‚‹ใ“ใจใ‚’ๅ…จ้ข็š„ใซไฟ่จผใ™ใ‚‹ๅฟ…่ฆใŒใ‚ใ‚Šใพใ™ใ€‚
  • ็ฆๆญขไบ‹้ …: ใ“ใฎใƒขใƒ‡ใƒซใ‚’ใ€ใ„ใ‹ใชใ‚‹ๅทฎๅˆฅใ€ใƒใƒฉใ‚นใƒกใƒณใƒˆใ€ๆšดๅŠ›ใ€้•ๆณ•่กŒ็‚บใ€ใŠใ‚ˆใณๆœ‰ๅฎณใช็›ฎ็š„ใฎใŸใ‚ใซๅˆฉ็”จใ™ใ‚‹ใ“ใจใ‚’ๅ›บใ็ฆใ˜ใพใ™ใ€‚
Downloads last month
17
GGUF
Model size
4B params
Architecture
gemma3
Hardware compatibility
Log In to add your hardware

4-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for kawasumi/Tema_Q-R-4B-GGUF

Quantized
(5)
this model

Spaces using kawasumi/Tema_Q-R-4B-GGUF 3