swallow llm - Robuta Search

https://swallow-llm.github.io/qwen3-swallow.ja.html

Qwen3の日本語能力と推論能力を強化した大規模言語モデル (8B, 30B-A3B, 32B)

https://b.hatena.ne.jp/entry/s/swallow-llm.github.io/qwen3-swallow.ja.html

この記事に対して3件のコメントがあります。注目されているコメントは「Qwen3ベースで日本語強化は手堅い。8Bでこれだけ動けば、わざわざ重いモデル使わなくて済む場面も増えそう。産総研と東工大の安定感に期待」です。

https://swallow-llm.github.io/index.ja.html

Open LLMs from academic research and development

https://swallow-llm.github.io/gptoss-swallow.ja.html

GPT-OSSの日本語能力と推論能力を強化した大規模言語モデル (20B, 120B)

https://swallow-llm.github.io/llama3.1-swallow.ja.html

Llama 3.1の英語の能力を維持しながら、日本語の能力を強化した大規模言語モデル (8B, 70B)

https://www.chokkan.org/swallow/leaderboard/task-post.ja.html?category=qwen3-swallow-8b

日本語・英語の大規模言語モデルの性能を棒グラフやレーダーチャート、散布図で比較

https://b.hatena.ne.jp/site/swallow-llm.github.io/

https://swallow-llm.github.io/llama3-swallow.ja.html

Llama 3の日本語能力を強化した大規模言語モデル (8B, 70B)