> For the complete documentation index, see [llms.txt](https://docs.aiandbusiness.com/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.aiandbusiness.com/ja/into-ai/moderullm.md). # 大規模言語モデル（LLM） {% hint style="info" %} * ChatGPT * local * Prompt * Hallucination Problem {% endhint %} ## 大規模言語モデルのランキング(LLM) Ranking {% tabs %} {% tab title="Artificial Analysis" %} **Artificial Analysis** AI言語モデルおよびAPIの独立分析。品質、速度、価格の比較を提供します。 {% embed url="" %} {% endtab %} {% tab title="Scale Leaderboards" %} {% embed url="" %} {% endtab %} {% tab title="LM Arena" %} {% embed url="" %} LMArena.aiは、テキスト、ビジョン、Web開発、検索、コーディングなどの分野で、240以上の大規模言語モデルを3.5百万以上のユーザー投票に基づいてランキングする包括的なAIモデルリーダーボードを公開しました。従来のベンチマークとは異なり、LMArenaはクラウドソースによるブラインド投票システムを採用し、ユーザーが同じプロンプトに対する匿名のモデル応答を比較して優れたものに投票することで、モデルの性能を動的かつ実世界で評価します。 {% endtab %} {% endtabs %} *** ## 大規模言語モデル ### クローズドソースモデル： {% tabs %} {% tab title="OpenAI o1" %} **OpenAI's new o1 model, released on September 12, 2024, is now the most powerful LLM that can reason through complex problems by breaking them down into steps, excelling particularly in areas like mathematics, coding, and scientific reasoning where it outperforms previous models and even rivals human experts in some cases.** ### OpenAI o1 and o1 pro {% embed url="" %} {% endtab %} {% tab title="GPT-4o" %} GPT-4o-2024-08-06, OpenAIの最も強力なLLM 2024年5月29日、GPT-4oを搭載したChatGPTが全員に無料で提供されました。 {% endtab %} {% tab title="Grok-3" %} Elon MuskのAI企業xAIは、2025年2月18日に「世界最強のAI」と称されるGrok 3を正式リリースしました。xAIによると、このモデルは推論能力と各種ベンチマークで従来のモデルを大きく上回る性能を発揮するとのことです。 {% embed url="" %} {% endtab %} {% tab title="Claude" %} Claudeは、Anthropicによって開発された大規模言語モデルのファミリーで、テキストや画像ベースの入力に対して自然で人間のような応答を生成するように設計されています。Claude 3モデルファミリーには3つのバージョンがあります： * Claude 3 Opus：最も高度なモデルで、非常に複雑なタスクにおいて人間に近い理解力と流暢さを持ちます。 * Claude 3 Sonnet：知能と速度のバランスが取れており、迅速な応答が求められる企業の業務に最適です。 * Claude 3 Haiku：最も速くてコンパクトなモデルで、簡単なクエリやリクエストに対してほぼ即座に応答します。 Anthropicは、データによると最高の大規模言語モデルである新しい**Claude Sonnet 3.5**を発表しました。 {% embed url="" %} {% endtab %} {% tab title="Gemini" %} ### **Gemini 2.5** {% embed url="" %} {% endtab %} {% endtabs %} ### オープンソースの大規模言語モデル: {% tabs %} {% tab title="gpt-oss" %} OpenAIは、Apache 2.0ライセンスのもと、オープンウェイトの言語モデル「gpt‑oss（gpt‑oss‑120bと20b）」を公開しました。高度な推論やツール利用に特化し、120Bモデルはo4‑miniに匹敵する性能を持ち、20Bモデルは一般的なPC環境でも動作可能な効率性を備えています。 {% endtab %} {% tab title="DeepSeek" %} DeepSeek v3は、6710億パラメータを搭載した先進的なオープンソースの大規模言語モデルで、卓越した性能とコスト効率を兼ね備えています。これにより、OpenAIやGoogleといった大手企業のクローズドソースモデルに対する有力な代替オプションとして注目されています。 {% endtab %} {% tab title="LLama" %} LLama is a family of state-of-the-art large language models developed by Meta, designed to perform a wide range of natural language processing tasks with varying levels of computational efficiency and accuracy. ### Llama 4 [https://ai.meta.com/blog/llama-4-multimodal-intelligence/](https://ai.meta.com/blog/llama-4-multimodal-intelligence/) Llama 4：Metaの最新オープンウェイト多モーダルAIモデルシリーズ Llama 4は、Metaが発表した最新のオープンウェイトな多モーダルAIモデルシリーズで、テキストと画像の処理を効率化する「Mixture of Experts（MoE）」アーキテクチャを採用しています。本シリーズには、1,000万トークンのコンテキストウィンドウを持つScoutや、多言語タスクに強みを持つMaverickなど、特徴的なモデルが含まれており、柔軟かつ高度な応用が可能です。 {% embed url="" %} {% endtab %} {% tab title="Mistral" %} Mistral AIは、テキスト生成、コード生成、多言語推論などのさまざまなタスクに対応したオープンソースおよび商業用の大規模言語モデル（LLM）を提供しています。 ; **Mistral Large 2** Mistral AIは、最先端の推論、知識、およびコーディング能力を備えた最先端の言語モデル（LLM）であるMistral Large 2を導入しました。128kのコンテキストウィンドウを持ち、フランス語、ドイツ語、スペイン語、イタリア語、ポルトガル語、アラビア語、ヒンディー語、ロシア語、中国語、日本語、韓国語を含む数十の言語、および80以上のコーディング言語をサポートしています。 {% endtab %} {% tab title="Qwen3" %} Qwen3は、Alibaba Cloudが開発した最新世代の大規模言語モデル（LLM）であり、「思考モード」と「非思考モード」を切り替えるハイブリッド型問題解決アプローチを採用しています。 119言語に対応しており、密集型モデル（Dense）とMoE（Mixture-of-Experts）型モデルなど、さまざまなサイズ・構成が用意されています。スケーラブルかつ高効率、そして多言語対応が求められるAIアプリケーションに最適化された設計が特徴です。 {% embed url="" %} {% endtab %} {% endtabs %} *** ## 学習リソース：


Transformers, Explained: Understand the Model Behind GPT-3, BERT, and T5	https://daleonai.com/transformers-explained
What Is ChatGPT Doing … and Why Does It Work?	https://writings.stephenwolfram.com/2023/02/what-is-chatgpt-doing-and-why-does-it-work/
ChatGPT Prompt Engineering for Developers	https://www.deeplearning.ai/short-courses/chatgpt-prompt-engineering-for-developers/
LLM101n: Let's build a Storyteller	https://github.com/karpathy/LLM101n
How GPT works	https://bbycroft.net/llm
How I Use "AI" by Nicholas Carlini	https://nicholas.carlini.com/writing/2024/how-i-use-ai.html
Transformer Explainer	https://poloclub.github.io/transformer-explainer/
Generative AI Handbook: A Roadmap for Learning Resources	https://genai-handbook.github.io/
Anthropic courses	https://github.com/anthropics/courses/tree/master
Deep Dive into LLMs like ChatGPT by Andrej Karpathy	https://youtu.be/7xTGNNLPyMI?si=6WfUHQx8-XqNxlQQ
KarpaHow I use LLMs by Andrej Karpathy	https://youtu.be/EWvNQjAaOHw?si=bzp-Dy5yvnCohubp
Google - Prompt Engineering by Lee Boonstra	https://www.kaggle.com/whitepaper-prompt-engineering