C# データ構造 - Dictionary と SortedList

投稿 2023/07/27 更新 2026/03/04

中級

By Sehyup

22 分読む時間

C# データ構造 - Dictionary と SortedList

TL;DR — 要点まとめ

Dictionary は O(1) のハッシュテーブルで、ゲーム開発で最もよく使う Key-Value 構造
SortedDictionary は O(log n) の赤黒木で、挿入/削除時も常にソート状態を維持
SortedList は挿入が遅い (O(n)) がメモリ効率が高く、配列のようにインデックスアクセス可能
性能優先なら Dictionary、整列済みで更新が多いなら SortedDictionary、省メモリ+インデックスなら SortedList

はじめに

ゲーム開発では、データを「どう保存し、どう検索するか」が性能に直結します。数千件のモンスターステータスを ID で引く場合、インベントリを整列状態で保つ場合、ランキングを順序付きで表示する場合では、最適なデータ構造が異なります。

C# で代表的な Key-Value 構造は Dictionary<TKey, TValue>、SortedList<TKey, TValue>、SortedDictionary<TKey, TValue> の 3 つです。見た目は似ていますが内部実装は大きく異なり、性能特性もかなり違います。

本記事では各構造の内部動作原理を .NET ランタイム視点で整理し、ゲーム開発でいつ何を選ぶべきかを実践的にまとめます。

Part 1: 内部構造

1. Dictionary - Hash Table ベース

Dictionary<TKey, TValue> は C# で最も頻繁に使われる Key-Value 構造です。内部は Hash Table で実装され、平均 O(1) 検索を提供します。

実際の内部構造: buckets + entries

Dictionary の核は 2 つの配列です。

  
// .NET Runtime source (simplified)
private int[] buckets;     // hash -> entries index mapping
private Entry[] entries;   // array storing actual data

private struct Entry
{
    public uint hashCode;  // hash value of key
    public int next;       // next entry index in same bucket (-1 means end)
    public TKey key;
    public TValue value;
}

重要なのは、チェイニングが別 LinkedList ではなく entries 配列内の next インデックスでつながっている点です。連続配列を使うため Cache Locality が高くなります。

flowchart TB
    subgraph Buckets["buckets[] (サイズ: 素数)"]
        B0["[0] -> 2"]
        B1["[1] -> -1 (空)"]
        B2["[2] -> 0"]
        B3["[3] -> 3"]
        B4["[4] -> -1"]
        B5["[5] -> 1"]
        B6["[6] -> -1"]
    end

    subgraph Entries["entries[] (連続配列)"]
        E0["[0] hash=9284 next=-1<br/>Key=MOB_001 Val=ゴブリン"]
        E1["[1] hash=5831 next=-1<br/>Key=MOB_005 Val=スライム"]
        E2["[2] hash=3500 next=4<br/>Key=MOB_003 Val=オーク"]
        E3["[3] hash=7423 next=-1<br/>Key=MOB_002 Val=ドラゴン"]
        E4["[4] hash=3507 next=-1<br/>Key=MOB_008 Val=ミノタウロス"]
    end

    B0 -->|"index 2"| E2
    B2 -->|"index 0"| E0
    B3 -->|"index 3"| E3
    B5 -->|"index 1"| E1
    E2 -->|"next=4<br/>(同バケット衝突)"| E4

検索手順 (TryGetValue)

1. hashCode = key.GetHashCode()
2. bucketIndex = hashCode % buckets.Length
3. entryIndex = buckets[bucketIndex]
4. entries[entryIndex].key と比較
   - 一致 -> value を返す
   - 不一致 -> entries[entryIndex].next を辿る
   - next == -1 -> キーなし

ゲーム開発で例えると「索引付きの引き出し棚」です。ハッシュで引き出しを決め、内部ラベルで照合します。

リサイズ: 素数ベース

衝突が増えると性能は O(n) に近づきます。Dictionary は負荷を監視し、必要時にリサイズします。このとき新サイズは単純 2 倍ではなく、現在サイズの 2 倍以上の最小素数を使います。

// Prime table used internally (.NET HashHelpers.cs excerpt)
// 3, 7, 11, 17, 23, 29, 37, 47, 59, 71, 89, 107, 131, 163, 197, 239, 293, 353, 431, 521, 631, ...

素数を使う理由は hashCode % bucketCount で分布を作るためです。素数だと偏りに強く、2 の冪では低ビット偏重で衝突が増えやすくなります。

要点: リサイズは buckets / entries 再確保 + 全再ハッシュを伴う O(n) コストです。件数見積もりが可能なら初期 capacity 指定が必須です。

Before - 容量未指定

// リサイズが複数回発生し得る
var monsterStats =
    new Dictionary<string, MonsterData>();

foreach (var data in allMonsterData)
{
    monsterStats[data.Id] = data;
}

After - 初期容量指定

// 理想的にはリサイズ 0 回
var monsterStats =
    new Dictionary<string, MonsterData>(
        allMonsterData.Count);

foreach (var data in allMonsterData)
{
    monsterStats[data.Id] = data;
}

flowchart LR
    subgraph Before["リサイズ前 (buckets: 7)"]
        direction TB
        BB["entries 5/7 使用<br/>衝突チェーン発生"]
    end

    subgraph Resize["リサイズ処理"]
        R1["1. 新サイズ = HashHelpers.GetPrime(7 x 2)<br/>   -> 17"]
        R2["2. buckets[17], entries[17] 再確保"]
        R3["3. 全エントリ再ハッシュ<br/>   (hashCode % 17)"]
    end

    subgraph After["リサイズ後 (buckets: 17)"]
        direction TB
        BA["entries 5/17 使用<br/>衝突緩和"]
    end

    Before --> Resize --> After

GetHashCode / Equals 契約

規則	説明
`a.Equals(b) == true`	`a.GetHashCode() == b.GetHashCode()` である必要がある
`a.GetHashCode() == b.GetHashCode()`	`a.Equals(b)` は false でも良い (衝突許容)
Dictionary に入っている間	`GetHashCode()` が変化してはいけない

Q. mutable なオブジェクトを Key にしてはいけない理由は? 追加後に内部値が変わるとハッシュが変わり、別バケットを探してしまって値が見つからなくなります。
Q. string.GetHashCode() はプロセス間で固定ですか? 固定ではありません。 .NET Core/.NET 5+ ではプロセスごとにハッシュシードが変わります。ハッシュ値を永続化してはいけません。Key 本体を保存してください。

2. SortedList - 整列配列ベース

SortedList<TKey, TValue> は内部で 整列された 2 配列 (keys[], values[]) を使い、常に Key 昇順を保ちます。

  
// .NET Runtime source (simplified)
private TKey[] keys;      // sorted key array
private TValue[] values;  // value array aligned by index
private int _size;        // element count (keys.Length >= _size)

flowchart TB
    subgraph SortedList["SortedList 内部構造"]
        direction TB
        subgraph Keys["keys[] (整列維持)"]
            K0["[0] 100"]
            K1["[1] 200"]
            K2["[2] 300"]
            K3["[3] 500"]
            K4["[4] 800"]
        end
        subgraph Values["values[] (同一インデックス対応)"]
            V0["[0] ゴブリン"]
            V1["[1] オーク"]
            V2["[2] ドラゴン"]
            V3["[3] スライム"]
            V4["[4] フェニックス"]
        end
        subgraph Idx["インデックスアクセス: O(1)"]
            I0["Keys[2] -> 300"]
            I1["Values[2] -> ドラゴン"]
        end
    end

検索: 二分探索

整列配列なので Array.BinarySearch() による O(log n) 検索です。

配列: [100, 200, 300, 500, 800]

Step 1: lo=0, hi=4, mid=2 -> 300 < 500 -> lo=3
Step 2: lo=3, hi=4, mid=3 -> 500 == 500 -> 発見

挿入: 配列シフトのコスト

中間挿入では後続要素を Array.Copy() でずらす必要があり、最悪 O(n) です。

flowchart TB
    subgraph Before["挿入前: Key=250"]
        B["[0]100  [1]200  [2]300  [3]500  [4]800"]
    end

    subgraph Step1["Step 1: 二分探索で挿入位置決定"]
        S1["~index = 2"]
    end

    subgraph Step2["Step 2: Array.Copy で後方シフト"]
        S2["[0]100  [1]200  [_]___  [3]300  [4]500  [5]800"]
    end

    subgraph After["Step 3: index=2 に挿入"]
        A["[0]100  [1]200  [2]250  [3]300  [4]500  [5]800"]
    end

    Before --> Step1 --> Step2 --> After

Q. SortedList はインデックスアクセス可能? 可能です。Keys[index] / Values[index] は O(1)。SortedDictionary との大きな違いです。
Q. 挿入順序で性能は変わる? 大きく変わります。 昇順挿入は末尾追加が多く高速、逆順/ランダムはシフト多発で遅くなります。
Q. Count と Capacity の違いは? Count は要素数、Capacity は内部配列サイズ。大量挿入前に Capacity 設定、終了後 TrimExcess() が有効です。

3. SortedDictionary - Red-Black Tree ベース

SortedDictionary<TKey, TValue> は整列維持という点は同じですが、内部は Red-Black Tree です。

Red-Black Tree の規則

ノードは Red または Black
ルートは常に Black
Red ノードの子は Black (Red-Red 連続不可)
ルートから各葉までの Black ノード数は同じ

flowchart TB
    subgraph RBTree["Red-Black Tree (root=300)"]
        N300["300<br/>⬛ Black"]
        N200["200<br/>🔴 Red"]
        N500["500<br/>⬛ Black"]
        N100["100<br/>⬛ Black"]
        N250["250<br/>⬛ Black"]
        N800["800<br/>🔴 Red"]

        N300 --> N200
        N300 --> N500
        N200 --> N100
        N200 --> N250
        N500 --> N800
    end

なぜ AVL ではなく RB Tree か

特性	AVL	Red-Black
平衡条件	厳密	緩やか
挿入時回転	最大 2 回だが再検査が重い	最大 2 回中心
削除時回転	O(log n) 回転可能	最大 3 回
検索	やや速い	やや遅い
挿入/削除	やや不利	更新処理に有利

更新が多い Key-Value コレクションでは RB Tree が現実的です。

挿入時回転の例

flowchart LR
    subgraph Before["挿入前"]
        direction TB
        A300["300 ⬛"]
        A200["200 🔴"]
        A100["100 ⬛"]
        A300 --> A200
        A200 --> A100
    end

    subgraph Insert["150 を Red で挿入"]
        direction TB
        B300["300 ⬛"]
        B200["200 🔴"]
        B100["100 ⬛"]
        B150["150 🔴 <- NEW"]
        B300 --> B200
        B200 --> B100
        B100 --> B150
    end

    subgraph Rebalance["回転 + 色変更"]
        direction TB
        C200["200 ⬛"]
        C150["150 🔴"]
        C300["300 🔴"]
        C100["100 ⬛"]
        C200 --> C150
        C200 --> C300
        C150 --> C100
    end

    Before -->|"BST 挿入"| Insert
    Insert -->|"RB 違反修正"| Rebalance

回転回数が少ないのが利点ですが、ノードは個別ヒープオブジェクトのためメモリオーバーヘッドは大きいです。

Q. SortedDictionary.First() は O(1)? 違います。 左端ノードまで降りるため O(log n) です。
Q. foreach の順序保証は? 保証されます。 In-order traversal で Key 昇順です。

Part 2: 性能比較

4. 時間計算量比較

演算	Dictionary	SortedList	SortedDictionary
検索	O(1) 平均	O(log n)	O(log n)
挿入	O(1) 平均	O(n)	O(log n)
削除	O(1) 平均	O(n)	O(log n)
順序巡回	O(n log n) (別ソート必要)	O(n)	O(n)
インデックスアクセス	不可	O(1)	不可
最小/最大	O(n)	O(1)	O(log n)

5. 実測傾向

10,000 件 int Key 基準の概略傾向:

演算 (10,000)	Dictionary	SortedList	SortedDictionary
全挿入	~0.3ms	~15ms (ランダム) / ~1ms (整列入力)	~3ms
単件検索	~20ns	~150ns	~200ns
全巡回	~0.1ms	~0.05ms	~0.15ms
メモリ	~450KB	~240KB	~640KB

ポイント:

Dictionary 検索は SortedList より 7〜8 倍程度速い
SortedList 挿入は入力順で 最大 15 倍差
SortedDictionary は通常メモリ消費が最大

  
// Quick benchmark example in Unity
public void BenchmarkCollections(int count)
{
    var sw = new System.Diagnostics.Stopwatch();
    var random = new System.Random(42);
    var keys = Enumerable.Range(0, count).OrderBy(_ => random.Next()).ToArray();

    sw.Restart();
    var dict = new Dictionary<int, int>(count);
    foreach (var key in keys) dict[key] = key;
    sw.Stop();
    Debug.Log($"Dictionary Insert: {sw.Elapsed.TotalMilliseconds:F3}ms");

    sw.Restart();
    var sortedList = new SortedList<int, int>(count);
    foreach (var key in keys) sortedList[key] = key;
    sw.Stop();
    Debug.Log($"SortedList Insert (random): {sw.Elapsed.TotalMilliseconds:F3}ms");

    sw.Restart();
    var sortedDict = new SortedDictionary<int, int>();
    foreach (var key in keys) sortedDict[key] = key;
    sw.Stop();
    Debug.Log($"SortedDictionary Insert: {sw.Elapsed.TotalMilliseconds:F3}ms");
}

6. メモリ使用量比較

構造	内部構造	要素あたりオーバーヘッド	特性
Dictionary	buckets[] + entries[]	~28 bytes	連続配列 + 空きスロット
SortedList	keys[] + values[]	~16 bytes	連続配列、キャッシュ効率高い
SortedDictionary	TreeNode object	~48+ bytes	ヒープ分散、GC 追跡対象増加

flowchart TB
    subgraph Memory["メモリレイアウト比較"]
        subgraph Dict["Dictionary"]
            DM["entries[]: [E0][E1][E2][E3][__][__]<br/>buckets[]: [2][-1][0][3][-1][-1][-1]"]
        end
        subgraph SL["SortedList"]
            SM["keys[]:   [K0][K1][K2][K3]<br/>values[]: [V0][V1][V2][V3]"]
        end
        subgraph SD["SortedDictionary"]
            SDM["[Node] -> [Node] -> [Node] -> ...<br/>ノードごとに個別ヒープ確保"]
        end
    end

Unity では GC Spike がフレーム落ち要因です。SortedDictionary は要素数分のノードオブジェクトが増え、GC 負担が高くなります。

Part 3: 実戦活用

7. シナリオ別選択ガイド

flowchart LR
    subgraph Decision["データ構造選択フロー"]
        Q1{"整列が必要?"}
        Q2{"挿入/削除が頻繁?"}
        Q3{"インデックスアクセスが必要?"}

        Q1 -->|No| D["Dictionary<br/>(多くのケースで基本)"]
        Q1 -->|Yes| Q2
        Q2 -->|Yes| SD["SortedDictionary"]
        Q2 -->|No, 読取中心| Q3
        Q3 -->|Yes| SL["SortedList"]
        Q3 -->|No| SD2["SortedDictionary<br/>(安全な既定)"]
    end

マスターデータ参照 -> Dictionary

  
private Dictionary<int, MonsterMasterData> monsterTable;

public void LoadMasterData(IReadOnlyList<MonsterMasterData> rawData)
{
    monsterTable = new Dictionary<int, MonsterMasterData>(rawData.Count);

    foreach (var data in rawData)
    {
        monsterTable[data.ID] = data;
    }
}

public MonsterMasterData GetMonster(int monsterID)
{
    return monsterTable.TryGetValue(monsterID, out var data) ? data : null;
}

リアルタイムイベントタイムライン -> SortedDictionary

  
private SortedDictionary<(float time, int id), GameEvent> eventTimeline
    = new SortedDictionary<(float time, int id), GameEvent>();
private int nextEventID;

public void ScheduleEvent(float time, GameEvent evt)
{
    eventTimeline[(time, nextEventID++)] = evt;
}

public void ProcessEvents(float currentTime)
{
    while (eventTimeline.Count > 0)
    {
        var first = eventTimeline.First();
        if (first.Key.time > currentTime) break;

        first.Value.Execute();
        eventTimeline.Remove(first.Key);
    }
}

8. 実戦 Tips と注意点

Dictionary の初期容量設定

  
// BAD: リサイズ多発
var dict = new Dictionary<int, string>();
for (int i = 0; i < 10000; i++)
    dict.Add(i, $"item_{i}");

// GOOD: 初期容量指定
var dict = new Dictionary<int, string>(10000);
for (int i = 0; i < 10000; i++)
    dict.Add(i, $"item_{i}");

概算でも capacity 指定すると効果があります。 .NET 6+ なら EnsureCapacity(int capacity) も利用可能。

enum Key 使用時の注意 (Unity Mono)

  
// BAD: Unity Mono では enum key で boxing が発生し得る
var dict = new Dictionary<MyEnum, string>();

// GOOD: custom comparer で boxing 回避
public struct MyEnumComparer : IEqualityComparer<MyEnum>
{
    public bool Equals(MyEnum x, MyEnum y) => x == y;
    public int GetHashCode(MyEnum obj) => (int)obj;
}

var dict = new Dictionary<MyEnum, string>(new MyEnumComparer());

Q. TryGetValue と ContainsKey + indexer はどちらが良い? TryGetValue 一択です。

  
// BAD: 探索 2 回
if (dict.ContainsKey(key))
    var value = dict[key];

// GOOD: 探索 1 回
if (dict.TryGetValue(key, out var value))
    DoSomething(value);

SortedList 大量挿入最適化

  
// BAD: ランダム順挿入 -> O(n^2) に近づく
var sortedList = new SortedList<int, string>();
foreach (var item in unsortedItems)
    sortedList.Add(item.Key, item.Value);

// GOOD: 先にソートしてから挿入
var sorted = unsortedItems.OrderBy(x => x.Key).ToArray();
var sortedList = new SortedList<int, string>(sorted.Length);
foreach (var item in sorted)
    sortedList.Add(item.Key, item.Value);

9. 拡張: ConcurrentDictionary と読み取り専用公開

ConcurrentDictionary - マルチスレッド安全

  
// BAD: Dictionary は同時書き込みに非対応
private Dictionary<int, CachedData> cache = new();

// GOOD: ConcurrentDictionary
private ConcurrentDictionary<int, CachedData> cache = new();

var data = cache.GetOrAdd(key, k => LoadData(k));

ConcurrentDictionary は単一スレッド環境ではオーバーヘッドがあるため不要です。

ReadOnlyDictionary - 外部公開を安全化

  
private Dictionary<int, MonsterData> monsterTable;
private ReadOnlyDictionary<int, MonsterData> readOnlyTable;

public void LoadMasterData(IReadOnlyList<MonsterData> rawData)
{
    monsterTable = new Dictionary<int, MonsterData>(rawData.Count);
    foreach (var data in rawData)
        monsterTable[data.ID] = data;

    readOnlyTable = new ReadOnlyDictionary<int, MonsterData>(monsterTable);
}

public IReadOnlyDictionary<int, MonsterData> MonsterTable => readOnlyTable;

10. 総合比較まとめ

基準	Dictionary	SortedList	SortedDictionary
内部構造	Hash Table	Sorted Array	Red-Black Tree
整列維持	なし	あり	あり
検索	O(1)	O(log n)	O(log n)
挿入/削除	O(1)	O(n)	O(log n)
インデックスアクセス	なし	O(1)	なし
最小/最大	O(n)	O(1)	O(log n)
メモリ効率	中	良い	悪い
GC 負担	低	最小	高
適用シナリオ	ID 高速参照	整列 + 読取中心	整列 + 更新頻繁
ゲーム例	マスター/キャッシュ	ランキング	スケジューラ/タイムライン

チェックリスト

[✅] Dictionary の初期容量を設定したか?

[✅] Key が immutable で GetHashCode / Equals 契約を満たすか?

[✅] ContainsKey + indexer ではなく TryGetValue を使っているか?

[✅] 整列不要なのに SortedList/SortedDictionary を使っていないか?

[✅] SortedList 大量挿入時に事前ソートしているか?

[✅] Unity Mono で enum Key 使用時に custom comparer を検討したか?

[✅] Key 重複対策 (複合キー) を考慮したか?

[✅] マルチスレッドアクセスで ConcurrentDictionary を使っているか?

[✅] 外部公開を IReadOnlyDictionary で不変化しているか?

結論

データ構造の選択は「何が最強か」ではなく、どの条件に最適かの問題です。

Dictionary: buckets[] + entries[] のハッシュテーブル。順序不要な Key-Value で最良の基本選択。
SortedList: 整列済み配列ベース。省メモリとインデックスアクセスが強いが、中間挿入は O(n)。
SortedDictionary: 赤黒木ベース。整列維持しつつ更新が多い場面に強いが、ノード単位ヒープ確保で GC 負担が大きい。

内部構造を理解すれば、「なぜ遅いか」「なぜメモリを食うか」を素早く判断できます。描画パイプライン理解が最適化の基礎であるのと同じく、データ構造の内部理解は設計品質の基盤です。

C#, Data Structure

この記事は著者の CC BY 4.0 ライセンスの下で提供されています。

C# データ構造 - Dictionary と SortedList

はじめに

Part 1: 内部構造

1. Dictionary - Hash Table ベース

実際の内部構造: buckets + entries

検索手順 (TryGetValue)

リサイズ: 素数ベース

GetHashCode / Equals 契約

2. SortedList - 整列配列ベース

検索: 二分探索

挿入: 配列シフトのコスト

3. SortedDictionary - Red-Black Tree ベース

Red-Black Tree の規則

なぜ AVL ではなく RB Tree か

挿入時回転の例

Part 2: 性能比較

4. 時間計算量比較

5. 実測傾向

6. メモリ使用量比較

Part 3: 実戦活用

7. シナリオ別選択ガイド

マスターデータ参照 -> Dictionary

ランキング/リーダーボード -> SortedList (同点注意)

リアルタイムイベントタイムライン -> SortedDictionary

8. 実戦 Tips と注意点

Dictionary の初期容量設定

enum Key 使用時の注意 (Unity Mono)

SortedList 大量挿入最適化

9. 拡張: ConcurrentDictionary と読み取り専用公開

ConcurrentDictionary - マルチスレッド安全

ReadOnlyDictionary - 外部公開を安全化

10. 総合比較まとめ

チェックリスト

結論

人気タグ