スキルOfficialdevelopment

🎯finetuning-technique

プラグイン: sagemaker-ai
ソース: GitHub で見る ↗

説明

次のような場合に使用: ユーザーがファインチューニングを行うことを決定し、手法を選択する必要がある場合、またはモデルに対して手法の検証が必要な場合。ユーザーのユースケースに応じてファインチューニング手法（SFT、DPO、RLVR、または RLAIF）を選択し、選択したモデルで利用可能なレシピに対して検証を行います。事前に model-selection スキルを使用してベースモデルが選択済みであることが必要です。

原文を表示

Selects a fine-tuning technique (SFT, DPO, RLVR, or RLAIF) for the user's use case and validates it against the selected model's available recipes. Use when the user has decided to finetune and needs to choose a technique, or when the technique needs to be validated against a model. Requires a base model to already be selected (via model-selection skill).

ユースケース

✓ファインチューニング手法を選択する必要があるとき
✓選択した手法がモデルで利用可能か検証するとき
✓ベースモデル選択後に手法を決定するとき

本文（日本語訳）

ファインチューニング手法

ユーザーのユースケースに基づいてファインチューニング手法の選択をガイドし、選択したモデルとの互換性を検証します。

次のような場合に使用

ファインチューニングの実施を決定しており、手法を選ぶ必要がある場合
ファインチューニング手法を変更したい場合
選択済みのモデルに対して手法の検証が必要な場合

前提条件

ベースモデルが選択済みであること（model-selection スキル経由）。モデル名およびハブ名が既知である必要があります。
use_case_spec.md ファイルが存在すること。存在しない場合は、先に use-case-specification スキルを起動して生成してください。

ワークフロー

ステップ 1: ファインチューニング手法の決定

references/finetune_technique_selection_guide.md を参照し、ユースケースおよびユーザーのニーズに基づいて最適な手法（SFT・DPO・RLVR・RLAIF）を推奨します。

推奨内容とその根拠をユーザーに提示し、推奨手法で進めるか、別の手法を希望するかを確認します。

ステップ 2: 手法の利用可能性の検証

ユーザーが手法を確定したら、以下のコマンドを実行して選択済みモデルで利用可能なファインチューニング手法を取得します: python finetuning-technique/scripts/get_recipes.py <model-name> <hub-name>
- このコマンドは、モデルが実際にサポートしている手法のうち SFT・DPO・RLVR・RLAIF に絞り込んだ結果を返します。サポート対象はこの 4 手法のみです。モデルのレシピにほかの手法が含まれていても無視してください。
選択した手法がそのモデルで利用可能な場合は、ステップ 3 に進みます。
選択した手法がそのモデルで利用できない場合は、SageMaker 上では当該モデルがその手法をサポートしていない旨を説明し、選択した手法をサポートする別のモデルを選ぶために model-selection に戻ることを提案します。

ステップ 3: 選択内容の確認

以下のサマリーをユーザーに提示します:

選択内容:
- ベースモデル: [モデル名]
- ファインチューニング手法: [SFT/DPO/RLVR/RLAIF]

参考資料

references/finetune_technique_selection_guide.md — 手法ガイダンス（SFT/DPO/RLVR/RLAIF）

原文（English）を表示

Finetuning Technique

Guides the user through selecting a fine-tuning technique based on their use case and validates compatibility with the selected model.

When to Use

User has decided to finetune and needs to choose a technique
User wants to change their finetuning technique
Technique needs to be validated against a selected model

Prerequisites

A base model has been selected (via model-selection skill). The model name and hub must be known.
A use_case_spec.md file exists. If not, activate the use-case-specification skill to generate it first.

Workflow

Step 1: Determine Finetuning Technique

Consult references/finetune_technique_selection_guide.md to recommend the best-fit technique based on the use case and the user's needs (SFT, DPO, RLVR, RLAIF).

Present the recommendation and reasoning to the user. Ask if they'd like to go with the recommendation or prefer a different technique.

Step 2: Validate Technique Availability

Once the user confirms a technique, retrieve the finetuning techniques available for the selected model by running: python finetuning-technique/scripts/get_recipes.py <model-name> <hub-name>
- This returns only the techniques the model actually supports, filtered to SFT, DPO, RLVR, and RLAIF. Only these four techniques are supported — ignore any other techniques even if the model's recipes include them.
If the chosen technique is available for the model, proceed to Step 3.
If the chosen technique is not available for the model, explain that the selected model does not support it on SageMaker and offer to go back to model-selection to pick a different model that supports the chosen technique.

Step 3: Confirm Selections

Present a summary to the user:

Here's what we've selected:
- Base model: [model name]
- Fine-tuning technique: [SFT/DPO/RLVR/RLAIF]

References

references/finetune_technique_selection_guide.md — Technique guidance (SFT/DPO/RLVR/RLAIF)

原文・著作権は Anthropic および各プラグイン作者に帰属します。日本語訳は Claude API による自動翻訳です。