スキルOfficialmonitoring

📊review-agent-insights

プラグイン: amplitude
ソース: GitHub で見る ↗

説明

Amplitudeからすべての最新AIエージェント結果を取得・統合し、優先順位付けして提示します。 `get_agent_results` で利用可能なすべてのAgentタイプを照会し、データの鮮度を検証した上で、インパクト順にランク付けされた統合レポートを生成します。次のような場合に使用: ユーザーが「AIは何を検出したか」「Agentのインサイトを見せて」「AIの発見事項はあるか」「Amplitudeは何を発見したか」「AIインサイトを確認したい」と尋ねる場合、またはAmplitudeのAI Agentが最近発見した内容のダイジェストを求める場合。

原文を表示

Retrieves, synthesizes, and prioritizes all recent AI agent results from Amplitude. Queries every agent type available in get_agent_results, validates freshness, and produces a unified narrative ranked by impact. Use when the user asks "what has the AI found", "show me agent insights", "any AI findings", "what did Amplitude discover", "review AI insights", or wants a digest of everything Amplitude's AI agents have surfaced recently.

ユースケース

✓AIエージェントの検出結果を確認したい
✓Amplitudeの最新インサイトを見たい
✓複数のAgent結果を一覧で取得したい
✓AIの発見内容をダイジェストで確認したい

本文（日本語訳）

AIエージェントのインサイトをレビュー

Amplitude の AIエージェントが最近発見した内容をすべて表示します。
get_agent_results で利用可能なすべてのエージェントタイプを照会し、情報の鮮度を検証したうえで、影響度順にランク付けした統合ナレティブと具体的なフォローアップアクションを提示します。

重要: ツールリファレンス

主要ツール:

Amplitude:get_agent_results — Amplitude の AIエージェントによって事前計算された分析結果を取得します。複数のエージェントタイプをサポートしています（現在のリストはツールの agent_type 列挙型を参照してください）。エージェントタイプごとに個別に照会します。すべてのタイプで created_after、created_before、query、agent_params、limit によるフィルタリングに対応しています。

補助ツール:

Amplitude:get_context / Amplitude:get_project_context — ユーザー、組織、プロジェクト情報を初期化します。
Amplitude:get_deployments — フラグが立った問題に対する修正がリリース済みかどうかを確認します（鮮度の検証に使用）。

手順

ステップ 1: コンテキストの初期化（1〜2回の呼び出し）

Amplitude:get_context を呼び出して、ユーザーの組織、プロジェクト、最近のアクティビティ、主要ダッシュボードを取得します。複数のプロジェクトが存在する場合は、どれをレビューするかユーザーに確認するか、広範囲のスキャンを希望する場合はすべてをレビューします。
対象プロジェクトの設定と AIコンテキストを取得するために Amplitude:get_project_context を呼び出します。

ユーザーのリクエストからレビュー期間を決定します:

デフォルト: 直近 7 日間（最新性とカバレッジのバランスが良い）。
「今日の新着は？」→ 直近 1〜2 日。
「今月分をまとめて教えて」→ 直近 14〜30 日。
レビュー期間の created_after タイムスタンプを必ず ISO 8601 形式で算出してください。

ステップ 2: 全エージェントタイプの照会（並列実行）

get_agent_results ツールのディスクリプタを確認し、列挙型で利用可能なすべての agent_type を把握します。エージェントタイプごとに 1 回、並列で呼び出しを実行してください。 各呼び出しで指定する値:

agent_type: 列挙型から取得したエージェントタイプ
created_after: ステップ 1 で算出したレビュー期間のタイムスタンプ
limit: 10

ユーザーが特定の領域（例: 「オンボーディングのインサイト」）を尋ねている場合は、その領域に合致する query をすべての呼び出しに追加します。エージェントタイプが agent_params（例: 影響度評価、カテゴリ、ダッシュボード ID など）による追加フィルタリングをサポートしている場合、ユーザーの要求がより絞り込まれたスコープを示唆するときはそれを活用して結果を絞り込みます。それ以外の場合は agent_params を省略して最広範な結果を取得します。

返された各結果について、以下の情報を記録します:

どのエージェントタイプが生成したか
主要な発見内容またはサマリー
分析が実行された日時（作成日）
そのエージェントタイプ固有のメタデータ（影響度評価、カテゴリ、ダッシュボード ID など）

エージェントタイプごとに結果が 1 件のみ返された場合、アーティファクトは自動展開されます。複数の結果がある場合は、プレビューを確認したうえで、session_id を使用して最も関連性の高い 2〜3 件（最新のものまたはユーザーのフォーカスエリアに合致するもの）のフルアーティファクトを取得します。

エージェントタイプの結果が 3 件未満で、かつ agent_params フィルタリングをサポートしている場合は、フィルタを緩和した 2 回目の呼び出しでカバレッジを広げることを検討します。

ステップ 3: 鮮度の検証

AIエージェントのインサイトは数日で陳腐化します。統合する前に、信頼性の低い情報を除外またはフラグ立てします。

作成日を確認します。 各発見について、今日からの経過日数を記録します:
- 3 日未満: 高信頼度 — 現在の情報として扱います。
- 3〜7 日: 中程度の信頼度 — 含めますが、経過日数を明記します。
- 7〜14 日: 低信頼度 — 同じ領域をカバーする新しい発見がない場合のみ含めます。「情報が古い可能性あり」とラベルを付けます。
- 14 日超: 陳腐化 — メインのナレティブから除外します。その領域に新しい情報が存在しない場合のみ簡単に言及します。
デプロイメントとのクロスリファレンス（1 回の呼び出し）。 get_deployments を 1 回呼び出します。AIが検出した各問題について、分析が実行された後に影響を受けた領域へ修正や変更がリリースされていないか確認します。リリースされていた場合は、アクティブな問題として提示するのではなく、「[デプロイメント名] によって解決済みの可能性あり」と記載します。
エージェントタイプ間で重複を排除します。 同じ問題が複数のエージェントタイプから報告されることがあります。それらを複数のエージェントからのエビデンスを持つ単一の発見として統合し、同じ問題を重複して提示しないようにします。

ステップ 4: 統合とランク付け

影響度とエビデンスの強さでランク付けします。
- 複数エージェントによる発見（複数のエージェントタイプがフラグを立てたもの）を最上位にランク付けします。
- 単一エージェントタイプからの高影響度または高信頼度の発見を次のランクに置きます。
- 低影響度または古い発見を最下位にランク付けします。
エージェントタイプではなく、テーマでグループ化します。 発見内容を、どのエージェントが生成したかではなく、プロダクトのテーマや問題領域（「チェックアウトフロー」「オンボーディング」「検索機能」など）でまとめます。各テーマ内では、関与したすべてのエージェントタイプからのエビデンスを織り交ぜます。
ギャップを特定します。 最近の結果が返ってこなかったエージェントタイプや、カバレッジのないプロダクト領域を記録します。

ステップ 5: レビューの提示

PM がチームに転送できるナレティブダイジェスト形式で出力を構成します。

必須セクション:

サマリー（3〜4 文）: 照会したエージェントタイプ、レビュー期間、結果の総件数、最も重要な発見 1 つ、および全体的な評価を記載します。
主要な発見（影響度順に 3〜7 件）:

各発見について:

### [発見タイトル — アクション指向で 10 語以内]
**影響度:** [Critical/High/Medium/Low] | **エージェント:** [関与したエージェントタイプの一覧] | **鮮度:** [X 日前]

**AIが発見した内容:** インサイトの説明 — どのような異常、摩擦、または問題が
検出されたか。プロダクト領域と各エージェントからのエビデンスを具体的に記載。

**鮮度の確認:** 分析後にデプロイメントがリリースされたか、または改めて検証が
必要かを記載。発見が 3 日未満の場合はこの行を省略。

**推奨アクション:** 具体的な次のステップを 1 つ記載。

カバレッジギャップ（2〜4 件）: 結果が返ってこなかったエージェントタイプや、AIカバレッジのないプロダクト領域を記載します。それぞれについて、実行すべきエージェントと対象コンテンツを提案します。
フォローアッププロンプト: 発見内容に基づいた次に掘り下げるべき具体的な選択肢を 2〜3 個提示して締めくくります。

ライティング基準:

構造より narrative（ナレティブ）を優先します。発見内容はデータベースのレコードではなく段落として記述します。
インサイトを先頭に置き、エージェントタイプの帰属は補足エビデンスとして使用します。
概数表記を使用します: 「42.37%」ではなく「約 42%」。
能動態のみ使用します。
鮮度を必ず明記します: 「最近発見された」ではなく「2 日前に検出された」。
結果に URL が含まれている場合は、Amplitude UIのセッション/アーティファクトへのリンクをインラインで記載します。
主要な発見の総文字数の目安: 400〜800 語程度。簡潔にまとめてください。

エッジケース

どのエージェントタイプからも結果が返ってこない場合。 最近の AIエージェント分析が見つからなかったことを報告します。照会したエージェントタイプを列挙し、主要なダッシュボードやフローで実行するようユーザーに提案します。
1 種類のエージェントタイプからのみ結果が返ってきた場合。 取得した内容を提示し、結果がなかったエージェントタイプを記載します。ギャップをアクション可能な推奨事項として提示します。
すべての結果が陳腐化している場合（14 日超）。 明確な陳腐化の警告とともに簡潔なサマリーを提示し、分析の再実行を推奨します。
発見件数が多すぎる場合。 主要な発見を最大 7 件に制限します。影響度 × 鮮度 × エビデンスの幅でランク付けします。総件数を明記します。
ユーザーが特定の領域について尋ねている場合。 すべてのエージェントタイプの呼び出しに query パラメータを追加します。関連する発見のみを提示します。
未知のエージェントタイプの結果が返ってきた場合。 新しいエージェントタイプが未知の形式で結果を返した場合は、エージェントタイプ名と利用可能なメタデータとともに生の発見内容を提示します。エージェントタイプが不明なだけで結果をスキップしないでください。

使用例

例 1: 広範なレビュー

次のような場合に使用: ユーザーが「AIは最近何を発見しましたか？」と尋ねている場合

アクション:

コンテキストを取得 — 主要プロジェクトとダッシュボードを特定
利用可能なすべてのエージェントタイプに対して get_agent_results を照会し、created_after を 7 日前に設定して並列で実行
鮮度を検証 — デプロイメントとクロスリファレンスし、陳腐化した発見を除外
プロダクトテーマ別に統合・グループ化し、各発見に関与したエージェントタイプを記録
影響度順にランク付けした統合発見を提示し、結果のなかったエージェントタイプのカバレッジギャップを記載

例 2: 特定領域のレビュー

次のような場合に使用: ユーザーが「オンボーディングに関する AIインサイトはありますか？」と尋ねている場合

アクション:

コンテキストを取得
すべてのエージェントタイプに query: "onboarding"、created_after を 7 日前に設定して照会
オンボーディング関連の発見のみに絞り込む
発見内容と、オンボーディングに関して結果がなかったエージェントタイプのギャップ推奨事項を提示

例 3: 結果が

原文（English）を表示

Review Agent Insights

Surface everything Amplitude's AI agents have found recently. Query every available agent type in get_agent_results, validate for staleness, and synthesize into a unified narrative ranked by impact with concrete follow-up actions.

CRITICAL: Tool Reference

Primary tool:

Amplitude:get_agent_results — Retrieve pre-computed analyses from Amplitude's AI agents. Supports multiple agent types (check the tool's agent_type enum for the current list). Each agent type is queried separately. All support filtering by created_after, created_before, query, agent_params, and limit.

Supporting tools:

Amplitude:get_context / Amplitude:get_project_context — Bootstrap user, org, and project info.
Amplitude:get_deployments — Check whether fixes have shipped for flagged issues (staleness validation).

Instructions

Step 1: Bootstrap Context (1-2 calls)

Call Amplitude:get_context to get the user's org, projects, recent activity, and key dashboards. If multiple projects, ask which to review — or review all if the user wants a broad scan.
Call Amplitude:get_project_context for the target project's settings and AI context.

Determine the review window from the user's request:

Default: last 7 days (good balance of recency and coverage).
"What's new today?" → last 1-2 days.
"Catch me up on this month" → last 14-30 days.
Always compute the created_after ISO 8601 timestamp for the review window.

Step 2: Query All Agent Types (parallel)

Check the get_agent_results tool descriptor to discover every available agent_type in the enum. Make one call per agent type, in parallel. For each:

agent_type: the agent type from the enum
created_after: the review window timestamp from Step 1
limit: 10

If the user asked about a specific area (e.g., "onboarding insights"), add a query matching that area to every call. If an agent type supports additional filtering via agent_params (e.g., impact ratings, categories, dashboard IDs), use them to focus results when the user's request suggests a narrower scope — otherwise omit agent_params to get the broadest view.

For each result returned, note:

Which agent type produced it
The key findings or summary
When the analysis was run (creation date)
Any metadata specific to that agent type (impact ratings, categories, dashboard IDs, etc.)

If exactly 1 result is returned for an agent type, artifacts auto-expand. If multiple results, note the previews and fetch full artifacts for the 2-3 most relevant ones (by recency or by matching the user's focus area) using session_id.

If any agent type returns fewer than 3 results and supports agent_params filtering, consider a second call with relaxed filters to broaden coverage.

Step 3: Validate Freshness

Agent insights go stale within days. Before synthesizing, filter out or flag anything unreliable.

Check creation dates. For each finding, note how old it is relative to today:
- < 3 days old: High confidence — treat as current.
- 3-7 days old: Medium confidence — include but note the age.
- 7-14 days old: Lower confidence — include only if no newer findings cover the same area. Label as "may be outdated."
- > 14 days old: Stale — exclude from the main narrative. Mention in passing only if nothing newer exists for that area.
Cross-reference with deployments (1 call). Call get_deployments once. For each AI-detected issue, check if a deployment shipped a fix or change to the affected area after the analysis was run. If so, note the finding as "potentially resolved by [deployment]" rather than presenting it as an active issue.
Deduplicate across agent types. The same problem may surface from multiple agent types. Merge these into a single finding with multi-agent evidence — don't present the same issue multiple times.

Step 4: Synthesize and Rank

Rank by impact and evidence strength.
- Multi-agent findings (flagged by more than one agent type) rank highest.
- High-impact or high-confidence findings from a single agent type rank next.
- Low-impact or older findings rank lowest.
Group by theme, not by agent type. Organize findings by product theme or problem area ("Checkout flow," "Onboarding," "Search feature"), not by which agent produced them. Within each theme, weave together evidence from all contributing agent types.
Identify gaps. Note agent types that returned no recent results, or product areas with no coverage.

Step 5: Present the Review

Structure the output as a narrative digest that a PM could forward to their team.

Required sections:

Summary (3-4 sentences): Which agent types were queried, the review window, how many results total, the single most important finding, and overall assessment.
Key Findings (3-7 items, ranked by impact):

For each finding:

### [Finding Title — action-oriented, ≤10 words]
**Impact:** [Critical/High/Medium/Low] | **Agents:** [list agent types that contributed] | **Freshness:** [X days old]

**What the AI found:** Describe the insight — what anomaly, friction, or issue was
detected. Be specific about the product area and the evidence from each agent.

**Staleness check:** Note if deployments shipped after the analysis, or if the finding
needs fresh validation. Omit this line if the finding is < 3 days old.

**Recommended action:** One concrete next step.

Coverage Gaps (2-4 items): Agent types with no results, or product areas with no AI coverage. For each, suggest what to do — which agent to run and on what.
Follow-on prompt: End with 2-3 specific options for what to dig into next, framed around the findings.

Writing standards:

Narrative over structure. Write findings as paragraphs, not database records.
Lead with the insight, use agent type attribution as supporting evidence.
Approximate: "~42%" not "42.37%".
Active voice only.
Always state the freshness: "detected 2 days ago" not "recently found."
Link to Amplitude UI sessions/artifacts inline when URLs are available in the results.
Total length: 400-800 words for the main findings. Be concise.

Edge Cases

No results from any agent type. Report that no recent AI agent analyses were found. List the agent types that were queried and suggest the user run them on their key dashboards and flows.
Results from only one agent type. Present what you have and note which agent types had no results. Frame the gap as an actionable recommendation.
All results are stale (> 14 days old). Present a brief summary with clear staleness warnings and recommend re-running analyses.
Overwhelming number of findings. Cap at 7 key findings. Rank by impact × freshness × evidence breadth. Mention the total count.
User asks about a specific area. Add a query parameter to every agent type call. Present only relevant findings.
Unrecognized agent type results. If a new agent type returns results in a format you haven't seen before, present the raw findings with the agent type name and any available metadata. Don't skip results just because the agent type is unfamiliar.

Examples

Example 1: Broad Review

User says: "What has the AI found recently?"

Actions:

Get context — identify key project and dashboards
Check get_agent_results for all available agent types, query each in parallel with created_after set to 7 days ago
Validate freshness — cross-reference against deployments, filter out stale findings
Synthesize and group by product theme, noting which agent types contributed to each finding
Present unified findings ranked by impact, note coverage gaps for agent types with no results

Example 2: Focused Area Review

User says: "Any AI insights about onboarding?"

Actions:

Get context
Query all agent types with query: "onboarding", created_after set to 7 days ago
Filter to only onboarding-related findings
Present findings + gap recommendations for agent types that returned nothing for onboarding

Example 3: Nothing Found

User says: "Show me all AI agent insights"

Actions:

Get context
All agent type queries return empty within the review window
Present: "No AI agent results found in the last 7 days. Here's how to generate them:" — list each agent type that was queried, what it does, and suggest specific content to analyze

原文・著作権は Anthropic および各プラグイン作者に帰属します。日本語訳は Claude API による自動翻訳です。