スキルOfficialmonitoring

🔍replay-ux-audit

プラグイン: amplitude
ソース: GitHub で見る ↗

説明

複数のセッションにわたってAmplitude Session Replaysを検索・分析し、UXの摩擦パターンを抽出します。ユーザーが苦労している箇所、躊躇している箇所、離脱している箇所を示す、優先順位付きの摩擦マップを生成します。次のような場合に使用: PMやデザイナーが「どこに摩擦があるか」「何がユーザーを混乱させているか」「このページのUX課題」「このフローがぎこちない理由」「ユーザー体験の監査をしたい」と尋ねているとき、または特定の機能やフローにおけるユーザビリティ問題の定性的な根拠を求めているとき。

原文を表示

Finds and analyzes Amplitude Session Replays to surface UX friction patterns across multiple sessions. Produces a ranked friction map showing where users struggle, hesitate, or abandon. Use when a PM or designer asks "where's the friction", "what's confusing users", "UX issues on this page", "why is this flow clunky", "audit the user experience", or wants qualitative evidence of usability problems in a specific feature or flow.

ユースケース

✓UXの摩擦箇所を特定するとき
✓ユーザーが混乱している原因を調査するとき
✓ページのUX課題を分析するとき
✓フローのぎこちなさの理由を把握するとき
✓ユーザー体験の監査をするとき

本文（日本語訳）

リプレイ UX 監査

特定の機能・ページ・フローについてセッションリプレイを5〜10件視聴し、パターンを統合してランク付きのフリクションマップに落とし込みます。このスキルは、手作業で何時間もかかるリプレイ視聴を、実際のユーザー行動に基づいた構造的な UX レポートへと変換します。

重要: ツールリファレンス

主要ツール:

Amplitude:get_session_replays — イベントフィルター・ユーザープロパティ・時間ウィンドウに合致するセッションを検索します。特定の機能やフローを対象としたセッションの絞り込みに使用します。
Amplitude:get_session_replay_events — リプレイをインタラクションタイムライン（ナビゲーション・クリック・入力・スクロール）にデコードします。実際に「視聴する」際に使用するツールです。

補助ツール:

Amplitude:get_events — 有効なイベント名を確認します。イベント名を推測で使用しないでください。
Amplitude:get_event_properties — フィルタリング用のプロパティ（ページパス・機能エリアなど）を確認します。
Amplitude:query_chart — 定量的な文脈情報（ファネルのコンバージョン率・機能の採用状況など）を取得し、定性的なリプレイの知見を裏付けます。
Amplitude:get_feedback_insights / Amplitude:get_feedback_mentions — リプレイで発見したフリクションと、カスタマーフィードバックのテーマを照合します。

手順

ステップ 1: 監査スコープの定義

ユーザーのリクエストから、監査対象を特定します:

ページまたは URL パターン: 特定のページ（例: /settings、/checkout）
機能またはフロー: 複数ステップのプロセス（例: オンボーディング、レポート作成）
イベントベース: 特定のイベントを含むセッション（例: "Export Clicked"）
広範囲指定: 「プロダクト全体を監査して」という場合は範囲を絞ります。「どのエリアから始めますか？」と確認し、特定できる場合はトラフィックの多いページや既知の問題エリアから 2〜3 件の候補を提案します。

また以下も確認します:

時間ウィンドウ: 指定がない場合は直近 14 日間をデフォルトとします。
ユーザーセグメント（任意）: 特定のプラン・プラットフォーム・コホート・ユーザータイプ。

ステップ 2: コンテキストの取得とイベントの確認

Amplitude:get_context を呼び出します。複数のプロジェクトがある場合は、監査対象を確認します。
Amplitude:get_events を呼び出して、対象エリアに関連するイベントを特定します。以下を探します:
- 対象エリアのページビューまたはナビゲーションイベント
- フロー内の主要なインタラクションイベント（クリック・フォーム送信など）
- フリクションを示す可能性のあるエラーや失敗イベント
ユーザーがフローやファネルに言及している場合は、セッションをフィルタリングできるよう、フローの主要ステップのイベントを特定します。

ステップ 3: 定量的なベースラインの収集（任意だが推奨）

リプレイを視聴する前に、1〜2 件のチャートクエリでコンテキストを整理します。呼び出し上限: 最大 2 回。

ファネルの監査: Amplitude:query_chart で現在のコンバージョン率を取得し、最も離脱が多いステップを特定します。これによってリプレイで注目すべき箇所が明確になります。
ページの監査: ページのトラフィック量とエラー率をクエリして規模感を把握します。
機能の監査: 採用率・利用頻度をクエリして、どれだけのユーザーが利用しているかを把握します。

この定量的ベースラインがあることで、定性的な知見がより実行可能なものになります。「ユーザーがステップ 3 で混乱しているようだ」よりも、「ユーザーの 40% がステップ 3 で離脱しており、リプレイでは以下の行動が確認された」という形のほうが説得力があります。

ステップ 4: 対象セッションの検索

Amplitude:get_session_replays を使って 8〜12 件のセッションを検索します（リプレイデータが欠損しているセッションへの対応として limit: 12 を指定します）。

監査タイプ別のフィルター戦略:

ページ監査: 対象ページのイベントでフィルタリング（利用可能な場合はページパスプロパティを使用）。
フロー監査: フローのエントリーイベントでフィルタリング。必要に応じてフローを完了しなかったセッションを追加フィルタリングし、離脱箇所の特定に集中します。
機能監査: 機能の主要インタラクションイベントでフィルタリング。
セグメント比較: 各セグメントごとに 2 回検索し、行動を比較します。

ユーザーがセグメント（プランタイプ・プラットフォームなど）を指定した場合は、ユーザープロパティフィルターを追加します。

ステップ 5: セッションの視聴 — インタラクションタイムラインの抽出

各セッションについて、event_limit: 300 を指定して Amplitude:get_session_replay_events を呼び出します。

予算: 5〜8 セッション。 データが空または極めて少ないセッションはスキップします。

各セッションを分析する際は、以下のフリクションシグナルを記録します:

シグナル	タイムラインで確認すべきこと
レイジクリック	短時間内に同一座標で 3 回以上のクリック
躊躇	ページのナビゲーションから最初のインタラクションまでの長い停止（10 秒以上）
行き来	あるページに遷移し、戻り、また進む繰り返し
入力の放棄	フィールドへの入力を始めた後、送信せずに離脱
過度なスクロール	ユーザーが何かを探していることを示す大きなスクロール量
デッドエンドナビゲーション	ページに遷移して即座に離脱（数秒以内のバウンス）
繰り返し試行	同じアクションを複数回実行（フォームの再送信・ボタンの再クリックなど）

各セッションについて、以下の内容を簡単にまとめます:

対象エリアで訪問したページ
実施した主要アクション
観察されたフリクションシグナル（タイムスタンプ付き）
ユーザーが目的を達成できたかどうか

ステップ 6: フリクションパターンの統合

これが分析の核となるステップです。視聴したすべてのセッションの知見を集約します。

フリクションシグナルを発生場所でグループ化します。 発生したページやステップごとに観察内容をクラスタリングします。
頻度を集計します。 視聴したセッションのうち何件でそのフリクションが発生しましたか？「Y 件中 X 件で確認」という形で表します。
深刻度を評価します。 以下の基準を使用します:

深刻度	判断基準
Critical（致命的）	タスクの完了を妨げる。ユーザーが諦めるかエラーが発生する。セッションの 50% 以上で確認。
High（高）	大きな混乱や遅延を引き起こす。ユーザーは最終的に成功するが、明らかな苦労が見られる。30% 以上のセッションで確認。
Medium（中）	軽微な躊躇や非効率なパスを引き起こす。ユーザーはすぐに回復する。20% 以上のセッションで確認。
Low（低）	見た目上の問題や軽微な不便。セッションの 20% 未満またはエッジケースのみで確認。

根本原因の仮説を立てます。 各フリクションパターンについて、発生理由を仮説として整理します:
- UI ラベルや階層が不明確
- アクション後のフィードバックが欠如（ローディング状態・確認メッセージなど）
- 予期しない動作（クリックしても反応しない・ページが応答しないなど）
- 情報がユーザーの期待する場所にない（過度なスクロール・検索が必要）
- エラー状態からの明確な回復パスがない
- ステップ数が多すぎる・認知負荷が高い
フィードバックとの照合（利用可能な場合）。フリクションの知見からキーワードを抽出して Amplitude:get_feedback_insights を呼び出します。リプレイで確認している内容と同じことをユーザーが不満として挙げている場合、それは高い確信度を持つシグナルとなります。

ステップ 7: UX 監査の報告

PM やデザイナーが実際に行動できるフリクションマップとして出力を構成します。

必須セクション:

監査サマリー（3〜4 文）: 監査対象・視聴セッション数・最も重要な知見・全体的な UX の健全性評価。デザインレビュードキュメントにそのまま貼り付けられる形式のナラティブとして記述します。
スコープと方法論:
- 監査した機能・フロー・ページ
- 時間ウィンドウ
- 分析セッション数: N 件（リプレイリンク付き）
- ユーザーセグメント（フィルタリングした場合）
- 定量的ベースライン（ステップ 3 で収集した場合）
フリクションマップ — 深刻度順、次に頻度順でランク付け:

各フリクションポイントについて:

### [フリクションポイントのタイトル — 行動指向で 10 語以内]
**深刻度:** [Critical / High / Medium / Low] | **頻度:** Y 件中 X 件で確認

**発生内容:** ユーザーの行動を具体的に説明 — 何をしているか・どこで躊躇しているか・
何が問題になっているか。ページとインタラクションを具体的に記述します。

**推定原因:** このフリクションが存在する理由についての仮説。

**根拠:**
- このパターンを示すセッションリプレイのリンク
- 定量データ（利用可能な場合）: このステップのコンバージョン率・エラー率など
- カスタマーフィードバックの引用（見つかった場合）

**改善提案:** 具体的で実行可能な推奨事項を 1 件。

良好なパターン（1〜2 件）: うまく機能している点。セッション全体を通じてスムーズだった体験の部分。バランスをとり、維持すべき要素を明確にします。
推奨される次のステップ（3〜5 件、番号付き）: 各項目は動詞で始めます。影響度の大きい順に優先します。例:
- 「[具体的な要素] を再設計して [アクション] をより見つけやすくする」
- 「[アクション] 後にローディングインジケーターを追加してレイジクリックを削減する」
- 「[提案した変更] で A/B テストを実施して仮説を検証する」
- 「このフリクションを定量的に追跡するために [特定のインタラクション] のイベントを計測する」
- 「[特定のセグメント] に絞ってさらに 5 件のセッションを視聴し、セグメント固有の問題かどうかを確認する」

エッジケース

対象エリアのセッションが見つからない場合。 その機能のトラフィックが少ないか、該当ページのイベントが計測されていない可能性があります。この旨を報告し、「[エリア] にイベントトラッキングを追加することで、セッションリプレイのフィルタリングが可能になります」と提案します。
セッションが非常に短い場合。

原文（English）を表示

Replay UX Audit

Watch 5-10 session replays for a specific feature, page, or flow, then synthesize patterns into a ranked friction map. This skill turns hours of manual replay watching into a structured UX report grounded in real user behavior.

CRITICAL: Tool Reference

Primary tools:

Amplitude:get_session_replays — Find sessions matching event filters, user properties, or time windows. Use this to target sessions for a specific feature or flow.
Amplitude:get_session_replay_events — Decode a replay into an interaction timeline: navigations, clicks, inputs, scrolls. This is what you "watch."

Supporting tools:

Amplitude:get_events — Discover valid event names. Never guess event names.
Amplitude:get_event_properties — Discover properties for filtering (page path, feature area, etc.).
Amplitude:query_chart — Pull quantitative context (funnel conversion rates, feature adoption) to anchor the qualitative replay findings.
Amplitude:get_feedback_insights / Amplitude:get_feedback_mentions — Cross-reference replay friction with customer feedback themes.

Instructions

Step 1: Define the Audit Scope

Determine what to audit from the user's request:

Page or URL pattern: A specific page (e.g., /settings, /checkout)
Feature or flow: A multi-step process (e.g., onboarding, report creation)
Event-based: Sessions containing a specific event (e.g., "Export Clicked")
Broad: "Audit the whole product" — narrow this down. Ask: "Which area would you like me to start with?" Suggest 2-3 areas based on high-traffic pages or known problem areas if you can identify them.

Also determine:

Time window: Default to last 14 days unless specified.
User segment (optional): Specific plan, platform, cohort, or user type.

Step 2: Get Context and Discover Events

Call Amplitude:get_context. If multiple projects, ask which to audit.
Call Amplitude:get_events to find events related to the target area. Look for:
- Page view or navigation events for the target area
- Key interaction events (clicks, form submissions) within the flow
- Error or failure events that may indicate friction
If the user mentioned a flow or funnel, identify the key step events so you can filter sessions that attempted the flow.

Step 3: Gather Quantitative Baseline (Optional but Recommended)

Before watching replays, establish context with 1-2 chart queries. Budget: 2 calls max.

If auditing a funnel: Use Amplitude:query_chart to get the current conversion rate and identify the worst drop-off step. This tells you where to focus your replay attention.
If auditing a page: Query the page's traffic volume and any error rates to understand scale.
If auditing a feature: Query adoption/usage frequency to understand how many users interact with it.

This quantitative baseline makes your qualitative findings more actionable — "40% of users drop off at step 3, and here's what we see them doing" is stronger than "users seem confused at step 3."

Step 4: Find Target Sessions

Use Amplitude:get_session_replays to find 8-12 sessions (request limit: 12 to allow for some sessions with missing replay data).

Filter strategy by audit type:

Page audit: Filter by event on that page (use page path property if available).
Flow audit: Filter by the entry event of the flow. Optionally add a second filter for sessions that did NOT complete the flow (to focus on drop-offs).
Feature audit: Filter by the feature's key interaction event.
Segment comparison: Run two searches — one for each segment — to compare behavior.

If the user specified a segment (plan type, platform, etc.), add user property filters.

Step 5: Watch Sessions — Extract Interaction Timelines

For each session, call Amplitude:get_session_replay_events with event_limit: 300.

Budget: 5-8 sessions. Skip sessions that return empty or minimal data.

While analyzing each session, track these friction signals:

Signal	What to look for in the timeline
Rage clicks	3+ clicks on the same coordinates within a short time span
Hesitation	Long pauses (>10 seconds) between navigation and first interaction on a page
Back-and-forth	Navigating to a page, then back, then forward again
Abandoned inputs	Starting to type in a field, then navigating away without submitting
Excessive scrolling	Large scroll deltas suggesting the user is searching for something
Dead-end navigation	Visiting a page and immediately leaving (bounce within seconds)
Repeat attempts	Performing the same action multiple times (re-submitting a form, re-clicking a button)

For each session, write a brief summary:

Pages visited in the target area
Key actions taken
Friction signals observed (with timestamps)
Whether the user completed their apparent goal

Step 6: Synthesize Friction Patterns

This is the core analytical step. Aggregate findings across all watched sessions.

Group friction signals by location. Cluster observations by the page or step where they occurred.
Count frequency. How many of the watched sessions showed this friction? Express as "seen in X of Y sessions."
Assess severity. Use this rubric:

Severity	Criteria
Critical	Blocks task completion. User gives up or encounters an error. Seen in 50%+ of sessions.
High	Causes significant confusion or delay. User eventually succeeds but with visible struggle. Seen in 30%+ of sessions.
Medium	Causes minor hesitation or suboptimal paths. User recovers quickly. Seen in 20%+ of sessions.
Low	Cosmetic or minor annoyance. Seen in <20% of sessions or only in edge cases.

Identify root cause hypotheses. For each friction pattern, hypothesize why it happens:
- Unclear UI labeling or hierarchy
- Missing feedback after an action (loading state, confirmation)
- Unexpected behavior (click does nothing, page doesn't respond)
- Information not where users expect it (excessive scrolling/searching)
- Error state without clear recovery path
- Too many steps or cognitive load
Cross-reference with feedback (if available). Call Amplitude:get_feedback_insights with keywords from your friction findings. If users are complaining about the same thing you're seeing in replays, that's high-confidence signal.

Step 7: Present the UX Audit

Structure the output as a friction map that a PM or designer can act on.

Required sections:

Audit Summary (3-4 sentences): What was audited, how many sessions were watched, the single biggest finding, and overall UX health assessment. Written as a narrative you could paste into a design review doc.
Scope & Methodology:
- Feature/flow/page audited
- Time window
- Sessions analyzed: N (with replay links)
- User segment (if filtered)
- Quantitative baseline (if gathered in Step 3)
Friction Map — Ranked by severity, then frequency:

For each friction point:

### [Friction Point Title — action-oriented, ≤10 words]
**Severity:** [Critical/High/Medium/Low] | **Frequency:** Seen in X of Y sessions

**What happens:** Describe the user behavior observed — what they do, where they
hesitate, what goes wrong. Be specific about the page and interaction.

**Likely cause:** Your hypothesis for why this friction exists.

**Evidence:**
- Session replay links showing this pattern
- Quantitative data (if available): conversion rate at this step, error rate, etc.
- Customer feedback quotes (if found)

**Suggested fix:** One concrete, actionable recommendation.

Positive Patterns (1-2 items): What's working well. Which parts of the experience were smooth across sessions. This provides balance and highlights what to preserve.
Recommended Next Steps (3-5 numbered items): Start each with a verb. Prioritize by impact. Examples:
- "Redesign the [specific element] to make [action] more discoverable"
- "Add a loading indicator after [action] to reduce rage clicks"
- "Run an A/B test on [proposed change] to validate the hypothesis"
- "Instrument [specific interaction] to track this friction quantitatively"
- "Watch 5 more sessions filtered to [specific segment] to confirm if this is segment-specific"

Edge Cases

No sessions found for the target area. The feature may have low traffic or events may not be instrumented for that page. Report this and suggest: "Consider adding event tracking to [area] so session replays can be filtered to it."
Sessions are too short. If most sessions are <30 seconds with minimal interactions, the page may have a bounce problem rather than a friction problem. Report this as a finding and suggest investigating why users leave so quickly.
All sessions look smooth. This is a valid finding. Report that the UX appears healthy based on N sessions. Suggest looking at a different area or a specific user segment that may have different behavior.
Replay events are sparse. Some sessions may have limited interaction data (ad blockers, slow connections). Skip these and note how many were skipped. If most sessions are sparse, note it as a data quality issue.
User asks to audit "everything." Decline politely. Suggest starting with the highest-traffic flow or the area with the worst funnel conversion. Offer to audit additional areas after the first one.
nodeId limitations. Interaction timelines show coordinates and node IDs, not element names. Describe actions by page context and position: "clicks in the header area," "interacts with the form's third field." Avoid asserting specific element identity unless clearly inferable from the page URL and action sequence.

Examples

Example 1: Flow Audit

User says: "Audit the onboarding experience for new users"

Actions:

Get context, discover onboarding-related events
Query the onboarding funnel for conversion rates and worst drop-off step
Find 8-10 sessions of new users going through onboarding
Extract timelines, track friction signals at each step
Synthesize: "4 of 7 users hesitated for 15+ seconds on the workspace setup step. 3 users navigated back to re-read instructions."
Present friction map ranked by severity with replay links

Example 2: Page Audit

User says: "What's the UX like on our pricing page?"

Actions:

Get context, find pricing page events (page view, plan selection, CTA clicks)
Query pricing page traffic and click-through rate as baseline
Find 8 sessions that visited the pricing page
Extract timelines, focus on: how far users scroll, what they click, whether they compare plans, how long they stay
Synthesize patterns: excessive scrolling (plan comparison is below fold), hesitation on CTA (unclear pricing)
Present friction map with specific redesign suggestions

Example 3: Feature Audit with Segment

User says: "Are enterprise users having trouble with the report builder?"

Actions:

Get context, find report builder events
Filter sessions to enterprise plan users + report builder events
Extract timelines from 6-8 sessions
Focus on: completion rate of report creation, where users get stuck, any error patterns
Cross-reference with feedback filtered to "report" keywords
Present findings specific to enterprise segment, noting if this differs from general population

原文・著作権は Anthropic および各プラグイン作者に帰属します。日本語訳は Claude API による自動翻訳です。