Loading source
Pulling the file list, source metadata, and syntax-aware rendering for this listing.
Source from repo
Designs statistically valid A/B tests with proper hypothesis structure, sample size calculation, and measurement planning.
Files
Skill
Size
Entrypoint
Format
Open file
Syntax-highlighted preview of this file as included in the skill package.
references/test-templates.md
1# A/B Test Templates Reference23Templates for planning, documenting, and analyzing experiments.45## Contents6- Test Plan Template7- Results Documentation Template8- Test Repository Entry Template9- Quick Test Brief Template10- Stakeholder Update Template11- Experiment Prioritization Scorecard12- Hypothesis Bank Template1314## Test Plan Template1516```markdown17# A/B Test: [Name]1819## Overview20- **Owner**: [Name]21- **Test ID**: [ID in testing tool]22- **Page/Feature**: [What's being tested]23- **Planned dates**: [Start] - [End]2425## Hypothesis2627Because [observation/data],28we believe [change]29will cause [expected outcome]30for [audience].31We'll know this is true when [metrics].3233## Test Design3435| Element | Details |36|---------|---------|37| Test type | A/B / A/B/n / MVT |38| Duration | X weeks |39| Sample size | X per variant |40| Traffic allocation | 50/50 |41| Tool | [Tool name] |42| Implementation | Client-side / Server-side |4344## Variants4546### Control (A)47[Screenshot]48- Current experience49- [Key details about current state]5051### Variant (B)52[Screenshot or mockup]53- [Specific change #1]54- [Specific change #2]55- Rationale: [Why we think this will win]5657## Metrics5859### Primary60- **Metric**: [metric name]61- **Definition**: [how it's calculated]62- **Current baseline**: [X%]63- **Minimum detectable effect**: [X%]6465### Secondary66- [Metric 1]: [what it tells us]67- [Metric 2]: [what it tells us]68- [Metric 3]: [what it tells us]6970### Guardrails71- [Metric that shouldn't get worse]72- [Another safety metric]7374## Segment Analysis Plan75- Mobile vs. desktop76- New vs. returning visitors77- Traffic source78- [Other relevant segments]7980## Success Criteria81- Winner: [Primary metric improves by X% with 95% confidence]82- Loser: [Primary metric decreases significantly]83- Inconclusive: [What we'll do if no significant result]8485## Pre-Launch Checklist86- [ ] Hypothesis documented and reviewed87- [ ] Primary metric defined and trackable88- [ ] Sample size calculated89- [ ] Test duration estimated90- [ ] Variants implemented correctly91- [ ] Tracking verified in all variants92- [ ] QA completed on all variants93- [ ] Stakeholders informed94- [ ] Calendar hold for analysis date95```9697---9899## Results Documentation Template100101```markdown102# A/B Test Results: [Name]103104## Summary105| Element | Value |106|---------|-------|107| Test ID | [ID] |108| Dates | [Start] - [End] |109| Duration | X days |110| Result | Winner / Loser / Inconclusive |111| Decision | [What we're doing] |112113## Hypothesis (Reminder)114[Copy from test plan]115116## Results117118### Sample Size119| Variant | Target | Actual | % of target |120|---------|--------|--------|-------------|121| Control | X | Y | Z% |122| Variant | X | Y | Z% |123124### Primary Metric: [Metric Name]125| Variant | Value | 95% CI | vs. Control |126|---------|-------|--------|-------------|127| Control | X% | [X%, Y%] | — |128| Variant | X% | [X%, Y%] | +X% |129130**Statistical significance**: p = X.XX (95% = sig / not sig)131**Practical significance**: [Is this lift meaningful for the business?]132133### Secondary Metrics134135| Metric | Control | Variant | Change | Significant? |136|--------|---------|---------|--------|--------------|137| [Metric 1] | X | Y | +Z% | Yes/No |138| [Metric 2] | X | Y | +Z% | Yes/No |139140### Guardrail Metrics141142| Metric | Control | Variant | Change | Concern? |143|--------|---------|---------|--------|----------|144| [Metric 1] | X | Y | +Z% | Yes/No |145146### Segment Analysis147148**Mobile vs. Desktop**149| Segment | Control | Variant | Lift |150|---------|---------|---------|------|151| Mobile | X% | Y% | +Z% |152| Desktop | X% | Y% | +Z% |153154**New vs. Returning**155| Segment | Control | Variant | Lift |156|---------|---------|---------|------|157| New | X% | Y% | +Z% |158| Returning | X% | Y% | +Z% |159160## Interpretation161162### What happened?163[Explanation of results in plain language]164165### Why do we think this happened?166[Analysis and reasoning]167168### Caveats169[Any limitations, external factors, or concerns]170171## Decision172173**Winner**: [Control / Variant]174175**Action**: [Implement variant / Keep control / Re-test]176177**Timeline**: [When changes will be implemented]178179## Learnings180181### What we learned182- [Key insight 1]183- [Key insight 2]184185### What to test next186- [Follow-up test idea 1]187- [Follow-up test idea 2]188189### Impact190- **Projected lift**: [X% improvement in Y metric]191- **Business impact**: [Revenue, conversions, etc.]192```193194---195196## Test Repository Entry Template197198For tracking all tests in a central location:199200```markdown201| Test ID | Name | Page | Dates | Primary Metric | Result | Lift | Link |202|---------|------|------|-------|----------------|--------|------|------|203| 001 | Hero headline test | Homepage | 1/1-1/15 | CTR | Winner | +12% | [Link] |204| 002 | Pricing table layout | Pricing | 1/10-1/31 | Plan selection | Loser | -5% | [Link] |205| 003 | Signup form fields | Signup | 2/1-2/14 | Completion | Inconclusive | +2% | [Link] |206```207208---209210## Quick Test Brief Template211212For simple tests that don't need full documentation:213214```markdown215## [Test Name]216217**What**: [One sentence description]218**Why**: [One sentence hypothesis]219**Metric**: [Primary metric]220**Duration**: [X weeks]221**Result**: [TBD / Winner / Loser / Inconclusive]222**Learnings**: [Key takeaway]223```224225---226227## Stakeholder Update Template228229```markdown230## A/B Test Update: [Name]231232**Status**: Running / Complete233**Days remaining**: X (or complete)234**Current sample**: X% of target235236### Preliminary observations237[What we're seeing - without making decisions yet]238239### Next steps240[What happens next]241242### Timeline243- [Date]: Analysis complete244- [Date]: Decision and recommendation245- [Date]: Implementation (if winner)246```247248---249250## Experiment Prioritization Scorecard251252For deciding which tests to run:253254| Factor | Weight | Test A | Test B | Test C |255|--------|--------|--------|--------|--------|256| Potential impact | 30% | | | |257| Confidence in hypothesis | 25% | | | |258| Ease of implementation | 20% | | | |259| Risk if wrong | 15% | | | |260| Strategic alignment | 10% | | | |261| **Total** | | | | |262263Scoring: 1-5 (5 = best)264265---266267## Hypothesis Bank Template268269For collecting test ideas:270271```markdown272| ID | Page/Area | Observation | Hypothesis | Potential Impact | Status |273|----|-----------|-------------|------------|------------------|--------|274| H1 | Homepage | Low scroll depth | Shorter hero will increase scroll | High | Testing |275| H2 | Pricing | Users compare plans | Comparison table will help | Medium | Backlog |276| H3 | Signup | Drop-off at email | Social login will increase completion | Medium | Backlog |277```278