| Sample quality | Is the sample complete enough to inspect? | Entries, exits, skips, misses, exclusions, and dates are recorded. | Only selected winners or recent trades are visible. |
| Rule stability | Did the setup, prompt, and risk rule stay stable? | Version labels show what was tested and when it changed. | The sample blends several rule versions. |
| Risk behavior | Did paper size, drawdown, and exposure stay inside plan? | Losses, pauses, and size limits are documented before the result. | Paper gains depend on breaking the risk rule. |
| Execution gap | Were simulated fills reviewed against possible live slippage? | The review names fill, fee, latency, and liquidity assumptions. | The paper result assumes perfect execution. |
| Outlier dependency | Does one winner or one market regime explain most of the result? | Raw and normalized views are both shown. | One unusual trade drives the conclusion. |
| Behavior review | Were FOMO, revenge, hesitation, and rule breaks tagged? | Behavior tags connect mistakes to the next paper test. | The journal ignores emotional or process drift. |
| Outside risk | Are real-world risks explicitly named? | The review includes fees, taxes, liquidity, custody, and capital stress. | The paper sample is presented as proof of live readiness. |