Dataset Strategy: Diversity, Labeling Rules, and Leakage Prevention
Dataset Strategy: Diversity, Labeling Rules, and Leakage Prevention
Diversity
Cover lighting, angles, backgrounds, devices, seasons, and rare edge cases.
Labeling guidelines
Write rules, train annotators, audit samples weekly, and resolve disagreements.
Leakage prevention
Split by scene/user/time to avoid near-duplicate leakage.

