Synthetic Data
← Back to Data Management for ML
Artificially generated training data that mimics the properties of real data. Used when real data is scarce, expensive to label, or privacy-sensitive. Generated via simulation, rule-based methods, or generative models.
Related
- Data Labeling (alternative to synthetic)
- GANs (can generate synthetic data)