Smarter Sampling of Expensive Data
This project demonstrates how to reduce expensive labeling and testing in engineering. By comparing two Gaussian-process strategies across 13 benchmarks, it identifies when targeted sampling accelerates model building by 20–56% and when exploration is safer. It provides a practical rulebook for efficient, budget-limited active learning in data-modeling tasks.
Supervisor:
Prof. Markus Lange-Hegermann