Training data is a critical input for frontier AI models. The competitive landscape depends on data quality, diversity, licensing, and regulatory frameworks governing data collection and use.