AI Turns to Corporate Data

Updated: 2025.10.02 (19 days ago), 1 source
Goldman Sachs’ data chief says the open web is 'already' exhausted as a source of training data for large models, so builders are pivoting to synthetic data and proprietary enterprise datasets. He argues there’s still 'a lot of juice' in corporate data, but only if firms can contextualize and normalize it well. If proprietary data becomes the key AI input, competition, privacy, and antitrust policy will hinge on who controls these datasets and who can share them safely.

Sources

AI Has Already Run Out of Training Data, Goldman's Data Chief Says
msmash 2025.10.02 100% relevant
Neema Raphael on Goldman’s podcast: 'We’ve already run out of data,' citing DeepSeek’s use of model outputs and the need to mine enterprise data.