"Scaling law has to be tied to data quality. A model that understands the underlying laws may need only 200,000 data points to match what another model gets from a million."
— Biwei Huang, Founder and CEO of Aether AI and Assistant Professor at UC San Diego (十字路口Crossing)
If your dataset only teaches correlations, 5x more examples may buy confidence in the wrong shortcut. The hard operator question is which examples expose the mechanism your model must reuse when the environment changes.
🎙️ app.podwise.ai/dashboard/epi…
Link
Podwise - Podcasts, Notes, AI
Podcast summaries, transcripts, mind maps, notes and translations - all in one place. The premier learning app for podcast lovers.
app.podwise.ai