Too Long; Didn't Read
Before they gobbled up headlines everywhere, large language models ingested truly staggering amounts of training data. That data didn’t emerge from the ether: some of it came from other people’s creativity and work. So it’s unsurprising that copyright concerns are front and center in the early days of this technological boom. We’ve been here before.
@TheMarkup
The Markup
Nonprofit organization dedicated to data-driven tech accountability journalism & privacy protection.