Suchir Balaji on When does generative AI qualify for fair use?:
Model developers like OpenAI and Google have also signed many data licensing agreements to train their models on copyrighted data: for example with Stack Overflow, Reddit, The Associated Press, News Corp, etc. It’s unclear why these agreements would be signed if training on this data was “fair use”, but that’s besides the point.
Suchir Balaji was found dead in his home from apparent suicide earlier this week. I don’t want to lend credence to the air of murder mystery mystic by some news blogs. I’m starting to think these types of narratives only help corporations sell more widgets—if it’s worth killing over, it must be really good!
There’s only one post on Balaji’s blog, and it’s about fair use and ChatGPT. I think it’s worth the read.
I hope you found peace, Suchir Balaji.