Dan Romero pfp
Dan Romero
@dwr.eth
Wonder if ChatGPT will be the last major model to be trained on the open web? robots.txt specifically disallowing crawling from LLMs unless getting paid for the data?
11 replies
0 recast
0 reaction

William Saar pfp
William Saar
@saarw.eth
If AIs can generate enough value, it might be worth paying armies of Mechanical Turk-style workers to manually visit and rewrite web sites for copyright-approved training Facts and ideas can't be copyrighted, only particular expression
1 reply
0 recast
0 reaction

Travis A. Everett pfp
Travis A. Everett
@abathur
This would double down on the risk LLMs are laundering misinfo, no? I also expect avoiding liability is more complex than just paying someone with a pulse to thesaurus words. I think a jury would agree the work was copied if there's deep, undeniable structural similarity.
1 reply
0 recast
0 reaction