Judge dismisses majority of GitHub Copilot copyright claims

pnutzh4x0r@lemmy.ndlug.org · 18 days ago

rebelsimile@sh.itjust.works · edited 18 days ago

If you make a byte-for-byte copy of something, why would you think copyright would not apply? If you listened to the dialogue of a Marvel movie, wrote it down line for line, and it so happened that the stage directions you wrote were identical to those in the movie, congrats, you’ve worked your way into a direct copy of something that’s under copyright. If you draw three circles by hand in exactly the right way, you might get a Mouse coming after you. If you digitally render those circles in Photoshop, same idea (or concept; yes, I know one is a trademark issue).

ugo@feddit.it · 18 days ago

Looks to me like the ruling is saying that the output of a model trained on copyrighted data is not copyrighted in itself.

By that logic, if I train a model on Marvel movies and get something that is exactly the same as an existing movie, that output is not copyrighted.

It’s a stretch, for sure, and the judge did say that he didn’t consider the output to be similar enough to the source copyrighted material, but it’s unclear what “close enough” is.

What if my model is trained on Star Wars and outputs a story that is novel, with different characters with different voices? Is that not copyrighted then, despite the model being trained exclusively on copyrighted data?