GitHub: We going to train on your data after all (www.theregister.com)
from cm0002@europe.pub to programming@programming.dev on 27 Mar 02:57
https://europe.pub/post/10777420

#programming

threaded - newest

MadMadBunny@lemmy.ca on 27 Mar 03:39 next collapse

lemmy.world/post/44803616

atzanteol@sh.itjust.works on 27 Mar 04:19 next collapse

It’s not GitHub. It’s Microsoft. Never forget that.

D1re_W0lf@piefed.social on 27 Mar 06:39 collapse
  • MicrosoftHub?
  • GitSoft?
  • MicroHub?

🤔

bugfest@lemmy.world on 27 Mar 07:45 next collapse

SlopHub

PodPerson@lemmy.zip on 27 Mar 17:00 collapse

Compuglobalhypermeganet.

garbage_world@lemmy.world on 27 Mar 05:20 next collapse

Oh no, they will improve the service with public data, how horrible

ugo@feddit.it on 27 Mar 05:37 next collapse

No, they will be using user’s copyrighted work, including that which is in a “private” repository, to extract value for themselves.

Lojcs@piefed.social on 27 Mar 06:05 next collapse

If you use a public ai agent on it your repo isn’t really ‘private’

Venat0r@lemmy.world on 27 Mar 11:28 collapse

Also the AI is trained on FOSS code but then used to make closed code that ignores the open licenses of the FOSS code that it copied.

MousePotatoDoesStuff@lemmy.world on 27 Mar 12:58 collapse

> Microslop

> improve

garbage_world@lemmy.world on 27 Mar 14:03 collapse

The amout and quality of training data correlates with quality of AI models.

theywanttocontrolyou@lemmy.world on 27 Mar 06:35 next collapse

Nothing intelligent will come of assimilating my code.

groucho@lemmy.sdf.org on 27 Mar 17:31 collapse

I think the only thing I have on my github account is an arduino light blinky thing I wrote at 4 AM while crossfaded because a friend needed a glowy wizard staff for a party. It can only make the model worse.

MonkCanatella@sh.itjust.works on 27 Mar 15:47 next collapse

I would bet my life savings they already trained on your data

Bieren@lemmy.today on 27 Mar 16:02 collapse

Right. My theory is that for all of the AI companies that have that don’t use my data option, it’s just an option on the screen. It doesn’t actually do anything. They still use your data.

Shin@piefed.social on 27 Mar 11:01 collapse

Stupid, even naive question. What about AGPL code used in this training? Would that means that the output is also AGPL?

PlexSheep@infosec.pub on 27 Mar 16:25 collapse

That legal question is not yet clearly answered. I think absolutely yes, but the Megacorps and pro-ai people don’t seem to care.