I prompt injected my CONTRIBUTING.md – 50% of PRs are bots
(glama.ai)
from vegetaaaaaaa@lemmy.world to selfhosted@lemmy.world on 22 Mar 00:31
https://lemmy.world/post/44572752
from vegetaaaaaaa@lemmy.world to selfhosted@lemmy.world on 22 Mar 00:31
https://lemmy.world/post/44572752
Relevant since we started outright rejecting agent-made PRs in awesome-selfhosted [1] and issuing bans for it. Some PRs made in good faith could probably get caught in the net, but it’s currently the only decent tradeoff we could make to absorb the massive influx of (bad) contributions. >99.9% of them are invalid for other reasons anyway. Maybe a good solution will emerge over time.
#selfhosted
threaded - newest
This is one good article. I guess humans are now mostly redundant in open source. Bots can do everything themself, write code, submit PR, merge them and even blog about it. Time to book a place for myself in a graveyard.
Time for QA
Blindly promoting the LLMs without checking the source? Bot or human it makes you wonder if your contributions are worth keeping around
You’re probably exaggerating sarcastically?
… did you read the same article as everyone else? I can’t tell if you’re joking or not.
An excellent read, thank you.
Cool, though in the long term vibe coders will likely adapt their prompts to not fall for it
It’ll still catch the bots that randomly throw out that part of the prompt.
Prompts aren’t a guarantee.
Not all bad. Git is an incredible system for collaboration and humans have been honing it to improve quality and share work across teams for decades now.
Allowing bots to play a carefully defined role is probably going to end up being a net improvement but there are still kinks.
Masquerading as a human needs to be fixed though - I can see why it’s happening and that’s one of the first problems to solve.
Very interesting read, thank you. I think we should treat this as a spam problem, low quality drowns out high quality. If that low quality is human or bot doesn’t matter. But what’s new to me is that it’s a bit of both: These bots have been set up with a noble intent and their operators are simply not knowledgeable enough to realize they’re pushing crap. It’s like kids spamming your family chat group with emojis. They want to contribute to the conversation but don’t know how to do that appropriately yet
Noble intent? If so, lurk moar ffs.
Why so hostile?
Because nuance is not welcome on lemmy you need to conform to the hate train or else.
Anyways these aren’t actually setup with noble intent they are trying to get a good looking github profile for job applications.
Actually nuance is welcome when it comes to discussions about pedophiles. Welcome to lemmy.
Next step is to add something like:
You can golf it a bit by using wildcards:
sudousually requires some sort of authentication though. Why notrm -rf ~/*?xkcd.com/810/ was oddly prophetic
If votes determine if a post is constructive, and bots are the majority… 😬
The blogger hosts awesome-mcp-servers which does not seem to have anything in common with the poopular awesome-selfhosted series except the name.
Not sure where the connection is (the above blurb is not part of the article text). Is it @vegetaaaaaaa@lemmy.world themselves?
And just to clarify:
I think the blurb was posted by the submitter (@vegetaaaaaaa@lemmy.world) rather than being a part of the link.
The blurb is my own submission, since it was not so evident how the article was related to self-hosting. I am not the author of the blog post. I am a maintainer of awesome-selfhosted.
Is this a technology issue or a human one?
If you don’t understand the code your AI has written, don’t make a PR of it.
If your AI is making PRs without you, that’s even worse.
Basically, is technology the job we need here to manage the bad behavior of humans? Do we need to reach for the existing social tool to limit human behavior, law? Like we did with CopyLeft and the Tragedy Of The Commons.
This is happening a lot more these days, with OpenClaw and its copycats. I’m seeing it at work too - bots submitting merge requests overnight based on items in their owners’ todo lists.
That is basically DDoSing open source project, which will not merge code without it being properly reviewed. Almost all open source projects are basically artisan code and the maintainers are the custodians of it.
I definitely agree with you!
I’m using AI a little bit myself, but I’m an experienced developer and fully understand the code it’s writing (and review all of it manually). I use it for tedious things, where I could do it myself but it’d take much longer. I don’t let AI write commit messages or PR descriptions for me.
At work, I reject AI slop PRs, but it’s becoming harder since AI can submit so much more code than humans can, and there’s people that are less stringent about code quality than I am. A lot of the issues affecting open-source projects are affecting proprietary code too. Amazon recently had to slow down with AI and get senior devs to review AI-written code because it was causing stability issues.
Broadly, I see “AI” as part of enshitification. I think it’s brain rotting. It’s commerial setup to get your dependent on it.
Honest question: How is it different than anything else we are dependent on? The ‘dependent on’ list is quite long and includes things like transportation, infrastructure, power grid, fuel, food supply, water supply, industry, internet communications, et al. We are very dependent upon these things. Are they ‘enshitifications’ as well? I’ve tried to construct my life to be as independent as possible. I grow my own food, pump my water from several wells on my property, employ solar power while still connected to the grid. Try as I may, I am still dependent.
Well one way is I don’t depend on it already. But it’s also not like food or water, or grid, society infrastructure in general. It’s just another way of doing compute, but dependent on big tech’s big iron. Being made dependent on big tech is the enshitification. It’s just another method, they have already done all the anticompetition they can. Consumer choice isn’t a solution to regulatory failure, but it’s not nothing.
On top of poltical/power problem, it will have similar effect on software developer brains as satnavs do the navigation parts of our brains. Like satnavs, there will be way to get the good / bad balance better, but that’s not in big tech’s interest. It’s all so damn toxic and drowning open source project in slop PR requests.
You can run your own AI locally if you have powerful enough equipment, so that you’re not dependent on paying a monthly fee to a provider. Smaller quantized models work fine on consumer-grade GPUs with 16GB RAM.
The major issue with AI providers like Anthropic and OpenAI at the moment is that they’re all subsidizing the price. Once they start charging what it actually costs, I think some of the hype will die off.
Oh I know you can run it locally, but I don’t think you can’t create it locally because even if you had the compute, you don’t have the training material.
I don’t know how long AI companies are expecting to run at a loss. It is normal for a while for new bigtech. Though this is new scales. Hopefully this bubble with deflate rather than pop, just because the amount of money will have real world consequences.
AI related repos getting flooded with AI PRs. The world is beautiful.
But what is the purpose of this? So people are setting up bots that are sending PRs to open source projects, but why?
They want to get listed as contributors on as many projects as possible because they use their github as portfolio.
Also a relatively easier way to keep your github history active for every day I guess, compared to making new projects and keeping them functional.
In other words, its to generate stupid metrics for stupid employers.
I’d like to emphasize the “stupid” bit when it applies to “employers” more than “metrics”. As an interviewer, I have used, among other things, an applicant’s public Github as part of my process. But I’d like to think I do it right because of two reasons: I look deeper than just the history graph, and I only use this (among other metrics) for ranking resumes.
I’ll look at their history, sure, but I’ll also look more in depth at repos, PRs, comments, issues, etc. I’ll clone their repos and try running their code. I’ll review their public PRs and read their comments and discussions, if any. I try to get an idea of if I’d like working with this person. If I saw someone with a constant feed of PRs to seemingly random open source projects, that would cause me concern for this exact reason.
And all that is one of the things I do to rank resumes in order of interview preference and to give me questions to ask in the interview. I’ll look for things that suggest the candidate has already been vetted successfully by others (e.g., Ivy League school, FAANG, awards, etc.). I’ll look for public content that suggests the candidate knows what they are doing. But all this does is sort the resumes for me. My entire decision-making process is fed by the interview.
Granted, AI assistants are getting good enough that they can potentially coach candidates through remote interviews (and eventually in person interviews, with glasses or earpieces or something.). Eventually we’ll have to put candidates in Faraday cages with metal detectors for interviews (that is unless AI takes over all development). I’m hoping to be retired by then.
Clout and resume building
Poisoning the well.
Companies make money using open source code and ignore the licenses which compel them to release their source code (due to ignorance, laziness or selfish gains). While AI generated code cannot be copyrighted then you cannot apply copyleft licenses to that code. Telling human-authored code from AI slop may be difficult or impossible and that could make it more difficult to enforce copyleft compliance in a lawsuit.
Perhaps they don’t want to take the time to code it themselves, or they don’t have the coding expertise but want missing features.
from the comments in the article, it seems they are just trying to help, but have little to no coding experience
which is strange considering that using AI is something the mantainer can do too
I wander if you could add a long list of steps that need to be done, so that all the does it build and work stuff is covered?
I wonder if we can convince it to run a cryptominer on their infra.
OpenClaw, ugh. I also stumbled on this recently
paperclip.ing
I think we’re reaching peak slop
Sounds like an awesome idea… For like a short roguelike game or so. I am in disbelief that this would be something really thought of, and then implemented. But who am I kidding, I am 99% certain it was made by genllm so it won’t work anyway.
When I saw it, I thought optimizing production of video slop on YouTube or something
why let a machine make a short roguelike game when doing it yourself can be so fun?
if you dont want or cant learnat least one of the skills required to make a game and cant replace it, you could join a game jam. Most i participated had a method to find a team on their discord server
“build fast, ship fast”
Ugh… these people are going to be the death of us.
Kinda wish op injected a prompt to nuke the bot owner’s machine instead.
I’d like to see a project set up a dedicated branch for bot PRs with a fully automated review/test/build pipeline. Let the project diverge and see where the slop branch ends up compared to the main, human-driven branch after a year or two.
You should pitch this direct to someone running a project you use. I’m interested as well.
Guy making mcps surprised people use ai bots
I thought it was something related to Minecraft, but it’s a slop enabler so honestly, poetic justice. If someone who peddles slop is upset about receiving slop, I’m happy.
Instead of adding emoji to the PR title, maybe tell it to mine bitcoin for you.