A peek into Reddit's anti-spam internals

(lyra.horse)

104 points | by OuterVale 3 days ago

9 comments

leviathant 9 hours ago
I cannot emphasize strongly enough just how deeply pervasive the spam is at Reddit. I'm a mod at the ecommerce subreddit, and I've only caught some of the AI-powered marketing operations because in one particular campaign that was making fictional claims about things I had direct knowledge about. Once I looked into the post history, and started to untangle the web of accounts that formed a self-supporting community of posters and commenters, just subtle enough to get genuine engagement, but specific enough to make the kind of posts that the LLMs will siphon up and regurgitate.
It's not just shady little operations. I'm speaking specifically about the SCAYLE ecommerce platform, in my example. They've got Zalando money to play with, and as a German platform that's trying to break into the North American market, it appears they've made a bet on indirectly spamming the LLMs with fictional tales of commerce replatforming horror stories. At first, they're some of the more interesting topics in a sea of really useless posts, with contributions from people who seem to have some real experience with enterprise ecommerce. I was a little suspicious, but these interaction campaigns were spread out enough that I didn't put the pieces together for months. Of course, to go back on what I said at the top of the paragraph, maybe SCAYLE is shady, and I'm giving them too much credit.
The good news is, some of the AI powered tools that mods have access to are getting better at surfacing suspicious patterns of behavior. However, I still find I have to manually address these campaigns.
In the cat-and-mouse game with these marketing jerks, I'm always reluctant to surface what's working and what isn't. This is an interesting post, but it's going to make things worse. Ah well.
[-]
- jamesfinlayson 5 hours ago
  I remember reading years ago about some corrupt mod in one of the image subreddits - he or his friend had started some image hosting site and had six different Reddit accounts that he used to upvote posts that used his site and downvote all other posts. It took people a long while to notice what he was up to.
randysalami 3 days ago
Reddit must have some mechanism specifically for non-spamming bots that isn’t covered in this article. I wonder how it works. I imagine the mechanisms are more complex and opaque than anti-spam (with various levels being exposed to the hierarchies of Reddit and government backdoors). These days, I’ve noticed an almost forcing-function that operates to put the minimum spin needed on posts and comments to turn signal to noise. It seems smart enough to not only generate noisy comments but create comments to amplify existing organic noisy comments. I’m sure these systems are decentralized, emergent, and split across numerous nation-states and actors. I’m also fairly certain what we have now is a tenuous balance that has emerged from all these actors and Reddit policing actions as well.
I imagine Reddit has a high-level of insight into this and a certain level of permissibility it grants, both to inflate user counts and to steer public discourse and insight into less productive mean (or productive to certain interest groups at the expense of the people). I think is also an effect that Reddit has become more global and consensus of the USA people is very antagonistic to the consensus of the people of the world so that doesn’t help (+ access to LLMs to make English writing no longer a barrier to entry).
[-]
- asdff 11 hours ago
  There is some sort of wink wink nudge nudge agreement going on with certain spam accounts. You will see them post article spam with hidden history, and if you look up their posts either via google or any other reddit crawling tool, they are posting all over various subreddits that same article maybe dozens of times. If they comment it is really basic and formulaic and found all over their post histories as well.
  I feel like reddit enjoys it as these posts (often political in some way) usually get good engagement which is in line with reddits own incentives for courting advertiser money.
  [-]
  - hightrix 6 hours ago
    This practice was perfected by gallowboob years ago.
    He would spam a link/pic/post and monitor, if the post didn’t gain traction, he would delete and post again as to not trigger protections against the same link being posted.
    He was a cancer on Reddit and I’m sure he still exists under different monikers. But now there are 100s of gallowboobs.
ingvay7 3 days ago
Neat rabbit hole. Reminds me of having to deal with email spam - it was a similar deal with rule-based filters, ML scores, domain bans,IP filtering, browse fingerprinting etc and mishmash of ever evolving scripts surviving org and personnel changes. Glad i dont deal with it anymore as the frontier seems to be 2 fronts now with human and agentic spam.
pedalpete 8 hours ago
Based on the current status of my shadowbanned account (I suspect a competitor in our space retaliating), it looks like `banall` only flags posts from the last 6 years.
Of course, nobody can view my profile anymore anyway (I'm waiting on appeal), but on my account, only posts from the last 6 years have the "Sorry this post was removed by reddit filters" message.
Oarch 11 hours ago
Can't you just append ".json" to the end of any Reddit link and read all sorts of these fields?
[-]
- rebane2001 11 hours ago
  No, the API will not return the admin-only removal reason. The code path that causes this is in the post.
Terr_ 3 days ago
Damn, maybe I can finally find out why my 10+ year account was globally (and retroactively) shadowbanned, even though the appeal was allegedly granted.
In the past, those post removals didn't even exist in the moderation log, so perhaps a reason could give me a clue... On the other hand, I'm taking a kind of emotional damage just remembering.
[-]
- qingcharles 10 hours ago
  Is it still shadowbanned? If you go to reddit.com/appeal what does it say?
montoyaig 11 hours ago
[flagged]
busymom0 12 hours ago
I swear I read this article 2 or 3 days ago and the comments on that post were also same as this post. Am I missing something here?
[-]
- yorwba 12 hours ago
  You're missing the second-chance pool https://news.ycombinator.com/pool which allows certain posts to reappear as if they were new.
  [-]
  - khurs 24 minutes ago
    That doesn't show for me unless I click the link.
    How does it work?
  - asdff 11 hours ago
    Once again expressing my opinion that this is the worst anti feature of the site. Threads are like commenting in the void because most people are not going to be looking for replies to comments they made days or weeks ago. I see the true datestamp of the comment I replied upon upthread was not 3 hours ago, but 3 days ago.
    The fact that they change the timestamp is also very stupid (yes you can hover and still return the datestamp, but this is by definition a dark pattern). These posts should preserve the timestamp vs masking it and even should be flagged as [Second Chance] in the title imo.
- rebane2001 12 hours ago
  wait what the heck yeah, this is the same post as from a few days ago, i guess the comment and post timestamps got glitched??
  [-]
  - nancyminusone 12 hours ago
    That'a thing HN's second chance pool does. If you mouseover the "3 hours ago" or whatever on a comment, it will tell you the actual post timestamp.
    [-]
    - mvdtnz 11 hours ago
      That is insanely bad borderline hostile UI.
      [-]
      - dang 10 hours ago
        Before we started doing it this way, the threads would fill up with far more off-topic comments ("why is this 3 day old post on the frontpage?" and so on). That was a much bigger problem. Relativizing the timestamps to reflect the re-up time was our attempt to address that. Overall, it has worked ok—that is, while it does still lead to offtopic confusion and complaints in the threads (boo) there is far less of it than there was before (yay).
        I'm open to suggestions of how to do it better! But you also need to consider the cost of adding explicit details to the UI. If we did that every time something like this came up, HN would have become an unreadable mess a long time ago.
        [-]
        argee 4 hours ago
        I guarantee you've thought more about this than I have, but the first impression I had of the "second chance" pool was that it would essentially be a repost of the top-level post and not the comment threads. I think part of the reason people bring it up is because they see the same post PLUS the same comments with new timestamps and feel disoriented.
  - qingcharles 10 hours ago
    There have been almost zero visible changes to the HN codebase in over a decade, but the one thing I would love to see them add is a little flag on the heading of the post to say it is 2nd Chance Pool to avoid all these comments every time this happens and everyone is confused :)
    [-]
    - igjjigjk 3 hours ago
      [dead]
    - akkkan 8 hours ago
      [flagged]
felooboolooomba 11 hours ago
My friend got shadowbanned for posting a youtube link, part of a interview with Sascha Riley (the one where the explains the thing with the tent peg):
https://www.youtube.com/watch?v=84PHEMLab6g&t=2807s