Exposed: A Blog Comment Spammer's Source Template
I've been getting a LOT of Blog Comment Spam lately, just in the at two weeks. I run all my comments through the Akismet Service, and I pay for it. However, this particular flavor of spam has been making it through consistently. It has a pattern, through, and I'd been trying to figure it out when this LARGE comment showed up.
Apparently while they were messing about trying to spam me, they posted their entire source template.
I'm embedding it below as a Gist, rather than copy/pasting it into my blog engine. It's so spammy, I'd hate to get delisted from Google looking rather like a splog.
Note the comments for the Gist as well.
One fellow says
"I used to do comment spam and this is not the most advanced one."
Really? Does one put Comment Spammer on their resume?
Another comment says that we're hating on spammers. We should embrace them because:
"Sure for the 1% of super popular blogs out there this might be unnecessary, but in a world filled with bloggers blogging blogs most people never read, the fake recognition and pleasantry might be just what these writers need."
I'm pretty sure that fake comment spam isn't as emotionally uplifting as you think.
Start scrolling down! If you are viewing this in an RSS reader, you MAY need to visit this post directly to see it.
Your comments, Dear Reader? Cue spam comment-related jokes...now.
Sponsor: The Windows Azure Developer Challenge is on. Complete 5 programming challenges for a chance at spot prizes, Stage prizes and the Grand Prize. Over $16,000 is up for grabs with 65 chances to win!
About Scott
Scott Hanselman is a former professor, former Chief Architect in finance, now speaker, consultant, father, diabetic, and Microsoft employee. He is a failed stand-up comic, a cornrower, and a book author.
About Newsletter
The other day I had a spammer who had actually read the article and did comment on the actual content (although it was a quite superficial comment). Anyhow, looking at the link and the user it was obviously spam.
Lately you have to be quite vigilant to detect spam if they are really going to start commenting on content.
Oh well, nobody says spammers are smart.
There are many ways to reduce dimentions of this problem but the fact is that it should define a connection between author article and comment and reader. If connection is not strong then flag it.
As i was wondering there was no security/moderation to Comments section of your blog. As you said...
There's lots of techniques like Captcha, OpenID, etc, but the spammers ALWAYS find a way around it. Always. I've been blogging for over 10 years and it ebbs and flows like the ocean. We're at high tide right now.Yeah i agree, but what about using some real good commenting systems like Disqus, Intense Debate or the new way of embedding Facebook Commenting System?
Hope this will make the visitors transparent and also will reduce our stress to a open platform and their authentication ;-)
>re:Captcha, OpenId, etc..
Blogger recently added support for Google+ comments in their posts - maybe requiring an account with a verified "real name" could be sufficient to deter bad behavior.
I always think of when the mechanical turk came out in the discussion of spam / spammers. No matter what you do, a spammer can pay people to spam if it's sufficiently inexpensive - or, in cases like this, author templates that get plugged into a bot.
Does one put Comment Spammer on their resume?
Apparently, yes. I did have a resume sent to me from a guy who's crowning achievement was writing and selling software to spam forums. No attempt to hide it either, he seemed very proud of the ingenuity involved in hijacking other peoples communities for profit.
Needless (I hope) to say... no hire.
appreciate you {writing this|penning this} {article|post|write-up}. I {saw|have seen} {many|several|lots|a lot} of these on my own blog.
What I really need is a way to interest you in doing a blog post about my personal open source project at {
Hmmm... Is that too spammy?
Thanks for blogging.
György
But still send them a 200 so that the software thinks they have succeeded. I need to change it about once a year. You can even see when they are in the process of changing their scripts.
// Ryan
i really wonder why the spammers bother ... there must be betters ways to get SEO such as writing their own articles (AFAIK some to many do just that).
my point: the prolific Scottha's of the world with significant and frequent content have garnered 99% control via their use prophylactic plugins like Askimet ... for the less significant authors, simply by turning on full moderation, we can likely create an environment where spamming articles becomes not worthwhile because there will be virtually nowhere left for such spam in the blogoshere. or, maybe not?
I like @chrissie1's suggestion, changing the ID every now and again sounds simple enough. It wont help anything that's using a scraper (or something more advanced like phantomjs to load the page) but it's a start if you are running an off the shelf system.
I also had some bad time dealing with spam, I have been forced to disable comments in my blog, in order to not have to deal with it.
I have just re-enabled comments (Disqus), and so far so good, I am seeying not spam as months ago. Maybe spammers have not yet discovered that comments are enabled again.
There is no good type of spam. Spam is just that Spam/Trash
My previous web host seemed to think that setting the spf record information on a domain constituted as spam protection and any spam after that was clearly email I had asked for.
After various lengthy debates of this nature I conceeded that these people are beyond help and decided to get my domain off their infrastructure.
Recently a friend of mine has been having this same problem and upon mentioning this issue to said host their suggestion was "ok we will setup the spf record for domain" ... i'd love to know what spf records have to do with blog posts?
Spam has been a major problem since practically the birth of the internet and I don't think it's likely to be possible to remove spam without making the "originating account" at the ISP of the sender accountable for every bit of traffic they send to everyone else online.
The issue there of course is that this requires some level of "rating" an account at an ISP, I'm suprised there isn't a service that does this type of thing out there somewhere but the issue there of course is that anyone can basically "anonnymously" setup a website / server on a cloud somewhere and spam from that.
Internet standards need to fundamentally change before the issue will go away.
In the meantime it's up to the programmers to try and filter the good from the bad which screams "fix the symptom not the problem" to me.
Oh well :(
Customer: "Well, can I have spam instead of baked beans then?"
But seriously folks. I just love the justifications of the spammers: "the fake recognition and pleasantry might be just what these writers need."
Wow. That's like stabbing someone in the back and then saying you were helping them get over their iron deficiency.
Keep up the good work (when is the next Developers Life? ;) )
Really? Does one put Comment Spammer on their resume?
Probably yes. People sitting at the places like Nigeria may have 'Spammer' experience in their resume. Because this is what they might be being paid for. Spammer's don't do this for fun. Somebody is paying them.
I stopped blogging there because of the spamming.
Regards,
Jalpesh
So have you tried turkey bacon?
Tastes like real bacon for the first day or so- then it tastes like turkey...
This is not the same as an earlier round in which the spammers just took other comments on the post and reposted them with their spammy links.
The internet has changed from the wild west days of usenet flamewars. I get that there are times where anonymity is required (e.g., whistleblowing) but think that anonymity should be the exception - real attribution should be the norm.
ps-somewhat surreal to read through that template. It's like looking into the dark underbelly of social media.
I'm really surprised you get as much comment spam, considering your blog has no follow tags in it.... it still does nothing for their rankings.
While Google officially doesn't consider "nofollow" links, there's lots of discussion on SEO forums that Google may use them to determine which sites are spamvertised. If a site has a high percentage of inbound links marked as "nofollow" it could be argued these are likely to be a result of comment spam or similar, and thus the target site could actually receive lower ranking as a result.
If this theory is correct, then perversely you're helping to defeat spammers by publishing their spammy comments!
Which also makes me wonder if I shouldn't have put my home page link on this post?
Personally, the blackhat community is actually one of the most interesting ones in regards to development of technologies and strategies to counter the ever evolving landscape of anti-spam mechanisms. Seeing what these people do is quite impressive if you remove the moral aspects of what they are actually doing and focus merely on the business case scenario.
The example template you have is rather a low level spinning, swapping out synonyms and uses no nested spinning techniques for sentences and paragraphs. As a template itself, there isn't much work put into it that TBS (The Best Spinner) or SpinnerChiefII could do instantaneously. The interesting part comes in is what they are achieving in scraping and n-gram analysis.
Software like WordAI (which is web based, google the youtube vid of it) can automatically spin comments and with API use of other software which gathers comments from scraped sources (other blogs or articles) that are mathematically similar to yours in regards to language, technology, verbiage and even tonality you can create unique looking comments purely on autopilot with little costs to the person running the software.
What fascinates me is the mechanisms that evolve naturally within this blackhat economy to counter Akismet, Mollom, Honeypot, CAPTCHA and other mechanisms involved. Capthcha services like DeathByCaptcha streamlines the captcha problem by sending it to a microworker overseas who manually input the CAPTCHA for fractions of a cent (which adds up over time over long campaigns)combining the efforts of manual processing with automated software.
It provides interesting challenges for people fighting spam, learning how to get rid of footprints (having the words "Leave a comment" on a blog post is open season for scrapers and list builders), creating sophisticated Captcha which do not deter from the User Experience (knowing that most Captcha solving software converts the Captcha to greyscale to help with OCR and manual input for readability, so simply tell the user not to input the letter in Red for example), embedding footprint type text and form labels into Data URIs so the crawlers who read like robots cannot decipher it yet is still presentable to the user (if your email appears at all on a website, you should put it into a Data URI formate so you don't receive email spam).
Once you understand and respect the methodology of the webspam industry, you can devise methods to combat it and reduce spam by several orders of magnitude by targeting specific areas (removing footprints for scrapers, adding a invisible form for the honeypot method to avoid comment posting software) until you are only left with Manual VAs you research and craft their comments which is a fraction of a fraction of the web spam available.
Hi there, i read your blog occasionally and i own a similar one and i was just wondering if you get a lot of spam comments? If so how do you prevent it, any plugin or anything you can advise? I get so much lately it's driving me insane so any help is very much appreciated.
I thought maybe it was legit (and intelligent considering the correct spelling of "advise"), but I followed the link back to a blog with little content and nothing but spam comments with links. Akismet didn't catch this one.
So I did a Google search (which I do whenever I'm suspicious) of the first two lines and your blog came up first.
I especially love when spammers say how much they love reading my articles when my site is audio-driven.
Idiots!
Used in pretty much any SEO Tool.
{hey|hi|hello} there {human|alien}
Would produce various comments like:
hey there human
hey there alien
hi there human
hi there alien
hello there human
hello there alien
etc
Easier for spammers to get "unique" comments rather than create 500+ variations.
oh my god, when I was reading some testimonies on the web, I thought they were just some sort of lies been fabricated by some stupid people, not until I decided to give one of the great prophet called prophet Ozanga the chance to help me after lot of testimony to his credit and to my heart shocking breath, he surprised me by actually bringing back my lover within just seven days as he promised.
now I believed, some of these testimonies are real when you actually meet the right priest to help. for help you can contact him by yourself with this mail :prophetozanga@yahoo.com
Some of the comments I read are ridiculous, surely the idea is to get the link from the comments to stick?
I mean, I know it mostly software like scrapebox spinning nonsense but it is so frustrating waking up logging into your blog and seeing 20k comments awaiting moderation.
Its incredible not one human comment ..
I am amazed web owners haven't figured out just to pay some one in India $ 5 an hour to write comments. they probably could post 20 per hour to generate 500 -800 a week of good back links . one year 40,000 links !
My site is B to B for real physical merchandise so I need to find blogs on marketing, trade shows and work places so that my links are relevant. Hope google cares !
Thanks!
Comments are closed.