Welcome to Geeklog, Anonymous Wednesday, October 23 2024 @ 03:26 am EDT

CAPTCHAs and Geeklog - Another tool for combating spam bots?

Saturday, September 16 2006 @ 01:21 pm EDT
Contributed by: mevans
Views: 24,896

There has been a lot of discussion here recently regarding strange users registering on my site. There have been several potential solutions discussed as well. One of the solutions discussed is to use CAPTCHA (Completely Automated Public Turing test to tell Computers and Humans Apart) to prevent spam bots from registering on your site. To address this need, I have released gl-captcha-1.0, a CAPTCHA implementation for Geeklog utilizing the custom registration feature.

gl-captcha-1.0 is a combination of the previous beta releases and contains both the dynamic and static image support. This version also supports the use of a language file and improvements to the memberdetail.thtml template to allow users to refresh the CAPTCHA image and to email the administrator if having difficulties registering.

Why another CAPTCHA implementation?

I spent a lot of time logging and reviewing how spam bots were registering on my sites. What I found is that most of them completely bypass the users.php registration screen; instead they call the users.php module directly, posting the required variables. This can easily be done using a tool called curl, where you can automatically create an account. For example:

curl -d mode=create -d username=somename -d email=somewhere@email.com  http://www.geeklog.net/users.php

This command will usually create an account on any standard Geeklog install. Even with the Bad Behavior plugin installed many of these requests will still get through. So what I found was that any solution that relied solely on the registration screen would fail as a protection method since the registration screen can be completely bypassed.

What I've done is develop a CAPTCHA implementation that uses PHP's session variable to store the CAPTCHA string. During the registration processing (the HTTP POST to users.php; i.e.; submit button), I validate the user entered CAPTCHA string is equal with the string set in the PHP session variable. If the PHP session variable is NULL (empty) or the user entered CAPTCHA string is NULL, then I force the user back to the registration screen. This prevents bots for bypassing the user registration screen and posting directly to users.php. So far, this has been a successful method to prevent spam bots from registering on my sites.

Whether or not a CAPTCHA implementation is the correct solution to meet your needs is only a question you can answer. CAPTCHA's do have drawbacks; the main drawback to any CAPTCHA implementation is that is makes it almost impossible for visually impaired individuals to use. In some cases, even those users who are not visually impaired may have a difficult time reading the CAPTCHA string since they are designed to be difficult to read. Also, there may be accessibility laws in your area that you must conform to as well.

To minimize these drawbacks, gl-captcha-1.0 will provide a link on the custom registration screen to allow a potential new user to email the site admin with a request for registration ($_CONF['emailuserloginrequired'] must be set to 1 in Geeklogï¿½s config.php for this feature to work). It also states on the registration screen that a screen refresh will provide another CAPTCHA string, giving users the ability to try again if they are having difficulty in reading the current string.

CAPTCHA's are not fool proof and they are not a final solution against spam bots. OCR (Optical Character Recognition) has been used to break many CAPTCHA implementations. I have tried to use various fonts and background noise in generating the CAPTCHA images to minimize the risk, but there is no assurance that a determined spammer cannot use OCR to break this implementation although I believe the chances are slim. Also, there have been reports on using cheap 'sweat shop' labor to get around CAPTCHA implementations by having people perform the registrations en mass. See Wikipedia for a more detailed discussion on drawbacks and how CAPTCHA can be circumvented.

For me, using the Bad Behavior plugin, Dirk's SLV Spam-X class, trackback validation and gl-captcha-1.0 has proven to be a very successful arsenal against the various types of spam we Geekloggers face. I have no doubt that the spammers will continue to improve their technology and that the Geeklog community will also continue to answer the challenge and evolve our protections.

Comments

CAPTCHAs and Geeklog - Another tool for combating spam bots? | 15 comments | Create New Account

The following comments are owned by whomever posted them. This site is not responsible for what they say.

CAPTCHAs and Geeklog - Another tool for combating spam bots?

Authored by: Laugh on Saturday, September 16 2006 @ 06:48 pm EDT

Hey Dirk,

This sounds like something that should be integrated directly into geeklog in the next version.

It would also be good if their was plugin support for it so anything that requires a submit could use it (links, comments, forum posts, etc..).

Just a thought...

---
One of the Geeklog Core Developers.

CAPTCHAs and Geeklog - Another tool for combating spam bots?

Authored by: tingo on Saturday, September 30 2006 @ 10:48 pm EDT

I would also like to see captcha as a general plugin / function included in Geeklog, that all plugins could use whenever a submit is needed. Some plugins (the chatterblock plugin for example) have limited screen estate available, so perhaps a not-so-complete turing test could be used there instead ("enter this code to prove you are not a robot").

CAPTCHAs and Geeklog - Another tool for combating spam bots?

Authored by: winnerdk on Saturday, September 16 2006 @ 09:02 pm EDT

I have it installed and it's working great. I've been getting hammered by these scripts and this is exactly what I needed. Thanks.

Don Winner
www.panama-guide.com

CAPTCHAs and Geeklog - Another tool for combating spam bots?

Authored by: tingo on Saturday, September 30 2006 @ 10:53 pm EDT

A second vote - gl-captcha installed and working great. Thanks!

CAPTCHAs and Geeklog - Another tool for combating spam bots?

Authored by: studioq on Sunday, September 17 2006 @ 07:59 am EDT

I have already implemented the older CAPTCHA labled GL_captcha_hack. Are there big differences between that and the new one? Also, if you know - How would I back the old one out and put the new one in..
Thanks.
Studioq

CAPTCHAs and Geeklog - Another tool for combating spam bots?

Authored by: mevans on Sunday, September 17 2006 @ 08:33 am EDT

If it is one of the beta releases of the one I wrote (you would have downloaded it from http://www.mediagallery.org), there are not any major enhancements to require you to upgrade. Basically, I added a few tweaks to address accessibility issues (email admin / refresh graphic), move all the text to a language file and cleaned things up a bit.
If you do want to upgrade, just follow the README instructions, you will end up doing a re-install when it is all said and done (just copying the new stuff over the old and cutting/pasting the new code into lib-custom.php over the old).

Thanks!
Mark

---
Media Gallery - the ultimate gallery plugin for Geeklog - www.mediagallery.org

CAPTCHAs and Geeklog - Another tool for combating spam bots?

Authored by: Tom_D on Monday, September 18 2006 @ 01:54 pm EDT

Thanks for all the work on this. One of my GL sites was getting a lot of porn comment spam and this has stopped it.

Like someone mentioned, I'd like to see this get installed with GL and a switch to turn it off in the config file. (I think it should default to on.)

Thanks again.
Tom

CAPTCHAs and Geeklog - Another tool for combating spam bots?

Authored by: JohnG7 on Friday, September 29 2006 @ 02:39 am EDT

Could someone please explain how to install this? We're running cpanel/unix.
Sorry, I'm new to this stuff.

---
Looking for cool stuff - www.cubicleamusements.com

CAPTCHAs and Geeklog - Another tool for combating spam bots?

Authored by: mevans on Friday, September 29 2006 @ 08:53 am EDT

No problem being new to this, we all started there at some point. I would recommend you take a look at the installation instructions and then use the support forums here or at http://www.mediagallery.org to ask more specific questions, or just yell for help in the forums. Using the comment feature for stuff like this gets to be a little awkard, I prefer to use the forums if we can.

Thanks!
Mark

---
Media Gallery - the ultimate gallery plugin for Geeklog - www.mediagallery.org

CAPTCHAs and Geeklog - Another tool for combating spam bots?

Authored by: tingo on Saturday, September 30 2006 @ 11:13 pm EDT

For those wanting more information about CAPTCHA weaknesses, see the PWNtcha page.

CAPTCHAs and Geeklog - Another tool for combating spam bots?

Authored by: Tony on Friday, October 06 2006 @ 10:37 am EDT

mevans,

I think this should live in the plugins subsystem. I realize this isn't quite a plugin in that you have to boostrap it to the GL user registration system *but* organizationally I don't like it in the root Geeklog directory. By doing this you can also have your own config.php with the settings in your current captcha.php...which is preferred as it moves the config stuff outside the webtree. If you don't like this idea, maybe at least introducing a $_CONF['path_captcha'] would be preferred then we can put the library wherever we want. Looking ahead, I say having the CAPTCHA as a plugin is best as I can see other places where people may want to integrate the use of it (for example, comments for forum posts, etc).

Also, for the image library I think you need a setting for $gfxDriver that uses the Geeklog default image library as I don't see the need of explicitly specifying one if I already configured one for use with articles/userphotos in the main Geeklog config.php.

Just some constructive criticism...good job, btw, it's a great start.

--Tony

---
The reason people blame things on previous generations is that there's only one other choice.

---
The reason people blame things on previous generations is that there's only one other choice.

CAPTCHAs and Geeklog - Another tool for combating spam bots?

Authored by: mevans on Friday, October 06 2006 @ 11:07 am EDT

Tony,

Great feedback, I appreciate it. I'll play around with your suggestions over the weekend and see what I can come up with.

Thanks!
Mark

---
Media Gallery - the ultimate gallery plugin for Geeklog - www.mediagallery.org

CAPTCHAs and Geeklog - Another tool for combating spam bots?

Authored by: jackknyc on Sunday, October 08 2006 @ 06:17 pm EDT

While looking up this utility I found this page thar says it can beat the system: http://www.cs.sfu.ca/~mori/research/gimpy/

Jack

CAPTCHAs and Geeklog - Another tool for combating spam bots?

Authored by: studioq on Wednesday, October 11 2006 @ 05:28 am EDT

I just wanted to return and let you know that I am using GL Current. I installed this and removed the old GL CAPTCHA hack and it worked like a charm. So far, not one bot has registered.. Bravo!!! - Thanks..

CAPTCHAs and Geeklog - Another tool for combating spam bots?

Authored by: garfy on Thursday, October 12 2006 @ 03:02 pm EDT

but is it possible to make it as default in the next version

i do not have time to install this every time on each geeklog site. Is there a way?

Warning: Javascript required to enable functionality

CAPTCHAs and Geeklog - Another tool for combating spam bots?

Search

Resources

About

Getting started

Support

Development

Topics

User Functions

What's New

Articles last 4 weeks

Comments last 4 weeks

Pages last 4 weeks

Links last 4 weeks

Downloads last 4 weeks