reCAPTCHA – yay or nay?

I’m using reCAPTCHA to stop comment and/or email spam; it works by having visitors “read books”. (If you don’t know what a CAPTCHA is, their website explains it comprehensively.) Installing the plugin and getting the API keys were simple.

Book pages are photographically scanned and then, to make them searchable, transformed into text using “Optical Character Recognition” (OCR). The transformation into text is useful because scanning a book produces images, which are difficult to store on small devices, are expensive to download, and cannot be searched. The problem is that OCR is not perfect.

reCAPTCHA improves the process of digitizing books by sending words that cannot be read by computers to the Web in the form of CAPTCHAs for humans to decipher. More specifically, each word that cannot be read correctly by OCR is placed on an image and used as a CAPTCHA. This is possible because most OCR programs alert you when a word cannot be read correctly.
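To make that flow a bit more concrete, here is a rough Python sketch of the idea, not reCAPTCHA's actual code. The `OcrWord` class, its field names, and the sample words are all made up for illustration. Taking the most common human answer is also my own assumption; the description above only says that humans decipher the words.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class OcrWord:
    image_id: str   # hypothetical identifier for the cropped word image
    guess: str      # the OCR program's best guess at the text
    flagged: bool   # True when the OCR program reported it could not read the word


def select_captcha_words(words):
    """Keep only the words the OCR program flagged as unreadable."""
    return [w for w in words if w.flagged]


def most_common_answer(answers):
    """Assumption: settle on the answer that the most people typed."""
    normalized = [a.strip().lower() for a in answers if a.strip()]
    if not normalized:
        return None
    return Counter(normalized).most_common(1)[0][0]


# Made-up sample page: two of the four words were flagged by the OCR step.
page = [
    OcrWord("w1", "morning", False),
    OcrWord("w2", "ov3rlooks", True),
    OcrWord("w3", "the", False),
    OcrWord("w4", "h@rbor", True),
]

captcha_queue = select_captcha_words(page)
print([w.image_id for w in captcha_queue])
print(most_common_answer(["overlooks", "Overlooks ", "overbooks"]))
```

Only the flagged words ever reach a human, so the words the OCR already read correctly cost nobody any time.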

This seems like a win/win situation, right? So why don’t more websites (e.g., blogs) use this service?

Published by

Bryan Villarin

Bryan works at Automattic. Cat whisperer. Sometimes, a photographer and card magician.

8 thoughts on “reCAPTCHA – yay or nay?”

  1. My only issue with CAPTCHA (and more so reCAPTCHA) is that it’s sometimes difficult to see what the words are, so it takes numerous attempts, which gets more and more frustrating.


  2. How well did reCAPTCHA work while you had it installed? Honestly, I hate captchas too, but I’ll be going on a bit of an extended vacation soon and am thinking about adding this as an extra layer of protection while I’m gone. At least it’ll contribute to a good cause too, right?


  3. @James: I think it worked well. Nothing really comes through anyway, since I don’t get too much traffic. 😉

    Wherever you’re going for vacation, have a great time!


  4. I think that you should do like what does. Before you post there, you are asked a random question such as, “How many ears does the average human have?” That is such a simpler way to filter out spam and such.


