Chatbots.org RSS Facebook Twitter
Chatbot listing, virtual agents, virtual assistants, chat bot directory, conversational agents, virtual human news, chatterbot list
  • Log In
  • Register
Directory RenewedBusiness NewResearch Awards Community Forums

Chatbot Suzette Fools One Judge During Loebner Prize Contest

Bruce Wilcox 37|8540 By Bruce Wilcox, Nov 3, 2010 in Award news

Summary: Chatbot Suzette Fools One Judge During Loebner Prize Contest

My chatbot, Suzette, won this year’s Loebner Prize contest and even confused a judge into voting for her over a human (or should I say he confused himself). But here is the blow-by-blow of this weird event.

When I arrived at the contest, I figured I had good odds to win if nothing went horribly wrong. Yes, Suzette had easily qualified over the 3 other competitors (her score 11 pts, the nearest competitor 7.5). Her design and data naturally gave her an edge over her competitors on the human knowledge test questions of the qualifiers. But human judge chat was an entirely different matter than the qualification test. Still, I felt she could carry on a detailed conversation better than the others and should win.

Initial installation of the programs occurred on Friday. From prechat conversations with the other contestants I learned that A.L.I.C.E. came with 3 redundant disks. Yet all three turned out to be blank! What a scare that must have been. Dr Wallace managed to install by retrieving the program over the Internet. Cleverbot is now at 45 million lines of memorized user chat (at a rate of doubling every year). And UltraHal is now listening to tweets, so has 300K of user chat it learned and 400K of tweets it has accepted for learning (code decides if the user has had enough responses and doesn’t trigger any red flags).

Then we get to the competition. While the CalState organizers had initially planned to have various interdepartmental professors act as judges (like English dept, etc), they backed out at the last minute, so all the judges were from the Engineering/Computer Science dept. Talk about guys who might know what to expect from chatbots! And all the humans were students from the same departments. What a weird mixture to compete in. And then, each round was 25 minutes. That’s bad if you want confuse a judge about who is human. But really, the programs have no chance for that. So it’s good because it gives the human time to compare each program against the other. Though it’s not clear to me that the judges tried to use their time to do that.

And the students didn’t really understand their role. It was merely to BE HUMAN and convince the judges of that. Before startup there was informal chatting between humans and judges, which was obviously inappropriate and it was then pointed out to the humans that since the judges already knew their names, they had best use false ones in the competition.

So, Round 1. After a few exchanges, somehow Suzettte got stuck into repeating exactly what the judge said for the rest of the round. I have no idea how. The round is a total disaster. I’ve never seen such a bug before. Maybe it’s in my only-lightly-tested protocol for the competition. I have no idea. But it completely derails my hopes for Suzette. She could still win on points only if she outdoes her opponents for every other judge and the other contestants vary all over the place.

Round 2, a great demonstration of Suzette. She should win on this round alone.

Round 3 gets off to a horrible start. Somehow, Suzette can hear the judge but the judge can’t hear Suzette. Makes no sense. A couple of restarts of Suzette doesn’t fix this. Eventually they restart the judge program, and that clears it (not that that makes any sense either). Then, after a few rounds, it’s clear Suzette has the judge from hell. He wants to know who she’s going to vote for in the upcoming election (the unspecified California governor’s race). And when she has no useful answer he wants her to name a candidate in the race. And when she has no answer to that, he simple keeps repeating the question ad nauseum, insisting she answer it. Suzette gets irritated. Then she gets angry. Suzette then gets bored. Suzette threatens to hang up on him The judge doesn’t back down until the last seconds of the round. I figure that’s the end of life as we know it.

Round 4 is a mixed bag. Suzette is ok but not great. It’s all over.

When the scores are tallied, Suzette ties with Rollo Carpenter’s Cleverbot for 2nd-3rd. Yet, it turns out, the 3rd round judge got the human subject from hell. Poetic justice! The human was all over the place—confusing, vague. The judge voted irritated/angry/bored Suzette as human. Instant win since no other program swayed the judges.

What more can I say?”

Related Chatbot: Suzette
 
  • Tweet

Comments

There are 3 comments:
Portrait Netherlands Iwein Fuld arrow May 27, 2011
I tried to chat with Suzette to see if she was any better than cleverbot, but I got the same problem that you had in round 2 with the online variant. (http://ai.bluemars.com/chat/)

Is there another online version that does work?
Portrait United States Mark Ferris arrow Jun 24, 2011
Is the transcript that fooled the judge available?
Portrait United States Nicholas Finch arrow Nov 28, 2011
How funny that, that should happen. what a cruddy judge! is that part of the rules? i guess so because the criteria is to discern the realism of chatbots.

well i guess next time you know what to add to suzzete.
 

New Comment

*

We will never send unsolicited emails or share your address with third parties. See our privacy policy.
*
Your comment (you can use Emoticons):

If you sign-up now and join the fastest growing chatbot community in the world, you'll never have to type those annoying anti-spam characters again. And membership is free, obviously. OK, I join (or login)

Privacy statement

Summary:

We will NEVER spam you, nor publish or sell your details to any third party. We hate spam, just as much as you do.

What data does chatbots.org store?

We’ll store all the details you enter on chatbots.org in our database and we maintain statistics of your visits with the sole reason to give you the best personalized service possible.

How does chatbots.org store my data?

We make use of Expression Engine, one of the largest weblog publication systems in the world. US President Barack Obama has used it for WhiteHouse.gov. Our system makes use of a MySQL database.

How do I access my data?

If you are a chatbots.org member, you can access your personal data through your account panel after you login. Additionally, your statistics (number of visits, numbers of reactions, duration of your visits etcetera) will be accessible to you in the future. If you are a guest, please contact Erwin van Lun, founder and managing director of chatbots.org,with your question.

What data is shown?

Chatbots.org allows members to build their profile on a dedicated profile page and show it to the outside world to help them to build their reputation as a chatbot expert. Chatbots.org also allows members to turn off this option if they prefer. However, when members have written a post or a reaction, the name they’ve entered in their profile will always be shown, including a link to their profile. If people click this link, a blocked page may be shown (dependent on your preference). We will create an ‘alias’ option in the future for those members who do not want to use their real names, but we strongly believe that professionals should reveal their identity.

If you aren’t a member, your e-mail address will be necessary when you leave a comment on the site, for follow-up comments, any questions we might have about your comment (which isn’t very likely) or for direct reactions.

When will you use my contact details?

As a member or a guest, we probably know your e-mail address and in some situations also your telephone number or residential address details.  We will NEVER spam you, nor publish or nor sell your details to any third party. We hate spam, just as much as you do.

We will use your e-mail address to notify you about new comments in a post you commented on earlier (you are able to turn this option off for eacharticle), for account settings confirmations (if you’ve changed your password, for example) or for occasional notifications on major changes in the site (typically 1-5 per year). Obviously, we’ll use your e-mail address to send you the e-mail newsletters you’ve subscribed to. We may also approach you if you’ve left some brilliant comments on the site: we might want to work with you! We will not send you product or service offerings!

We will use your telephone details after we’ve tried to contact you via e-mail and this e-mail bounced, resulted in other error messages or you simply didn’t answer for some kind of reason. If we have the impression your e-mail address doesn’t work (anymore) we might contact you via phone.

We will use your residential address details when we need to ship something to you that we can’t send by email. Additionally we’ll mention your address on invoices.

Who can modify my data?

We have a very small team (typically max five persons) that has access to your personal data. Please contact Erwin van Lun, founder and managing director of chatbots.org, for the most recent list of the individuals who can access your data. If you have subscribed to one of our newsletters, details like name and e-mail address will be made available to our e-mail service provider for single usage.

Is my data secure?

We´ll do all that we reasonably can to protect your data. Reasonable as we are not a large financial international institution or a military organization. You can expect us to follow all Expression Engine security guidelines, make backups and we don´t provide passwords to other individuals.

I received comment spam!

Unfortunately third parties try to destroy the web by putting comment spam (comments placed by robots, with links to dubious websites) onto websites and thus also on chatbots.org. We’ll do everything we can to avoid comment spam whilst also avoiding barriers for people to react (making reactions too secure or too complex will kill the dynamics of the site). It´s all about balance. Comment spammers don’t have access to your e-mail address. You can always unsubscribe to notifications on specific postings.

Any other questions

Please contact Erwin van Lun, founder and managing director of Chatbots.org, if you have any additional questions.

Click on an image to add it to your comment

grin LOL cheese smile
wink smirk rolleyes confused
surprised big surprise tongue laugh tongue rolleye
tongue wink raspberry blank stare long face
ohh grrr gulp oh oh
downer red face sick shut eye
hmmm mad angry zipper
kiss shock cool smile cool smirk
cool grin cool hmm cool mad cool cheese
vampire snake excaim question

Loebner Prize

  • What is the Loebner Prize?
  • Why a Loebner Prize
  • 2012
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001
  • 2000
  • 1999
  • 1998
  • 1997
  • 1996
  • 1995
  • 1994
  • 1993
  • 1992
  • 1991

••••••••

Chatterbox Challenge

  • What’s Chatterbox Challenge?
  • 2011
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005
  • 2004
  • 2003
  • 2002
  • 2001

••••••••

IVA Gala

  • About GALA
  • 2010
  • 2009
  • 2008
  • 2007
  • 2006
  • 2005

••••••••

BCS Machine Intelligence

  • Overview
  • 2009
  • 2008
  • 2007
  • 2006

••••••••

Terasem Turing Prize

  • The contest
  • Contest Rules
  • About Terasem
  • 2009

••••••••

Bot Prize

  • The Bot Prize Contest
  • 2009
  • 2008

••••••••

Soon listed here

Twuring

Personal Archievements

AI awards

Hot

  • Alan Turing year
RSS Awards News  
RSS Feed
Twitter Follow
Chatbots
 
Add Award

Award diary

- 2012 - Loebner Prize
- 2011 - Chatterbox Challenge

Most Awarded

A.L.I.C.E. (17x)
Ultra Hal Assistant (13x)
Talk-Bot (12x)
Elbot (12x)
Jabberwacky (12x)

Most awarded last 3 years

DEIRA (2x)
Cleverbot (2x)
Skynet-AI (2x)
Captain Jack Sparrow (2x)
Zoe (1x)

Now on Turing100 in 2012

  • University of Reading Announces Machines in Turing Test contest on Alan Turing's 100th birthday: 23 June 2012
  • Turing100 at Bletchley Park in June : Honouring Alan Turing
  • Alan Turing, Father of The Modern Computer
  • 5 months to Turing centenary with major events at Bletchley Park, in Cambridge and in Manchester
  • Elizabeth Truss MP mentions Turing on BBC2 Newsnight

Now on ErwinVanLun.com

  • Scale of the Universe
  • Animated 3d logo tool
  • An awesome 3D See-Through Display!
  • Microsoft shows future vision
  • Ticketmaster offers seats next to your friends
 
Bot

The Team (read about the community)

  • Arthur de Wolf
  • Dave Morton
  • Erwin Van Lun
  • Jetty van Kooij
  • Karolina Kuligowska
  • Xander Verduijn

Research Statistics

  • Library: 379 books
  • Publications: 1,525 journals & papers
  • Events: 571 academic conferences
  • Universities: 14,038 universities

Chatbot Statistics

  • Directory: 967 chatbots
  • Companies: 569 developers
  • Community: 30,063 members
  • Synonyms: 155 synonyms
© 2012 AI4US Ltd. — Concepted by futurist Erwin Van Lun — About Us — Contact Us