How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation
Elie Bursztein, Steven Bethard, Celine Fabry, John Lab Security StanfordComputer Mitchell, Dan Jurafsky, E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 ?
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 ? users
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 ? bots users
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 ? bots users
CAPTCHA Completely Automated Public Turing test to tell Computers and Humans Apart
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 93%
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 93%
86%
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 93%
86%
70%
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 7.3 sec
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 8.2 sec 7.3 sec
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 8.2 sec 7.3 sec
9.3 sec
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 8.2 sec 10.6 sec 7.3 sec
9.3 sec
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Outline
• Study methodology • Population demography • Captcha measures
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Accuracy
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Accuracy
Solving time
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 The supply chain
Websites Paper
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 The supply chain
Websites Paper
Scraping
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 The supply chain
Websites Paper
Scraping Solving
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 The supply chain
Websites Paper
Scraping Solving Data Mining
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Scraping
• Alexa top 50 • 23 scheme • 10 000 captcha samples • Custom scraper • Cookies • Javascript events • Ip rate limiting • User agent
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Scraping
• Alexa top 50 • 23 scheme • 10 000 captcha samples • Custom scraper • Cookies • Javascript events • Ip rate limiting • User agent
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Authorize
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Authorize
Baidu
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Authorize
Baidu
captcha.net
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Authorize
Baidu
captcha.net
eBay
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Authorize
Baidu
captcha.net
eBay
Digg
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Authorize
Baidu
captcha.net
eBay
Digg
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Authorize
Baidu
captcha.net
eBay
Digg
Blizzard
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Authorize Yahoo
Baidu
captcha.net
eBay
Digg
Blizzard
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Authorize Yahoo
Baidu Microsoft
captcha.net
eBay
Digg
Blizzard
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Authorize Yahoo
Baidu Microsoft
captcha.net recaptcha
eBay
Digg
Blizzard
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Authorize Yahoo
Baidu Microsoft
captcha.net recaptcha
eBay Skyrock
Digg
Blizzard
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Authorize Yahoo
Baidu Microsoft
captcha.net recaptcha
eBay Skyrock
Digg Slashdot Google
Blizzard
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Authorize Yahoo
Baidu Microsoft
captcha.net recaptcha
eBay Skyrock
Digg Slashdot Google mail.ru Blizzard
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Authorize
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Authorize
Digg
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Authorize
Digg
eBay
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Authorize
Digg
eBay
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Authorize Microsoft
Digg
eBay
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Authorize Microsoft
Digg recaptcha
eBay
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Authorize Microsoft
Digg recaptcha
eBay Slashdot
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Authorize Microsoft
Digg recaptcha
eBay Slashdot
Google Yahoo
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Precision is costly
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Precision is costly
1000
0.1% Precision accuracy
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Precision is costly
1000 x 3
0.1% Precision Knowing the accuracy probable answer
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Precision is costly
1000 x 3 3000
0.1% Precision Knowing the by scheme accuracy probable answer
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Precision is costly
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Precision is costly
63000 captcha
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Underground API
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 MTurk by Amazon
Worker(s) Requester (us)
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Requester interface
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Worker interface
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Largest captcha experiment ever
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Largest captcha experiment ever
• 8 audio schemes
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Largest captcha experiment ever
• 8 audio schemes • 13 images schemes
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Largest captcha experiment ever
• 8 audio schemes • 13 images schemes • 1000 x 3 image captchas / scheme (bypass-captcha)
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Largest captcha experiment ever
• 8 audio schemes • 13 images schemes • 1000 x 3 image captchas / scheme (bypass-captcha) • 3500 x 3 audio captchas / scheme (MTurk)
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Largest captcha experiment ever
• 8 audio schemes • 13 images schemes • 1000 x 3 image captchas / scheme (bypass-captcha) • 3500 x 3 audio captchas / scheme (MTurk) • 5000 x 3 image captchas / scheme (MTurk)
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Largest captcha experiment ever
• 8 audio schemes • 13 images schemes • 1000 x 3 image captchas / scheme (bypass-captcha) • 3500 x 3 audio captchas / scheme (MTurk) • 5000 x 3 image captchas / scheme (MTurk) • 318 000 captchas annotated overall
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Language repartition
Other 149 Russian 12 Balochi 13 Portuguese 15 Hebrew 15 Punjabi 15 Vietnamese 15 Bikol 16 Cebuano 17 Arabic 19 Macedonian 21 Dutch 21 French 23 German 28 Gujarati 30 Slovene 33 Marathi 39 Mandarin 51 Bengali 52 Kannada 64 Spanish 71 Romanian 95 Telugu 331 Hindi/Urdu 578 Malayalam 625 English 2791 Tamil 3502 0 500 1000 1500 2000 2500 3000 3500 4000 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Repartition by education !"#$%&'%#'&()'**+,#-.#&/$0)*+,# &"#&"#
&'"#
!!"# $%"#
()*+,-./0# 123+#4*+..-# 5)06,/# 7.#8./9)-#,:;*)<.=# >+?@#
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Age repartition
800 727 708
700 686 658 647
600
500 429
400 388 364 361 351 318
Numberusersof 300 268 239 230 205
200 182 177 137 133 132 113 106 104 103
100 91 76 70 63 58 56 46 42 38 37 36 35 26 26 26 21 18 16 15 11 10 6 2 2 2 2 1 0 1 18192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646667687172
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Nb of user distinct answer for image scheme (bp)
1 answer 2 answer 3 answer 3.8934 Yahoo 25.615 70.492 3.8462 Blizzard 24.519 71.635 19.306 Slashdot 33.839 46.855 1.4778 Skyrock 18.966 79.557 21.729 Recaptcha 40.576 37.694 5.6893 Microsoft 25.821 68.49 37.44 Mail.ru 42.512 20.048 5.3333 Google 24 70.667 2.7311 eBay 16.597 80.672 5.2036 Digg 30.543 64.253 14.139 Captchas.net 42.008 43.852 4.3678 Baidu 21.839 73.793 0.69124 Authorize 13.825 85.484 0% 20% 40% 60% 80%
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Nb of user distinct answer for image scheme (mk)
1 answer 5.2632 2 answer Yahoo 26.032 68.705 3 answer 0.87449 Blizzard 13.632 85.494 5.7506 Slashdot 26.132 68.117 0.89514 Skyrock 11.995 87.11 18.689 Recaptcha 38.565 42.747 13.119 Microsoft 33.633 53.248 24.388 Mail.ru 41.004 34.608 7.8974 Google 25.385 66.718 1.9013 eBay 15.827 82.271 2.4745 Digg 19.031 78.495 7.9385 captchas.net 32.113 59.949 2.7027 Baidu 16.319 80.978 0.23095 Authorize 6.5178 93.251
0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Nb of user distinct answer for audio scheme
1 answer 29.197 2 answer Yahoo 36.439 3 answer 34.363
28.04 Slashdot 39.474 32.486
66.62 Recaptcha 25.512 7.8678
87.65 Microsoft 10.823 1.5264
95.403 Google 4.1875 0.40965
36.417 eBay 38.797 24.787
86.372 Digg 12.615 1.0129
41.655 Authorize 39.232 19.113
0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Image solving time
200 authorize 0 baidu 200 captchas.net 0 digg
100 ebay 0 google 200 mailru 0 mslive 200 recaptcha 0 skyrock 100 slashdot 0 blizzard 50 yahoo 0 100
0 100
0 100
0 200 0
100 0 100 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Audio solving time
50 Authorize Digg 0 eBay 50 Google Microsoft 0 Recaptcha
50 Slashdot Yahoo 0
50
0
50
0
50
0 50
0 50
0 3 4 5 6 7 8 9 10 11 1213141516171819202122232425262728293031323334353637383940
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Accuracy by education
Image captcha Audio captcha
0.8
0.7
0.6
0.88 0.88 0.5 0.87 0.87 0.85
0.4 0.54 0.54 0.51 0.52 0.51 0.3
0.2
Not formal High School Bachelor Master Phd
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Solving time by education
60 Solving time for image Solving time for audio 50
40
30 seconds
20
23.67 23.25 10 19.75 19.44 21.33
9.6 8.49 9.36 9.16 7.64 0 Not formal High School Bachelor Master Phd
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Image captcha accuracy time by language
Native speakers Non-Native speakers 0.881 Yahoo 0.874 0.946 Blizzard 0.955 0.867 Slashdot 0.890 0.954 Skyrock 0.956 0.738 Recaptcha 0.772 0.804 Microsoft 0.794 0.700 Mail.ru 0.704 0.861 Google 0.873 0.935 eBay 0.935 0.919 Digg 0.925 0.837 captchas.net 0.848 0.927 Baidu 0.928 0.976 Authorize 0.979 0.70 0.75 0.80 0.85 0.90 0.95 1.00
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Image captcha solving time by language
Native speakers Non-native speakers 11.1 Yahoo 9.4 9.7 Blizzard 8.3 8.3 Slashdot 6.2 8.3 Skyrock 7.0 12.8 Recaptcha 9.4 13.7 Microsoft 11.5 13.5 Mail.ru 11.1 10.4 Google 8.0 7.5 eBay 6.7 8.6 Digg 7.2 8.7 captchas.net 7.0 7.4 Baidu 6.3 7.1 Authorize 5.9
0 s 2 s 4 s 6 s 8 s 10 s 12 s 14 s
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Audio captcha accuracy time by language
Native speakers Non-native speakers
0.67 Yahoo 0.71
0.65 Slashdot 0.73
0.45 Recaptcha 0.50
0.37 Microsoft 0.39
0.35 Google 0.35
0.62 eBay 0.65
0.37 Digg 0.39
0.57 Authorize 0.63
0.2 0.3 0.4 0.5 0.6 0.7
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Audio captcha solving time by language
Native speakers Non-native speakers
27.27 Yahoo 25.00
18.39 Slashdot 11.70
30.63 Recaptcha 30.12
19.48 Microsoft 16.63
33.21 Google 35.20
14.49 eBay 11.84
18.89 Digg 14.83
15.41 Authorize 11.94
0 s 5 s 10 s 15 s 20 s 25 s 30 s 35 s E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Conclusion / ongoing work
• Captcha security rely on many different techniques • Not clear how to design them well • Ongoing work • What makes a captcha hard/slow for human ? • What makes a captcha secure ?
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11 Features interactions
E. Bursztein, S. Bethard, C. Fabry, J. Mitchell, D. Jurafsky How Good are Humans at Solving CAPTCHAs? A Large Scale Evaluation http://ly.tl/p11