From Unsung Science with David Pogue: The Man Who Stopped the Spammers

Speaker 1

00:15

Pushkin. Hi, it's Jacob Goldstein, and I'm here today with another podcast I think you might like. The show is called Unsung Science, and it's hosted by David Pogue. You might know David from CBS Sunday Morning, where he's a correspondent covering topics like science, tech, and innovation, topics like the ones we talk about here on What's Your Problem. In the episode you're about to hear, David chats with Luis von Ahn, the founder and CEO of the popular language

00:44

app Duolingo. You might recall I talked with Luis earlier this year about Duolingo and language and the current limits of artificial intelligence. But the show you're about to hear is about what Luis did before he started Duolingo. He invented this thing called CAPTCHA. CAPTCHA is that test that you have to take all the time on the Internet to prove that you're not a robot. And yes, Luis

01:05

knows that the test is super annoying. But the story of CAPTCHA and what happened with it is really interesting. It's got some great twists. By the year two thousand, the Internet was already becoming a cesspool. Software bots were signing up for millions of fake email accounts for sending

01:26

out spam. Luis von Ahn stopped them. He invented the CAPTCHA, the website login test where you have to decipher the distorted image of a word, or where you have to find the traffic lights in a grid of nine blurry photos. The only problem: we hate that test. I would be at a party and, you know, people would ask me what I did, and I would tell them that I helped invent that thing, and people would tell me, oh,

01:50

I hate you. I'm David Pogue, and this is Unsung Science, season one, episode fourteen: The Man Who Stopped the Spammers. In his forty-three years on this earth so far, Luis von Ahn has had three ingenious, innovative, world-changing ideas. I guarantee that you've encountered his second one, probably hundreds of times. Actually, most of us have zero world-changing ideas. Occasionally somebody has one, but three times? His first idea

02:34

came to him in Guatemala, where he grew up. They wanted to start a gym where instead of charging people to show up, let people just show up for free. We're going to connect all the machines to kind of the power grid, and we're going to use the kinetic energy that people had whenever they were exercising to generate power. And I thought we could make a lot of money from that. Now you will note that I did not say that all three of his world changing ideas actually

03:00

succeeded in changing the world. I thought I was the first person to have this idea. It turns out it's a very old idea. It also turns out it doesn't work. That's right, the pedal-power gym idea flopped. It turns out this is not a good idea for many reasons, the biggest one of which is that humans are just not very good at creating energy. So you just don't make a lot of money from this. There's another

03:20

reason why this doesn't work. It turns out gyms make most of their money from people who don't show up. Of course, here you kind of need people to show up. To be fair, he was pretty new at the game when he had this first idea. And how old were you at this point? Twelve years old, eleven years old. Things started going better six years later, when he came to the United States to attend Duke University.

03:43

As the year two thousand dawned, Luis was at Carnegie Mellon in his first year of working toward a PhD in computer science, and one fateful day he went to a talk by an Israeli computer scientist named Udi Manber, who at this point was the chief scientist at Yahoo. By the way, in the year two thousand, Yahoo was the biggest tech company in the world, like the Google of today. And, you know, he was giving a talk about ten problems that they didn't know how

04:12

to solve inside the company. And one of those ten problems that the greatest minds at Yahoo could not solve was automated software spam bots signing up for free Yahoo Mail accounts by the millions. Yahoo gave out free email accounts, and there were people who wanted to send spam from Yahoo accounts. But each Yahoo account only allowed

04:34

you to send like five hundred messages a day. If you wanted to send millions of spam emails per day, then what these people did is they wrote programs to obtain millions of Yahoo accounts every day, and they didn't know how to solve that problem, how to stop that. So I started talking about it with a person who had just become my PhD advisor. His name was Manuel Blum, or is Manuel Blum. He's most definitely still alive. And, you know, we started thinking, and this is where

05:00

this idea of a CAPTCHA came up. The idea was this: anytime you tried to sign up for a Yahoo Mail account, you'd encounter a little puzzle, something easy for a person to solve, but hard for a spambot. The way to stop these spammers was to have a test that can distinguish between whether you're a human or a computer. If you are a human, then presumably you can't get millions of email accounts because you get bored, whereas if you're

05:26

a computer, you can get millions. So if the only entities that we were giving email accounts to were humans, then that would stop the spam. CAPTCHA, the name he gave his online mini puzzle, is an acronym. It stands for Completely Automated Public Turing test to tell Computers and Humans Apart, more or less. Not sure if you've heard of the Turing test, but it is incredibly famous among

05:52

computer scientists. It's this experiment proposed by British mathematician and computer scientist Alan Turing, who's known as the father of artificial intelligence. There was actually a movie about him called The Imitation Game, where Benedict Cumberbatch played Alan Turing. Would you like to play? Play? It's a game, a test of sorts for determining whether something is a machine or a human being. Anyway, the Turing test is intended to set a standard for determining if a computer has achieved true

06:29

artificial intelligence. When can we tell that a computer is actually intelligent. This is kind of like a philosophical test that said, like, look, we're going to have a human judge ask questions to two entities. One is the computer, one is the human. The computer and the human are hidden behind two curtains. The judge can't see them. The judge types in questions and then looks at the text

06:51

of the responses. If it's impossible to tell which answer came from the person and which from the computer, the computer has passed the Turing test. The judge can just ask whatever questions they want, and if we really can't distinguish them, we'll say the computer is really intelligent. To this day, we have not made a computer that can actually pass the Turing test successfully. It's just

07:13

too hard. The funny thing is, if you really think about it, the CAPTCHA problem is the opposite of the Turing test. The Turing test is successful if the judge can't tell the difference between a person and a machine. The whole point of Luis von Ahn's project was to create a test that can tell the difference. There's another difference between the two tests too. Here's the key. In this case,

07:36

the judge was a human. In our case, for the CAPTCHA, what we needed to do is we needed the judge to be a computer, because we need the computer to determine whether it's talking to a human or a computer, which is much harder in some sense, at least to grade. So I think the hardest thing was just coming up with this general idea that, like, okay, what we need is a test that can distinguish humans from computers, but that computers need to be able to grade.

07:59

Then after that we started coming up with, like, okay, what are things that computers are not very good at? In the year two thousand, the answer was obvious: computers are not very good at identifying what's in pictures. We quickly honed in on images and just doing, you know, images of text, images of flowers, images of stuff. And then after a while, the images of text were the

08:21

ones that seemed like the best idea. And then I just went and developed a program that distorted random text, and that was the first version of a CAPTCHA. That's right. The test they came up with presents you with the image of a typed word, but the letters are all, like, twisted, bent, and distorted, as though the typist were severely drunk and typing on Saran Wrap. You are supposed to interpret what that word is and type it into a box on the website. Actually, computers in

08:52

the early two thousands were pretty good at OCR. That's optical character recognition, meaning looking at a picture of text and figuring out what the letters are. But the added challenge of the twisty distortion really threw those OCR programs off the track. Behind the scenes, I mean, what is it? I mean, there's got to be some, I don't know, SQL database or massive bank of little images. I mean,

09:18

actually there was no database at first. We would just write a program that would first pick some random characters, put them on an image, then distort them, and then we would save that image. And then we just had, I don't know, a couple of million of those saved, not even in a SQL database. They were just there, saved as files. It worked brilliantly. The spambots didn't have a chance. At the time, von Ahn had no idea

09:43

if his invention would be of any commercial use. But one guy he knew would be interested: Udi Manber, that Yahoo chief scientist who'd given the talk that started this whole affair. We sent them an email saying, hey, we think we can solve your problem, and he said, oh, that seems like it solves the problem. And then in fact, pretty soon after that it was being used by Yahoo, and then basically every website started using it, and, you know, there were millions of websites out there.
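The generation program described a moment earlier (pick random characters, put them on an image, distort it, save it as a file) can be sketched roughly as follows. This is only an illustration under loud assumptions, not the original Carnegie Mellon code: the tiny 3x5 bitmap font, the sine-wave column shift, the plain-text PBM file format, and every function name here are hypothetical choices made so the sketch runs with nothing but the Python standard library.

```python
import math
import random

# Hypothetical 3x5 bitmap font for a deliberately small alphabet.
GLYPHS = {
    "A": ["010", "101", "111", "101", "101"],
    "C": ["111", "100", "100", "100", "111"],
    "E": ["111", "100", "111", "100", "111"],
    "H": ["101", "101", "111", "101", "101"],
    "L": ["100", "100", "100", "100", "111"],
    "T": ["111", "010", "010", "010", "010"],
    "U": ["101", "101", "101", "101", "111"],
}


def random_text(rng, length=5):
    """Step 1: pick some random characters."""
    return "".join(rng.choice(sorted(GLYPHS)) for _ in range(length))


def render(text, pad=2):
    """Step 2: put the characters on an image (a grid of 0/1 pixels)."""
    width = len(text) * 4 + 1          # 3 pixels per glyph plus 1-pixel gaps
    height = 5 + 2 * pad               # blank rows above and below the text
    grid = [[0] * width for _ in range(height)]
    for i, ch in enumerate(text):
        for r, row in enumerate(GLYPHS[ch]):
            for c, bit in enumerate(row):
                grid[pad + r][1 + i * 4 + c] = int(bit)
    return grid


def distort(grid, rng, amplitude=2, period=9.0):
    """Step 3: distort by sliding each column up or down along a sine wave.

    With amplitude <= pad (from render), no pixel is pushed off the image.
    """
    height, width = len(grid), len(grid[0])
    phase = rng.uniform(0, 2 * math.pi)
    out = [[0] * width for _ in range(height)]
    for x in range(width):
        shift = round(amplitude * math.sin(2 * math.pi * x / period + phase))
        for y in range(height):
            src = y - shift
            if 0 <= src < height:
                out[y][x] = grid[src][x]
    return out


def save_pbm(grid, path):
    """Step 4: save the image as a file (plain-text PBM bitmap)."""
    with open(path, "w") as f:
        f.write(f"P1\n{len(grid[0])} {len(grid)}\n")
        for row in grid:
            f.write(" ".join(map(str, row)) + "\n")


def make_challenge(rng, path):
    """Generate one challenge file and return the answer the grader keeps."""
    text = random_text(rng)
    save_pbm(distort(render(text), rng), path)
    return text
```

Calling `make_challenge(random.Random(), "captcha_0.pbm")` in a loop would build up a directory of saved challenges, in the spirit of the "couple of million of those saved" files mentioned above. The point of the design is that the grader never has to read the image: it already knows the answer because it chose the characters before distorting them.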

10:11

We're using it. Well, how wonderful. Luis von Ahn's ingenuity: one. Spammers: zero. Internet: saved. And at first I was very kind of proud of myself because, okay, look at the impact that my work has had. Basically, we stopped spam; it's being used by a lot of people. There was only one problem. Now, people hated his invention. How many of you have had to fill out some sort of web form where you've been asked to read a distorted sequence of characters like this? Yeah,

10:40

how many of you found it really, really annoying? Okay, outstanding. So I invented that. That's how he introduces himself in a twenty eleven TEDx talk at Carnegie Mellon. I would be at a party, and, you know, people would ask me what I did, and I would tell them that I helped invent that thing. And people would tell me, oh, I hate you. That's right. The inventor of CAPTCHA is fully aware that people hate the thing. I say either, well, I'm sorry, or, I find it

11:08

annoying too. You've heard it right here, folks. Even he finds them annoying. In fact, Luis can tell you exactly how much of your time they waste. I did a little back-of-the-envelope calculation. At the time, about two hundred million times a day, somebody typed one of these CAPTCHAs. Two hundred million times ten seconds, which is how long it takes to type one of these: humanity as a whole was wasting about five hundred thousand

11:32

hours every day typing these annoying CAPTCHAs. Great. So I started feeling bad about that, and that's when I started thinking, okay, can we do something good with that time? See, the thing is kind of similar to the gym idea. Can we get millions of people to do something during that time that is actually valuable? I'll give you a hint. We're only at the halfway point in this story. After the break, we'll tell you what he came up with to make those half a million hours every day useful

12:00

to humanity. And one more plug here. I'm the author of a book called How to Prepare for Climate Change. It's a six hundred page paperback that's designed to be a field guide to the new climate. It tells you where to live, where to invest, what to grow, how to reinforce your home, how to insure, how to talk to your kids, and how to ride out wildfires, hurricanes, heatwaves,

12:30

and so on. If you live in a state whose name contains a vowel, then you've been affected by climate change already, and you should check out this book to protect your health, your family, your home, and your finances. It's How to Prepare for Climate Change, the book that's exactly what it sounds like. Welcome back. By two thousand and five, Luis von Ahn's invention, the CAPTCHA test, was a huge hit. It reduced the world's scumbag spammers to

13:02

blubbering losers. No longer could they bombard websites with phony sign-ups for the purpose of pursuing their pathetic spammy schemes. Unfortunately, he had achieved this success by transferring the burden onto us, treating us as though we were guilty until proven innocent. Now we were the ones being challenged. We were losing ten seconds per website typing in those stupid distorted letters. Now.

13:30

To be fair, history is full of examples like that, where the actions of a few selfish, greedy idiots wind up inconveniencing billions of innocent people for the rest of our lives. You know, some dirtbag tries to put poison into drugstore Tylenol bottles, and now the rest of us are stuck with frustrating, plastic, wasteful bottle lids forever. Some delinquent tries to blow up a plane with a shoe bomb, and now we all have to

13:56

walk through the TSA scanners in our socks. Luis felt bad that his hacker blockade wasted everybody's time, but at least he could do something about it. So that's very valuable time, so can we use it for something? And then I ended up coming up with this idea that while you were typing a CAPTCHA, you could be helping digitize books. And here's kind of how that works.

14:19

So at the time, this is the year maybe two thousand and five, two thousand and six, there were a lot of projects trying to digitize all of the world's books. The way that worked is you start with a physical book and you want to put it on the internet. And the way you do that is you basically take a digital photograph of every page of the book. Now these are pictures of text. The next step in the process is that the computer

14:38

needs to decipher what's the text in there. In other words, computers had to perform, come on, you know this term: OCR, optical character recognition. And unfortunately, for books that are older, where maybe the ink has faded, computers could not recognize many of the words. So the idea was, let's take all those words that the computers could not recognize while books are being digitized, and let's get people to read them for us while they're typing a CAPTCHA.

15:06

So what we started giving people were these words that the computer was not able to digitize or recognize. So yeah, all this time you thought you were typing random words. In fact, you were helping companies digitize old books and articles and, by the way, helping Luis's little company make money. The idea is we made a CAPTCHA, a whole system that would help your website be protected against spam, and we gave that away for free.

15:33

And for example, Facebook used our CAPTCHA, and we gave it away for free, et cetera. But always with a caveat that if they are going to do that, then we can see the answers that users are typing, so that we can help digitize something. And the way we made money is by charging people who needed digitization. For example,

15:52

the New York Times was our client. The New York Times had this old archive of all the editions of the New York Times from, you know, one hundred and thirty years of the New York Times or something like that, from the eighteen hundreds, and they needed this to help digitize their whole archive. They were sending us all the scans they had scanned already and we were sending them.

16:10

We were taking all the words that the computer could not recognize, and we were getting, through the CAPTCHAs, people who were, for example, signing up for Facebook or Twitter or a lot of websites that were using our CAPTCHA. They were helping us digitize the New York Times, and we would make money from the New York Times. It became very successful, and then Google bought it to help their whole book digitization project. The new system, called reCAPTCHA, became an even

16:31

bigger hit. Here's how he described the aftermath in his TEDx talk. So every time you buy tickets on Ticketmaster, you help to digitize a book. Facebook: every time you add a friend, you help to digitize a book. Twitter and about three hundred and fifty thousand other sites are all using reCAPTCHA. And in fact, the number of sites that are using reCAPTCHA is so high that the number of words that we're digitizing per day is really, really large.

16:51

It's about one hundred million a day, which is the equivalent of about two and a half million books a year. And this is all being done one word at a time by just people typing CAPTCHAs on the Internet. There are some people who are a little nervous about Google being the owner of one of the most widely used CAPTCHA systems. I'm sure you've been asked about that. Yeah, there are people who are nervous about that. I mean, I understand. I think, you know, these are

17:20

very, very tricky questions. I mean, personally, I think the privacy fight is over. I mean, I've given up on my privacy against large companies a while ago. Wow. Not only that, I also think, after having been inside Google, I saw with how much respect they treat user data, because they know that they are, you know, a few scandals away from being in deep trouble, so they take it with a lot of care, I think. And we should point out that Google has said we do not

17:50

use data collected for advertising purposes. Yeah, that's the case, and I actually believe them. Now, remember Luis said that the hard part was finding a test that was too hard for a computer to pass, but easy enough for a computer to judge whether the test had been passed. That's been bugging me. If the computer chooses a word that's so distorted that it itself cannot do the OCR, then how does it know if we're right? Yeah, that's a great question. When we try to digitize books,

18:22

here's what we do. We take a word that the computer does not know. We actually pair it with another word for which the computer does know the answer, and we actually give people both words, and we say please type both, and we don't tell them which one's which.

18:35

We just say, hey, please type both. If they type the word for which we know the answer, if they type that one correctly, we assume that they're human, and we also get some confidence that they type the other word correctly, and then what we do is okay, so now we have a guess for what that other word is. We give it to like ten other different people and we see if they type the same thing, and if they all type the same thing, we get with very high accuracy what that word really is, and that works.
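The two-word scheme just described can be sketched in a few lines. This is a minimal illustration, not the production reCAPTCHA logic: the function names, the case-insensitive comparison, and the exact threshold of ten matching answers (taken from the "like ten other different people" in the conversation) are all assumptions.

```python
from collections import Counter


def normalize(text):
    """Assumed normalization: ignore case and surrounding whitespace."""
    return text.strip().lower()


def grade_response(control_answer, typed_control, typed_unknown):
    """Grade one submission of the two-word challenge.

    The control word (answer already known) decides whether the user is
    treated as human. Only then is the other typed word kept as a vote for
    the word the OCR could not read.
    Returns (is_human, vote), where vote is None for a failed challenge.
    """
    if normalize(typed_control) != normalize(control_answer):
        return False, None
    return True, normalize(typed_unknown)


def resolve_unknown(votes, min_votes=10):
    """Accept a transcription only when enough people all typed the same thing.

    Returns the agreed word, or None if there are too few votes or any
    disagreement (a real system would presumably be more forgiving).
    """
    if len(votes) < min_votes:
        return None
    word, count = Counter(votes).most_common(1)[0]
    return word if count == len(votes) else None
```

For example, ten users who each pass the control word and all type "overlooks" for the unknown word would make `resolve_unknown` return "overlooks", while a single dissenting "overlocks" would leave the word unresolved. The computer can grade the test without reading the hard word because it only ever checks the word it already knows.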

19:00

One hallmark of the reCAPTCHA system, in other words, is that you have to type in two words. There's sometimes also funny words, funny combinations that happen, especially because we are showing two words at a time. Oh boy, I mean, you know, there's been all kinds of really funny examples where it's just like, you know, a website of a church that says, like, bad Christians. But these are just two randomly chosen words, so

19:26

we shouldn't infer any evil on your part. No, they're random. Now, a lot has happened since two thousand, when CAPTCHA came along, and since two thousand and six, when you started unsuspectingly helping Google and the New York Times digitize their old pages. You know, early on, in the first version of a CAPTCHA, computers were pretty bad at recognizing distorted text,

19:47

so they didn't have to be that distorted. But you know, over time, computers got better and better, and in fact, by now computers are in many cases about as good as humans. Because of that, we have to make them harder and harder. A lot of times, the puzzles are so hard that even the human can't pass the challenge. I'm sure you've been sent screenshots of words that are

20:08

so mushed that no one can tell what it is. Yes, that happens. I mean, it's rare that that happens, and that's why the CAPTCHA itself, in true arms-race fashion, has evolved. So what has happened is that for the more secure things, the CAPTCHAs have moved away from these distorted characters. And what is being used now, the puzzles are now things like you see a bunch of pictures and you have to click the ones that

20:37

contain a stop sign. Right, the traffic lights, the fire hydrants. Yeah, it's exactly the same idea as reCAPTCHA, except we're not trying to digitize books. This, a lot of times, comes from things like all the mapping cars or the self-driving cars. Basically, these are cars that are driving around that are capturing images of the whole world. They're trying to figure out what's around them. Sometimes they cannot recognize what's in an image.

21:01

So it's a similar case. It takes the things like, is this a stop sign? I'm not sure. Okay, send it to a human. And then when you get it and you click on the stop sign, you're actually helping either the self-driving car or the mapping software or whatever know that there is actually a stop sign right here. Oh, so we're still doing good for the world as we do this. Still doing good for the world, or for a company. Or for a company, but maybe not digitizing books. But it's a similar idea: something that a

21:26

computer cannot do. You've just solved a mystery for hundreds of millions of people: why it's always traffic lights and fire hydrants we're supposed to choose and not bananas and puppies. It has to do with both self-driving cars and also mapping software. Okay, so now we kind of get why we have to put up with these challenges, or we did twenty years ago, but really nothing better has come along since? Are we sure that there's nothing

21:54

less annoying that we could do to thwart these spammers? Yes, there is. By now, it did become a lot less annoying. I don't know if you've seen that of late, where, you know, there's a thing that says reCAPTCHA. We're just trying to figure out whether you're a human, and they just ask you to click somewhere, just click on this box. That is much less annoying. So sometimes you don't see anything except I'm not a robot. Yeah, yeah, yeah, I'm not a robot. This is something that

22:19

is done by Google. This actually comes from you know, the original team, that is the company that they bought from me. When you get that one, that means that, in this particular case, probably means Google has figured out that, yeah, you know what, we know you because you've been around since twenty sixteen in this computer, and yeah, you have a lot of Gmail emails, and you've done a lot of Google search queries. You're a normal person, You're not

22:43

a spammer. So they just do a little thing that just tries to double check that, you know, I can move the mouse or whatever. So one thing that has changed from the year two thousand and five to now is that there are companies like Google or like Facebook that for the majority of people on the Internet, they kind of know who you are. If you have a fresh computer that you've never used before, then you would

23:05

have to do the annoying CAPTCHA. But for most of us, you're unlikely to have to type these as much as you were back in, say, the year two thousand and five. It has become a lot better, you know, probably a little bit at the cost of your privacy. Okay, but wait a minute, we now know that computers eventually got too smart for the distorted-text-reading Turing tests. Won't they eventually get good enough to identify a few stupid stop signs in a photo grid? It is, it's

23:31

a cat-and-mouse game. Now, probably there's a bunch of people working on making better recognition of stop signs or something like that. But eventually computers are going to be able to do everything humans can, and so at some point there won't be a test that can distinguish humans from computers. Well, wait a minute, does that mean the end of the internet? I mean, what happens if there's no sort of Turing test

23:53

that works anymore? I don't think it's the end of the internet, particularly because, like I said, more and more these companies are going to know more and more about you, and I just don't think there will be a huge problem. Okay, well, whatever the end game is, why can't we do it today? Since we know it's an arms race, since we know that eventually we'll lose it to AI and computers, why can't we jump to whatever will follow it now? I'll tell you why. By the way, it's like ninety

24:23

five percent of the way there. I mean, really, for most of us, you know, Facebook knows who we are, and Google knows who we are, so it's ninety percent of the way there. The reason it's not one hundred percent of the way there is because there are some people who really care about privacy, and, you know, there's always going to be kind of a way to browse privately. So for example, Chrome has private browsing. So it's all the stuff when people care

24:45

about privacy. I mean, there's a trade-off here, right? Well, the irony is it seems like most of the websites that present me with a CAPTCHA I'm trying to get to in order to supply my name and address, like I'm signing up for something. Yes, it's funny. Why do I need privacy when the whole purpose is to supply my information? Yeah, that's funny. Now, I mentioned at the beginning that Luis has had three world-changing ideas. You've heard about the gym membership that powers the grid,

25:16

and you now know about CAPTCHA. But what about his third creation? It's Duolingo, the language training app. At this moment, it has half a billion registered users learning forty different languages, all for free. And from the very beginning you could see the fingerprints of Luis von Ahn, master of crowdsourcing, all over it. In early Duolingo, as you were learning a language on Duolingo, you were actually helping us to translate stuff that computers could not translate.

25:52

In fact, CNN was a client, so CNN would send us their news in English. We would then give it to people who were Spanish speakers who were learning English, and we would say, hey, you want to practice your English, help us translate this CNN article into your native language of Spanish. And so they would do it, and they would be learning English and then we would get that translation, and then we would send it back to CNN and they would pay us for the translation. That was the

26:16

very first version of Duolingo. It turned out that, just like the gym, it ends up being that you just can't make much money from this, and so we decided, okay, just go to a business model where we actually give you ads, and the way we make money is by, you know, showing you ads. The dude just keeps doing that. He keeps coming up with ideas that make the world a better place, thwart the bad guys, and make a lot of money. It's really a shame he gave up

26:43

that electrical grid gym thing. Are there ever things that come to you in the shower that might be your big third act? I mean, honestly, to have the impact you've had twice is astonishing, But it makes me think there's something in you that just has great ideas that can go really wide. You know, as time passes, I am a lot more interested in literacy and teaching people

27:08

how to read. I think with a computer, we should be able to teach the whole world how to read significantly better than humans can teach you how to read. You know, the US is fine, most adults in the US know how to read, but in many countries in the world there's a significant fraction of people who don't know how to read. In fact, there's about a billion adults in the world that are illiterate. And I think we can make a big dent, you know,

27:29

with a system to teach people how to read. So we're working on that in the meantime. Now you know why you have to encounter those infernal website challenges. You know how they came about, and you now consider them a necessary evil. Well, maybe you do. Just for people who are like, I don't know what it is, I just don't like doing it, I can't even tell what's a freaking traffic light, let's just lay out what would happen if all these challenges went away tomorrow. What would happen

27:56

to the Internet? Most likely, you would get a lot more spam, either in your email, or you'd get more kind of random Facebook followers that are not, you know, real people. These fake accounts can start boosting up political messages. There would be probably more fake news. There would probably be, you know, more spam, right, and from spam, phishing and spyware. And yeah, more spyware. Yeah. The web would be a less safe place. All right. So when you do explain this to someone at the

28:33

proverbial party, are they generally satisfied with the notion that? Yeah, I think most people. I think most people realize that it's like, you know, these things are kind of like a like a key nobody nobody likes. It's not like I love opening my door with the key. It's kind of annoying, but yeah, that's there, and I understand it just makes it makes my house safer. In this case, it's kind of just makes the whole Internet safer. I kind of gotta do it. Thanks for listening to this

29:00

second-to-last episode of the season. If you have any interest in a second season, please spread the word, subscribe to this podcast, leave a review on Apple Podcasts, or a rating on Spotify. Unsung Science with David Pogue is presented by Simon and Schuster and CBS News and produced by PRX Productions. The executive producers for Simon and Schuster are Richard Rohrer and Chris Lynch. The PRX production team is Jocelyn Gonzalez, Morgan Flannery, Pedro Raphael Rosatto, and

29:31

Ian Fox. Project manager Jesse Nelson composed the Unsung Science theme music and Christina Robello fact-checked my script. At Unsung Science dot com, you can listen to every episode we've ever made and read complete transcripts. For more of my stuff, visit David Pogue dot com or follow me on Twitter at Pogue. Thanks for listening. That was an episode of Unsung Science from our friends at CBS News. You can find more episodes of Unsung Science wherever you get your podcasts.

Transcript source: Provided by creator in RSS feed