What’s Your Story: Emre Kiciman

00:01

[MUSIC PLAYS UNDER DIALOGUE] 

EMRE KICIMAN

00:01

I think it's really important for people to find passion and joy in the work that they do. At some point, do the work for the work's sake. I think this will drive you through the challenges that you'll inevitably face with any sort of project and give you the persistence that you need to really have the impact that you want to have. [TEASER ENDS] 

JOHANNES GEHRKE

00:26

Microsoft Research works at the cutting edge. But how much do we know about the people behind the science and technology that we create? This is What’s Your Story, and I’m Johannes Gehrke. In my 10 years with Microsoft, across product and research, I’ve been continuously excited and inspired by the people I work with, and I’m curious about how they became the talented and passionate people they are today.

00:49

So I sat down with some of them. Now, I’m sharing their stories with you. In this podcast series, you’ll hear from them about how they grew up, the critical choices that shaped their lives, and their advice to others looking to carve a similar path. 

01:04

[MUSIC FADES]

JOHANNES GEHRKE

01:06

In this episode, I’m talking with Emre Kiciman, the senior principal research manager leading the AI for Industry research team at Microsoft Research Redmond. After completing a PhD in systems and networking in 2005, Emre began his career with Microsoft Research in the same area,

01:23

studying reliability in large-scale internet services. Exposure to social data inspired him to refocus his research pursuits: his recent work in causal analysis—including DoWhy, a Python library for causal inference—is helping to connect the whats and whys in the abundance of data that exists. Meanwhile, his work with large language models is geared toward making AI systems

01:48

more secure and maximizing their benefit to society. Here’s my conversation with Emre, beginning with some of his work at Microsoft Research and how he landed in computer science.

GEHRKE

01:58

Welcome to What's Your Story. So can you just tell us a little bit about what you do at MSR [Microsoft Research]?

KICIMAN

02:04

Sure. I work primarily on two areas at the moment, I guess. One is causal analysis, where we work on trying to answer cause-and-effect questions from data in a wide variety of domains, kind of, building that horizontal platform. And I work a lot recently, especially with this large language model focus, on the security of AI-driven systems: how do we make sure that these AI systems that we're building are not opening up new vulnerabilities to attackers?

GEHRKE

02:35

Super interesting. And maybe we can start out even before we go more in depth into that by, you know, how did you actually end up in computer science? I learned that you grew up in Berkeley.

KICIMAN

02:45

Yeah, on average, I like to say.

GEHRKE

02:48

On average? [LAUGHTER]

KICIMAN

02:49

So I moved to the US with my parents when I was 2 years old, and we lived in El Cerrito, a small town just north of Berkeley. And then around middle school age, we moved to Piedmont, just south of Berkeley. So on average, yes, I grew up in Berkeley, and I did end up going there for college. And you asked about how I got into computer

03:11

science. When I was probably around third or fourth grade, my dad, who was a civil engineer, decided that he wanted to start a business on the side, and he loved software engineering and wanted to build software to help automate a lot of the more cumbersome design tasks in the design of steel connections, and so he wrote ... he bought a PC and brought it home and started working on his work. But then that was also my opportunity to learn what a computer was.

GEHRKE

03:49

So that was your first computer? Was it an x86?

KICIMAN

03:52

Yes, it was an IBM PC, the first x86, the one before the 286. And—it wasn't the very original PC. It did have a CGA—color graphics adapter—so we could have four colors at once.

GEHRKE

04:07

Nice.

KICIMAN

04:08

And, yeah, that's ... it came with—luckily for me, I guess—it came with a BASIC manual. So reading that manual is how I learned how to program.

GEHRKE

04:17

And this is the typical IBM white box with a monitor on top of it and a floppy drive, or how should I picture it?

KICIMAN

04:23

Yeah, two floppy drives ... GEHRKE: Two floppy drives? OK ... Two floppy drives, yeah, so you could copy from one to the other.

GEHRKE

04:28

Five and a quarter or three and a half?

KICIMAN

04:29

Five and a quarter, yeah, yeah. The loud, clickety-clack keyboard and, yeah, a nice monitor. So not the green and black; the one that could display the colors. And, yeah, had a lot of fun with programming.

GEHRKE

04:43

So what were some of the first things that you wrote?

KICIMAN

04:46

A lot of the first ones were just the examples from the book, the for loops, for example. But then after that, I started getting into some of the, you know, building, like, little mini painting tools. You know, you could move a cursor around the screen, click a button and paint to fill in a region, and then save the commands that you did to

05:06

make graphics. Eventually, that actually turned into, like, a friend and I really enjoyed playing computer games, so we had in our mind we're going to build a computer game.

GEHRKE

05:15

Who doesn't think that.

KICIMAN

05:16

Of course, right?

GEHRKE

05:17

Of course …

KICIMAN

05:18

And so we had, like, a "choose your own adventure"–style program. I think we had maybe even four or five screens you could step through, right. And he was able to get some boxes, and we printed some manuals even. We had big plans, but then we didn't know what to do, how to finish the game, how to get it out there, so ... but we had a lot of fun.

GEHRKE

05:40

Wow, that sounds amazing.

KICIMAN

05:41

Really fond memories, yeah.

GEHRKE

05:42

That sounds amazing. And then you went to Berkeley afterwards? Is that how you realized your passion, or how do you decide to study computer science?

KICIMAN

05:50

Yeah ... so from that age, I was set on computing. I think my parents were a bit of a devil's advocate. They wanted me to consider my options. So I did consider, like, mechanical engineering or industrial engineering in, like, maybe junior year of high school, but it never felt right. I went into computing, had a very smooth transition into Berkeley. They have a local program where students from the local high school can start to take college classes early.

06:18

So I'd even started taking some computer classes and then just went right into my freshman year.

GEHRKE

06:24

Sounds like a very smooth transition. Anything bumpy? Anything bumpy on the ride out there, or …?

KICIMAN

06:30

Nothing really, nothing really bumpy. I had one general engineering class that somehow got on my schedule at 8 AM freshman year.

GEHRKE

06:41

[LAUGHS] That's a tough one.

KICIMAN

06:42

That's a tough one, yeah. And so there were a few weeks I didn't attend class, and I knew there was a midterm coming up, so I show up. Because, you know, next week, there's a midterm. I better figure out what they're, what they're learning. And I come in a couple minutes late because it's, even though I'm intending to go, it's still an 8 AM class. I show up a few minutes late, and everyone is heads down writing on pieces of paper. The whole

07:07

room is quiet. And the TA gives me a packet and says, you might as well start now. "Oh no." And I'm like freaking out. Like this is, this is a bad dream. [LAUGHS] And I'm flipping through ... not only do I not know how to answer the questions; I don't understand the questions,

07:27

like the vocabulary. It's only been three weeks. How did they learn so much? And then I noticed that it's an open-book exam and I don't have my book on top of it, like ... but what I didn't notice and what became apparent in about 20 minutes … the TA clapped his hands, and said, “All right, everyone, put it down. We'll go over the answers now.” It was a practice.

GEHRKE

07:52

Oh, lucky you.

KICIMAN

07:53

Oh, my god, yes. So I did nothing but study for that exam for the next week and did fine on it.

GEHRKE

08:00

So you didn't have to drop the class or anything like that?

KICIMAN

08:02

No, no, no. I studied enough that I did reasonably, you know, reasonably well.

GEHRKE

08:07

At what point in time was it clear to you that you wanted to do a PhD or that you wanted to continue your studies?

KICIMAN

08:13

I tried to explore a lot during my undergrad, so I did go off to industry for a summer internship. Super fun.

GEHRKE

08:24

Where did you, where did you work?

KICIMAN

08:25

It was Netscape.

GEHRKE

08:26

Oh Netscape.

KICIMAN

08:27

And it was a joint project with IBM.

GEHRKE

08:29

Which year was that in?

KICIMAN

08:30

This would have been '90, around '93.

GEHRKE

08:33

’93 … OK, so the very early days of Netscape, actually.

KICIMAN

08:36

Yeah, yeah. They were building Netscape Navigator 4, and the project I was on was Netscape Navigator for OS/2.

GEHRKE

08:42

OK.

KICIMAN

08:43

IBM's OS/2 had come out and was doing poorly against NT, and they wanted to raise its profile. And this team of 20 people were really just focused on getting this out there. And so I always thought of, you know—and I was an OS/2 user already, which is how I got onto that project.

GEHRKE

09:03

OK … And how was the culture there, or ...?

KICIMAN

09:04

The culture, it's what you would think of as a startup culture. You know, they gave out all their meals. There was lots of fun events. You know, dentists came into the parking lot like once a month or something like that.

GEHRKE

09:17

Dentist?

KICIMAN

09:18

There was, like, a yeah, it was, yeah, you know, everyone's working too much at the office, so the company wanted to make things easy.

GEHRKE

09:25

That sounds great.

KICIMAN

09:26

But the next summer then, I did a research internship, a research assistantship, at Berkeley. I worked with Randy Katz and Eric Brewer and got into, you know, trying to understand cellphone networks and what they were thinking about, you know, cloud infrastructure for new cellular technologies.

GEHRKE

09:48

And Eric Brewer, was he, at that point in time, already running Inktomi, or ... ?

KICIMAN

09:52

He was already running Inktomi. Yeah, yeah, he'd already started it. I don't think it was public yet at the time, but maybe getting there.

GEHRKE

09:59

OK. Well, this was right at the beginning when, like, all the, you know, cloud infrastructure was defined and, you know, a lot of the basics were set. So you did this internship then in your, after your junior year, the second one?

KICIMAN

10:10

Yeah, after my junior year. It was then senior year, and it was time to apply for, you know, what's going to come after college. And I knew it … after that assistantship at Berkeley, I knew I was going to go do a PhD.

GEHRKE

10:24

So what is the thing about the internship that made you want to stay in research?

KICIMAN

10:28

Oh, it's just the ... it gave a vision of the future. Like, we were playing with, like, you know, there were people in the lab playing with video over the internet and, you know, teleconferencing, and just seeing that, it felt like you were seeing into the future and diving deep technically across the stack in a way that the industry internship

10:57

hadn't done. And so that part of it and obviously lots of particulars. You know, lots of internships do go very deep in industry, as well, but that's what struck me, is that, kind of, wanting to learn was the big driver.

GEHRKE

11:12

And what excited you about systems as compared to something that's more applications-oriented or more touching the user? I feel like systems you always have to have this, kind of, drive for infrastructure and for scale and for, you know, building the foundation as compared to, like, directly impacting the user.

KICIMAN

11:28

I think the way I think about systems today—and I can't remember what it was about systems then. I'd always done operating ... like, operating systems was one of my first upper-division courses at Berkeley and everything. So, like,

11:45

I certainly enjoyed it a lot. But the way I think about systems now—and I think I do bring systems thinking to a lot of the work I do, even in AI and responsible AI—is the way you structure software, it feels like you should be making a statement about what the underlying problem is, what is the component you should be building from an elegance or first-principles perspective. But really, it's about the people who are going to be using and building and maintaining that

12:19

system. You want to componentize it so that the teams who are going to be building the bigger thing can work independently, revise and update their software without having to coordinate every little thing. I think that's where that systems thinking comes in for me, is what's the right abstraction that's going to decouple folks from each other.

GEHRKE

12:40

That's a really great analogy because the way it was once told to me was that systems is really about discovering the beauty in large software. Because once you touch the user, you, sort of, have to do whatever is necessary to, you know, make the user happy. But in the foundations, you should have simplicity; you should have ease; you should have elegance. Is that how you think about it?

KICIMAN

12:59

I do think about those aspects, but it's for a purpose. You know, you want the elegance and the simplicity so that you can have, you know, one team working on Layer 1 of the stack, another team working on Layer 2 of the stack, and you don't want them to have to talk to each other every 10 minutes when they're making any change to any line of code, right. And so thinking about, what is the more fundamental layer of abstraction that

13:23

lets these people work on separate problems? That's what's important to me. And, of course, like, that then interplays with people's interests and expertise. And as people's expertise evolves, that might mean that that has implications for the design of your system.

GEHRKE

13:41

And so you're, OK, you're an undergrad. You have done this research experience; you now apply. So now you go to grad school. Do you do anything fun between your undergrad and grad school?

KICIMAN

13:49

No, I went straight in.

GEHRKE

13:50

Right straight in?

KICIMAN

13:51

Right straight in. I did my PhD at Stanford. So I went, you know, a little way to school.

GEHRKE

13:56

To a rival school, isn't it? Isn't it a big rival school?

KICIMAN

13:59

To a rival school. Well, the undergrad school wins. I think that's the general rule of thumb. But I did continue working with folks at Berkeley. So my adviser was also from Berkeley and so ...

GEHRKE

14:09

Who was your adviser?

KICIMAN

14:10

My adviser was Armando Fox, …

GEHRKE

14:11

OK, yeah. Mm-hmm.

KICIMAN

14:13

… and we had a ...

GEHRKE

14:14

Recovery-oriented computing? KICIMAN: Yes, exactly. Recovery-oriented computing. And the other person on the recovery-oriented computing project ... Dave Patterson ...

KICIMAN

14:24

... was Dave Patterson, yeah.

GEHRKE

14:25

So it was really a true, sort of, Stanford-Berkeley joint project in a way?

KICIMAN

14:28

Yes, yeah. And that was my PhD. The work I did then was the first work to apply machine learning to the problem of fault detection and diagnosis in large-scale systems. I worked with two large companies—one of them was Amazon; one of them was anonymous—to test out these ideas in more realistic settings. And then I did a lot of open-source work with J2EE to demonstrate how you can trace the behavior of a system and build up models of its behavior and detect anomalies.

15:01

Funnily enough, I know this is going to sound a little alien to us now maybe in today's world: Dave and Armando would not let me use the phrase "artificial intelligence" anywhere in my thesis because they were worried I would not be able to get a job.

GEHRKE

15:18

I see. Because that was, sort of, one of ... I mean, AI goes through these hype cycles and then, you know, the winters again, and so this was one of the winter times?

KICIMAN

15:27

This was definitely a wintertime. I was able to use the phrase "machine learning" in the body of the thesis, but I had to make up something about statistical monitoring for the title.

GEHRKE

15:39

So what is the actual final title of your thesis, if you remember it?

KICIMAN

15:42

"Statistical monitoring for fault detection and diagnosis in large-scale internet services" or something like that.

GEHRKE

15:49

So you replaced AI with statistical modeling and then everything [turned out all right]?

KICIMAN

15:52

Yes, yeah. Everything ... then it didn't sound too hype-y.

GEHRKE

15:55

And then after your PhD, you went straight to MSR, is that right?

KICIMAN

16:01

Yeah. I mean, so here I'm coming out of my PhD with a focus on academic-style research for large-scale systems. Kind of boxed myself in a little bit. No university has a large-scale internet service, and most large-scale internet service companies don't have research arms. So Microsoft Research was actually the perfect fit for this work. And when I got here, I started diving in and actually expanding a little bit and thinking about what are the

16:28

end-to-end reliability issues with our services. So assume that the back end is running well. What else could go wrong that's going to get in the way of the user? So I had one project going on, wide area network reliability with David Maltz, and one project ...

GEHRKE

16:44

Who is now CVP in Azure.

KICIMAN

16:45

Who's now, yeah, leading Azure network—the head of Azure networking. And one project on how we can monitor the behavior of our JavaScript applications that were just starting to become big. Like around then is when, you know, the first 10,000-line, 100,000-line-of-code JavaScript applications [were] appearing, and we had no idea whether they were actually running correctly, right? They're running on someone else's browser and someone else's operating system. We didn't know.

GEHRKE

17:15

A big one at that point in time, I think was Gmail, right? This was, sort of, a really big one. But did we have any big ones in Microsoft?

KICIMAN

17:20

Gmail was the first big one in the industry.

GEHRKE

17:22

Hotmail, was it also Java, based in JavaScript?

KICIMAN

17:23

Hotmail was not initially JavaScript based. The biggest one at that time was our maps. Not Bing maps, but whatever we called it.

GEHRKE

17:33

MSN maps, or ...

KICIMAN

17:35

Probably something like that, yeah, yeah.

GEHRKE

17:37

I see. And so you applied your techniques to that code base and tried to find a lot of bugs?

KICIMAN

17:41

Yeah, this project was—and this was about data gathering, right, so I'm still thinking about it from the perspective of how do I analyze data to tell me what's going on. We had data for the wide area network, but these web applications, we didn't have any. So I'm, like, I'm going to build this infrastructure, collect the data, so that in a couple years, I can analyze it. And so what I wrote was a proxy that sat on the side of the IAS server and just

18:08

dynamically instrumented all the JavaScript that got shipped out. And the idea was that no one user was going to pay the cost of the instrumentation, but everyone would pay a little small percentage, and then you could collect it in the back end to get the full complete picture.

GEHRKE

18:25

Right. It's so interesting because, I mean, in those days, right, you still thought maybe in terms of years and so on, right. I mean, you've said, well, I instrumented, then maybe in a year, I have some data. And today it happens that I instrument, and tomorrow I have enough data to make a decision on an A/B test and so on, right. It was a very different time, right. And also, it was probably a defining time

18:42

for Microsoft because we moved into online services, right. We moved into large-scale internet services. So it must have been exciting to be in the middle of all of this.

KICIMAN

18:49

It really was. I mean, there was a lot of change happening both inside Microsoft and outside Microsoft. That's when ... soon after this is when social networking started to become big,

19:02

right. You started seeing Facebook and Twitter show up, and search became a bigger deal for Microsoft when we started investing in Windows Live and then Bing, and that's actually ... my manager, Yi-Min Wang, actually joined up with Harry Shum to create the Internet Services Research Center with the specific focus of helping

19:29

Bing. And so that also shifted my focus a little bit and so had me looking more at some of the social data that would, kind of, take my trajectory on a little bit further.

GEHRKE

19:40

Right. I mean, so you're unique in that, you know, people very often, they come in here and, you know, they're specialists in systems, and they branch out within systems a little bit and, you know, of course, move with time. Maybe now they do, you know, AI infrastructure. But you have really moved quite a bit, right. I mean, you did your PhD on systems … I mean, systems and AI really, the way I understand it. Then you

20:01

worked here a little bit more on systems in wide area and large-scale systems. But then, you know, you really became also an expert in causality and looked at, sort of, the social side. And now you, of course, have started to move very deeply into LLMs. So rather than talking about the topics itself, how do you decide? How do you make these decisions? How do you ... you know, you're a world expert on x, and how do you, in some sense, throw it all away and go to y? Do you decide

20:28

one day, "I'm interested in y"? Do you, sort of, shift over time a little bit? How do you do it?

KICIMAN

20:32

I've done it, I think, two or maybe three times, depending on if you count now, and some transitions have gone better than others. I think my transition from systems to social data and computational social science, it was driven by a project that we did for search at the time. Shuo Chen, another researcher here at Microsoft Research, built a web application that lets you give very concrete feedback back to Windows Live. You could

21:07

drag and drop the results around and say, this is what I wanted it to look like. And this made, you know, feedback much more actionable and helped really understand DSATs and where they're coming from. DSAT being dissatisfactions. And I looked at that and I was like, I want to be able to move search results around and share with my friends. And I, kind of, poked at Shuo,

21:31

you know, asked him if he would build this, and he said no. He said he's busy. So eventually, I—because I knew something about JavaScript applications—decided to just drop things and spend six months building out this application. So I built out this social search application where you could drag and drop search results around, share it with your friends, and we put it out, actually. We got it deployed as an external service. We had maybe 10,000 people kick the tires.

GEHRKE

22:00

Within Microsoft or ...?

KICIMAN

22:02

No, externally.

GEHRKE

22:03

OK.

KICIMAN

22:03

Yeah. There was a great headline that, like, Google then fast followed with a similar feature, and the headline was like, Google fast follows, basically, on Microsoft. Our PR folks were very excited about that. I say this all ... I mean, it's all history now. But certainly,

22:21

it was fun at the time. But now we're ... I'm giving this demo, this talk, about this prototype that we built and what we're learning about, you know, what's in people's way, what's friction, what do they like and not like, etc. And I'm standing up and, you know, giving this presentation, this demo, and someone says, hey could you, could you go back to, you know, go back in the browser? On the bottom right corner, it says Mike did something on this search page;

22:49

he edited some search results. Could you click on that? I want to know what he did. I'm like, OK, yeah, sure. I click on it. And [it’s like], OK, that's great. That's, that's really interesting. And this happened multiple times. Like, in a formal presentation, for someone to interrupt you and ask a personal question just out of their own curiosity, that's what showed me … that's what got me really thinking deeply about the value of this social data and, like, why is it locked up

23:17

in a very specific interface. What else could you do with this data if it's so engaging, so fascinating, that people are willing to interrupt a speaker for some totally irrelevant, basically, question? And that's when I switched to really trying to figure out what to do with social data.

GEHRKE

23:36

I see. So it was this, kind of, really personal experience of people being so excited about that social interaction on the demos that you're giving.

KICIMAN

23:45

Exactly. They cared about their friends and what their friends did, and that was super clear.

GEHRKE

23:51

So, so coming back, let's go there in a second, but coming back to the story that you told, you said you had 10,000 external users.

KICIMAN

23:57

Yeah.

GEHRKE

23:57

So I'm still, you know, also always trying to learn what we can do better because we sometimes have prototypes that are incredibly valuable. They're prototypes that have fans; they're prototypes that, you know, the fans even want to contribute. But then somehow, we get stuck in the middle; and they don't scale, and they don't become a business. What happened with that?

KICIMAN

24:16

Yeah.

GEHRKE

24:17

Also in [retrospect], ...

KICIMAN

24:19

In retrospect ...

GEHRKE

24:20

… what, what ... should we have done something different, or did it live up to its potential?

KICIMAN

24:23

I think we learned something. I think that there were a couple of things we learned. One was that, you know, every extra click that people wanted to do, you know, took the number of interactions down by, you know, an order of magnitude. So starring something and bringing it to the top, that was very popular. Dragging and dropping? Little bit less so. Dragging and

24:46

dropping from one search to a different search? So maybe I'll search for, you know, "Johannes," find your homepage, and then drag and drop it to, like, people's, you know, publications list to, like, keep an eye on or something. Like that, almost never. And people were very wary about editing the page. Like, what if I make a mistake? What if it's just, just me, like, who wants this, and I'm messing up search for the rest of the world? And it's like, no, no, it's just your

25:19

friends, like just you and your friends who are going to see this. And so we learned a lot about people's mental models and, like, what stood in the way of, you know, interactions on the web. There were lots of challenges to doing this at scale. I mean, we needed, for example, a way of tracking users. We needed a way of very quickly, within 100 milliseconds, getting information about a user's past edits to search pages into, you know, into memory

25:47

if we were going to do this for real on Windows Live. And we just didn't have the infrastructure.

GEHRKE

25:51

I see. And those problems were hard in those days.

KICIMAN

25:53

Yeah. A prototype is fine. People, you know, will handle a little bit of latency if it's a research prototype, but for everyday use, you need something more.

GEHRKE

26:05

And there was no push to try it, to land it somehow, or what ... ?

KICIMAN

26:10

There were big pushes, but the infrastructure, it was really ...

GEHRKE

26:13

I see. It was really an infrastructure problem, then?

KICIMAN

26:15

Yeah, yeah.

GEHRKE

26:16

OK. Interesting because it sounds to me like, wow, there's an exciting research problem there; now you need the infrastructure to try to make all of these things really, really fast. It's always fascinating to see, you know, where things get stuck and how they, how they proceed.

KICIMAN

26:27

Yeah, I think it'd be a lot easier to build that—from an infrastructure point of view—today. But, of course, then there's lots of other questions, like is this really what, you know, the best thing to do. Like I mentioned, Google had this fast follow feature. They also removed it afterwards, as well.

GEHRKE

26:42

OK. Yeah, hindsight is always, you know, twenty-twenty. So, OK, so you're now starting to move into social computing, right, and trying to understand more about social interactions between users. How did you end up in causality, and then how did you make the switch to LLMs? And maybe even more about this; I mean, I understand here this was, sort of, this personal story that you really saw that, you know, the audience was really asking you

27:07

about what's happening here and that, sort of, motivated you. Was it always this personal drive, or was it always others who pulled you? And how did you make these switches?

KICIMAN

27:16

I think the switch from systems into social, it was about trying to get closer to problems that really mattered to people. I really enjoy working on systems problems, but oftentimes, they feel like they're in the back end. And so I wanted something where, you know, even if I'm not the domain expert working on something, I can feel like I'm making a contribution to that problem. The transition with social data then into causality and, um, and LLMs, that was a bit

27:51

smoother. So working with social data, trying to understand what it meant and what it said about the world in aggregate, was super-fascinating problems. So much information is embedded in the digital traces that people leave behind. But it was really difficult for people to come to solid conclusions. So there was one conference I went to where almost every presentation that day gave

28:16

some fascinating insight. This is how people make friendships. This is how, you know, we're seeing, like, signs of disease spread in, you know, through real-world interactions as they're in social data. Here's how people spend their time. And then people would, and then people would close; their conclusion slide every time was, "And, of course, correlation is not causation, so anything could actually be happening." Like, that is such, that is such a bummer. Like,

28:46

beautiful theory, great understanding. You spent so much time. I feel like I got some insight. And then you pull the rug out and say, but maybe not. And I'd heard about this work on ... that there was work on causal analysis and that there were certain conditions and ways to get actual learned causal relationships from data. So that's the day I decided I'm going to go figure out what

29:09

that is and how to apply it to social data for these types of questions. And I went out, and the first work there was a collaboration with Munmun De Choudhury, faculty at Georgia Tech, looking at online traces related to mental health and suicidal ideation and trying to understand what some of the factors were in a more, in a more solid and causal fashion. And so this really became, like, this was ... this interest in computational social science really ended up

29:43

branching out into two areas. One, obviously, I'm caring about, what can we learn about the world? Part of this is, of course, thinking deeply about the implications of AI on society, like what is it going to mean that we have this data for all of these, you know, societal challenges? And then causality. So the AI and its implications on society is what led towards the work on the security of AI systems and now security of AI as it relates to large

30:10

language models. And then causality was the other branch that split off from there. Both of them really stemming from this desire to see that we have a positive impact with AI.

GEHRKE

30:21

So you mentioned that, you know, you were sitting in these talks and people are talking about the correlation, and now you finally have this new tool, which is causation. So what are some of the examples where, you know, with correlation you came out with answer A, but now causation gave you some better, some real deep insights?

KICIMAN

30:36

I haven't gone looking to refute studies, so ...

GEHRKE

30:41

I see. OK.

KICIMAN

30:42

... but there are many well-known studies in the past where people have made mistakes because they didn't account for the right confounding variables. Ronny Kohavi has a great list of these on one of his websites. But a fun one is a study that came out in the late '90s on the influence of night lights on myopia in children. So this was a big splash. I think it made it to like Newsweek or 60 Minutes and stuff, that if you have

31:11

night lights in the house, your kids are more likely to need glasses. And this was wrong.

GEHRKE

31:19

My parents told me all the time, don't read in bed, you know, with your flashlight because your eyes are going to get bad.

KICIMAN

31:24

Yes. GEHRKE: That's the story basically, right? This was, yeah, the night lights that plug in the wall.

GEHRKE

31:29

But that's the ... KICIMAN: That's the idea, the same thing. The same thing, right.

KICIMAN

31:31

And so these people analyzed a bunch of data, and they found that there was a correlation, and they said that, you know, it's a cause; you know, this is a cause. And the problem was that they didn't account for the parents' myopia. Apparently, parents who had myopia were more likely to install night lights. And then you have the genetic factor then actually causing the myopia. Very simple. But, you know, people have to replicate this study to, you know,

32:02

to realize it was a mistake. Others were things like correlations, I think, around vitamin C have been reported repeatedly and then refuted in randomized control trials. But there's many of these. Medicine, in particular, has a long history of false correlations leading people astray.

GEHRKE

32:20

Do you have a story where here at Microsoft your work in causation had a really big impact?

KICIMAN

32:24

You know, the one—it's still ongoing—but one of the ones that I'm really excited about now, and thinking also from the broader societal impact lens, is a collaboration with Ranveer Chandra and his group. So with a close collaborator at MSR India, Amit Sharma, we've developed a connection between representation learning and underlying causal

32:54

representation of the data-generating process that's driving something. So if you imagine, like, we want to learn a classifier on an object, on an image, and we want that classifier to generalize to other settings, there's lots of reasons why this can go wrong. You know, you have, you know, like a classic example is the question of, is this picture showing you a camel, or is it showing

33:17

you a cow? The classifier is much more likely to look at the background, and if it's green grass, it's probably a cow. If it's sandy desert, it's probably a camel. But then you fail if you look at a camel in the zoo or a cow on a beach, right. So how do you make sure that you're looking at the real features? People have developed algorithms for these. But no algorithm actually is robust across all the different kinds of distribution shifts that people see in the real world. Some

33:48

algorithms work on these kinds of distribution shifts. Some algorithms work on those kinds of distribution shifts. And it was a bit of an interesting, I think, puzzle as to why. And so we realized that these distribution shifts, if you look at them from a causal perspective, you can see that the algorithms are actually imposing different statistical independence constraints. And you can read those statistical independence constraints off of a causal graph.

34:13

And the reason that some algorithms worked well in some settings was that the underlying causal graph implied a different set of statistical independence constraints in that setting. And so that algorithm was the right one for that setting. If you have a different causal graph with different statistical independence constraints, the other algorithm was better. And so now you

34:34

can see that no one algorithm is going to work well across all of them. So we built an adaptive algorithm that looks at the causal graph, picks the right statistical independencies, and applies them, and now what we're doing with this algorithm is we're applying it to satellite imagery to help us build a more generalizable, more robust model of carbon in farm fields so we can remotely sense and predict what the carbon level is in a field. And so, the early results ...

GEHRKE

35:08

And that's important for what?

KICIMAN

35:09

And so this is important because soil is seen as a very promising method for sequestering carbon for a climate change perspective. And it's also the more carbon there is … the higher your soil carbon, usually the healthier the soil is, as well. It's able to absorb more water, so less flooding; your crops are more productive because of the microbial growth that's happening. And so people want to adopt policies and methods that increase the soil carbon in the fields for all of

35:44

these reasons. But measuring soil carbon is really intensive. You have to go sample it, take it off to a lab, and it's too expensive for people to do regularly. And so if we can develop remote-sensing methods that are able to take a satellite image and, you know, really robustly predict what the

36:03

real soil carbon measurement would be, that's really game changing. That's something that, you know, will help us evaluate policies and whether they're working; help us evaluate, you know, what the right practices should be for a particular field. So I'm really excited about that.

GEHRKE

36:19

That's really exciting. You'd mentioned when we talked before that you'd benefited in your career from several good mentors. How do you think about mentoring, and what are the ways that you benefited from it? And how do you, you know, live that now in your daily life as you're a mentor now to the next generation?

KICIMAN

36:40

Yeah, the way I look at all the people—and there's so many—who have, you know, given me a hand and advice and, you know, along the way, I often find I pick up on some attributes of my mentors, of a particular mentor, and find that it's

37:10

something that I want to emulate. So recognizing, you know, everyone is complicated and no one is perfect, but, you know, there's so many ways that, you know, individuals get things right and trying to understand what it is that they're doing right and how I can try and repeat that for,

37:26

like, you said, the next generation, I think, is really, really important. It's like one story, for example, around 2008, while I was still working on large-scale internet services, I was going around the company to, kind of, get a sense of, you know, what's the current state

37:45

of the reliability of our services and how we architect them and run them. And so I was talking to developers and architects and Ops folks around the company, and James Hamilton was a great mentor at that moment, helping me to connect, helping suggest questions that I might ask.

GEHRKE

38:03

So he was working on SQL Server reliability, right, at that point in time or on Windows reliability?

KICIMAN

38:08

He was already starting to move over into datacenter reliability. I think at the time, right before he moved over to the research side of things, I think he was one of the heads of the, of our enterprise email businesses, and then he came over to research to focus on, I think, datacenters in general. And, yeah, and he just donated so much of his time. He was so generous with, you know,

38:37

reviewing this large report that I was writing and just helping me out with insights. That struck me as, like ... he's a very busy person. He's doing all this stuff, and he's spending, you know, I sent him an email with, you know, 15 pages, and he responds with feedback within a couple

38:55

of hours every morning. That was astonishing to me, especially in hindsight, and so … but that kind of generosity of time and trying to help direct people's work in a way that's going to be most impactful for what they want to achieve, that's something I try and emulate today.

GEHRKE

39:14

So, so, you know, you've benefited from a lot of great mentors and you said you're now also a mentor to others. Do you have any last piece of advice for any of our listeners?

KICIMAN

39:23

I think it's really important for people to find passion and joy in the work that they do and, at some point, do the work for the work's sake. I think this will drive you through the challenges that you'll inevitably face with any sort of project and give you the persistence that you need to really have the impact that you want to have.

GEHRKE

39:47

Well, thanks for that advice. And thanks for being in What's Your Story, Emre.

KICIMAN

39:51

Thanks very much, Johannes. Great to be here.

39:53

[MUSIC]

KICIMAN

39:54

To learn more about Emre or to see photos of Emre as a child in California, visit aka.ms/ResearcherStories.

40:04

[MUSIC FADES]

Transcript source: Provided by creator in RSS feed: download file

Episode description

Transcript