168  |  Highlights from IEEE VIS'22 with Tamara Munzner - podcast episode cover

168  |  Highlights from IEEE VIS'22 with Tamara Munzner

Nov 21, 202257 minEp. 168
--:--
--:--
Listen in podcast apps:

Episode description

168  |  Highlights from IEEE VIS'22 with Tamara Munzner

Transcript

Oklahoma University has this astounding history of science program and these archives. There was this part where he went into the climate controlled archives and brought out the books and there was this room full of books where we had to like, of course not touch, but look at these amazing source materials. Hi, everyone. Music.

Welcome to a new episode of Data Stories. My name is Enrico Bertini and I am a professor at Northeastern University in Boston where I do research and teach data visualization. That's right. And I'm Mo Steffaner. I'm an independent designer of data visualization. In fact, I work as a self-employed truth and beauty operator out of my office here in the countryside in the north of Germany.

And on this podcast, we talk about data visualization, analysis, and generally the role data plays in our lives. And usually we do that together with a guest we invite on the show. And that's the same today. But before we start, just a quick note, our podcast is listener supported, so there are no ads. But that also means if you do enjoy the show, you might consider supporting us.

You can do that with the recurring payments on patreon.com slash data stories, or you can also send us one time donations on paypal.me slash data stories. Okay, that's right. So we can get started. The topic for today is a classic of data stories. I think we missed only once or twice. We're going to do highlights from the iTripoli Viz conference. And we have a friend of the show helping us to do that. We have Tamara Manziner with us. Itamara.

Hi, thanks so much for having me. Great to see you again. Tamara, can you briefly introduce yourself and then we can dive right into the highlights from Viz? Sure. I'm Tamara Munzner and I'm a professor at the University of British Columbia Computer Science Department and I do visualization of many flavors. I am a long-term fan of IEEE Viz, first started going in 1991, so it's been a few years. And I'm a fan of data stories as well.

Yes, we have a lot of things in common. Perfect match for the episode. So I thought maybe I should spend a couple of words explaining what iTripoli's. Is for any of our listeners who are not particularly familiar with it. So I try to believe this is a conference, international conference that happens every year. Since, as you said, Tamara, from the mid-90s or so. And no, early 90s, I guess. The first one is 1990. It depends how you kind. Yeah, yes.

And it tends to be pretty academic. It's the main academic conference. But I like to say that it's not only academic. There are a lot of events and things that happen in there, in there, including industry tracks, and there's an art section. So over the years, it's been expanding in a way that is definitely, it has an academic flavor but it's not only academic. We tend to talk a lot about the papers that get published and presented there, but there are many other events going on.

And I think today Tamara is going to talk about some of these events. There are panels, there are workshops, there are keynotes, there are awards, and many, many other things. As I said, there is an art track, an industry track. I also just want to say that summarizing what happened at Viz is basically impossible. We were going to talk about a tiny, tiny fraction of what happened there this year, but yeah, I think it's better than nothing and we will add links to videos and papers and documents.

So definitely go to our blog post and check it out for if you want to dive right into if you want to see the details of what we're going to talk about. So Tamara, thanks so much for helping us with this. And I think we want to go through a number of different sections. And the first one is about keynotes and capstones. Yeah. So, you know, there's the obvious ones, which were the main viz keynote and capstone.

And there were these sort of interesting parallels. So the keynote was Marty Hurst of Berkeley, and her talk was called Show It or Tell It. And it had this great framing context where, you know, using these large language models is now a thing. And so her framing context was that there were these dolly images as a bunch of the slide backgrounds, and she included the prompts that made them. And her whole point was that language should be considered co-equal with the visualization parts.

It was all about this sort of text and images working together towards a goal. And you know, I very much liked it. It's exactly my jam. It's, you know, starting out with references to that Scott McCloud understanding comics, which still holds up to the test of, time, you know, all the way up to some book that I clearly have to read because it's called, Proust and the Squid with the tagline of we were never born to read. So I'm like, oh my,

God, I have to read that. And I thought it was very nicely done. She really talked about language as a user interface for visualization and that there may be ways of using it that, instead of coding or a GUI, what if you can just speak. A theme that others also talked about is the speed of improvement of natural language methods with these huge models these days with GPT and Dolly.

She really wanted to draw the attention of the Viz community to the way in which text, and visualization can work together. I thought it was very well done and very engaging. Yeah, I think it's a huge, huge topic right now, both from, as you say, text understanding by machines, but also text generation. And then we have the whole topic of annotation and accessibility where text plays another huge role. So I think it's probably one of the big...

Big topics right now. And I guess she was a great match. She's a legend in information retrieval and text natural language processing, right? So yeah. And she's been involved in viz for eons and had one of the very early courses in InfoViz. Yeah, she's always been a little bit across. She also, she's also done some really fundamental work in viz. So it's really interesting. But yeah, I agree there's so much going on in NLP that is almost like having an effect everywhere,

including in visualization. So I'm curious to see what is going to happen next. She was citing a few scary statistics that like a fifth of people in some studies wanted to hide the graphics so they can read the text better. I think these are inconvenient truths for the database community, but we have to acknowledge that, I guess.

Yeah, there's all kinds of great stuff about what happens when people do see both, and how much text you want, and thinking about a dial of a little bit of text, a lot of text. Much to learn. So definitely a recommendation to watch this keynote. It's 45 minutes or whatever it is well spent. It's up on YouTube. What if we leap all the way from the keynote to the capstone? Those are the two bookends of the conference. Oh, it was so cool. So this guy, Cary McGruder.

Who is this historian of science. So it turns out that although I did not know it before I got there, that Oklahoma University has this astounding history of science program and these archives. And he dug into the archives to come up with this talk on Galileo's telescope discoveries, thinking visually in the history of science. The capstone didn't just start with the capstone. It started the night before where they had an open house at OU and I was like, okay, yeah, maybe I'll see some demos.

But there was this part where he went into the climate controlled archives and brought out the books and there was this room full of books where we had to like, of course, not touch, but look at these amazing source materials. And so seeing these physical books that you hear about them, but to actually see them with your own eyes and see the scale of some of them, some were huge in terms of fold-out things, that was pretty magical.

And the talk itself, I really did love. I thought it was one of the top three capstones I've seen at Biz in 30 years. I feel like sometimes people just sort of, you know, they give their same old same old talk. And sometimes people actually kind of rise to the occasion and really try to think about the intersection between what they're expert in and what the Vis community might care about.

And I feel like this was one of those where he had this sort of ternary triangular coordinate system where he thought about what it means to have naturalistic visual evidence, like. What Galileo actually drew as logs when he looked at the moons of Jupiter through the lens versus didactic purposes, which is what he put in the Starry Messenger book that's so famous, which is not a naturalistic picture at all.

It's actually trying to specifically show things, and he talked about shadings of craters and the line of the sun versus evidential. And then he talked about how 40 years later, Hevalius came up with this full Selena Graffia book really trying to chart exactly what all is on the moon in terms of its craters. And I really liked this versus just a plain visual aid, which is just trying to help people see and think.

And so I really appreciated the way he sort of unfolded these deeply historical things in a way that felt totally relevant for today. So I loved it. other people go to. Nice. And then what else? I think we have another keynote you wanted to talk about? Yeah, there was, I mean, a lot of the workshops have keynotes. And there were several that I liked quite a bit. You know, just in passing, Jean-Daniel Fiquette had one at the visual data analysis on scalability with progressive data science.

And Sheila Carpendale had one, and Viz Guys on best practices considered harmful. But the one I want to go deeper into is the one from Kaseley Fiesler, which is at Believe, which is one of my very favorite workshops. It's all about methods and I'm a method geek. And her talk was called Data is People, Research Ethics Beyond Human Subjects. And it was just kind of great. Best title so far. Yeah. And it had some great taglines, like Twitter as the fruit fly of this kind of research,

it's so easy for researchers to study. And it really went deep into, you know, there's all these sort of facile things you can say like, well, ethics is what the ethics boards makes you think about. No, wrong. That's not what ethics is. It goes deeper. And you know, if the data is public, everything's fine. No, wrong. Look at all these ways in which things can go to hell. So it was just a bunch of like, you might think it's so simple. No,

is why it's more complex. I think one of the ideas that I hadn't quite heard before that I really liked is the idea of ethical debt, which is this analogy to technical debt where, you sort of don't deal and suddenly later it comes and whaps you upside the head. And so I liked that concept as a way to think about some of the ethical things of if you don't do them early, they will come to bite you later. Yeah. And I like the way she closed She said, critique the things you love. I love that.

I love that. Yeah. Unfortunately, I missed it. But yeah, I want to see the recording. Yeah, definitely worth it. I think the idea that data is people is something that has been kind of like discussed some time before. I think it's something that database struggles with, right? the idea that you may have a visualization with lots of data points and it may be trivialized,

but then every single data point is a person. That's so common and, and it's an interesting ethical dilemma there, Quite common in visualization, I would say. Yeah, and for ethical depth, I never heard of it, but it sounds, yeah, I can certainly relate when I think about projects I've been working on in the past. If you don't think about these things early enough, then when you realize that there is a problem, it's typically too late.

Yeah, her exact definition was the implied cost of not considering social or ethical implications or harms now, assuming you can fix them after you find out what they are. Yeah. And of course, sometimes not too easy. Yeah, yeah, yeah, yeah. Totally. And did she give any specific guidelines? Or I don't know, does she have anything, that people can read to consider for their projects, something like that?

Yeah, I mean, the whole talk was kind of full of lots of, especially a lot of concrete examples of what can go wrong when you assume the easy, simple thing. And I mean, she wrapped up with saying, questions to keep at the top of your mind are, what are the potential harms or negative impacts of the research? What can I do to mitigate these potential harms? Is there a benefit to the research and who does it benefit? And I think these are all useful questions. But basically this is another,

take the 45 minutes of your life and just sit down and watch the video. It'll be well spent. I think a lot of what made the talk good was just the sheer number of concrete examples, that help people see how assuming the simple thing is probably not going to work out so well. Sounds great. And there were even more interesting keynotes as well that folks can check out for all the other workshops.

For people that don't know, Viz has the first two days, the Sunday and Monday, are like eight parallel tracks. There's workshops and tutorials and symposia. So there's actually a, there's not just the main keynote. there's like eight more keynotes. So many of them are great. Yeah, because hosted event can have their own keynotes. And so why don't we talk about awards?

There are also awards, there are many, and I think we want to focus on the 10 year, test of time award, which is an award that is given to papers that have been highly influential and highly cited in the last 10 years, correct? And there was one from VAST, and that's a paper that I've cited many times, which I like quite a bit, on enterprise data analysis and visualization and interview study. And it's Sean Condell and Andreas Papken, Joe Hellerstein and Jeff Hare.

And it's basically like, let's go talk to people that are actually doing data analysis and viz in sort of business-y settings and find out what there is to know. And this, from a methods point of view, these interview studies where you are doing semi-structured interviews and you do qualitative data analysis to try to figure out what are the themes and what could you learn, they often precede many building visualization projects.

And there was a time in Viz when it would have been hard to publish those when the only kind of work you could do is like, here's my new technique. And doing these kinds of studies to figure out, but who is doing what now? What are they doing? the pain points, what needs to be done, I think is super valuable. And so, you know, of course they went off and they ended up founding Trifacta, which

is a nice success story spin-off. But you know, based on the kinds of insights you get when you go talk to people about what they're doing. So great paper, very happy to see you get the test of time. I agree. I think this paper has been very influential. for myself, kind of like give me the courage to work on similar studies and say, oh, we can do interview studies in our community and they can also be very successful.

I agree with you. Interviews are so relevant if they're done well and if they're done with the right audience at the right time. I think now it's more common to see this type of studies. I agree because learning what kind of problems people have in practice or even the way people think about a problem in practice. I think it's really useful to inform. How we frame research, the practical research that we do on top of these problems or issues.

Or when we learn more about what kind of problems people have or how they think about it. So I agree, very influential and very useful. And then we have another one that is one of your papers, Tamara, right? Yeah, that was pretty fantastic. Michael Saddlemyer and Mariah Meyer and I all did a paper on design study methodology at InfoViz 10 years ago And that one got the test of time this year So, You know in addition to sort of being personally gratifying like yay somebody cared.

It's I feel like it's kind of a testament of like because this idea of design studies as a way to say, Let's do really really problem-driven research and try to you know valorize actually solving problems of people and and wait how do you do that like and you, know this paper came out of we did it wrong many times and I mean we also did it right many times but trying to figure out what you know if you're just sort of striding through the wilderness trying to figure out how do I even do this work

how do I do it successfully how do I write a paper about it we ended up, coming up with a process that at least we found we were able to consistently get good results with. And I think one of the most useful parts of the paper is the giant list of 30-plus pitfalls of, don't do this, here's how to avoid it. And sometimes we did it wrong, sometimes we reviewed papers who did it wrong, sometimes we read published papers who did it wrong.

It wrong. And so, you know, all the ways to get stuck, we wanted to help route people around those quagmires. And the nice thing is, there's just been this incredible explosion of work where there's a lot of really strong design studies these days, where, you know, and this is not the only methodology paper for it. There's a bunch of papers that, you know, continued on, some of from us, some from others, but the community has really

done a whole lot of super strong work in these so-called design studies. And that's the part that's made us the happiest. Maybe for people who are not familiar with what a design study is or like how it's different from maybe describing a technique or whatever. Can you quickly summarize what the gist of it is? Yeah, so the gist of it is that you are doing, you're working on a real world problem and you're trying to solve that problem of real people with real data sets.

So it's typically the standard approach is that it's very collaborative. Like you just can't do it on your own by theorizing. You have to actually talk to the people. Usually lots of trying to understand in the terms of another paper on the nested model, you're really trying to understand their abstractions. Take what they do and cast that into more abstract language so that then you can think about how you could build a visualization

system to solve it. So a lot of work on understanding their task and data at an abstract level. And then it's inevitable that to actually build something, it's got to be iterative. You've got to to have rapid prototyping, really try many versions, check back in with the people. So there's often a kind of iterative development and deployment stage. And then often to try to figure out, so, hey, wait, did this work? You often really try to get people to use it in their daily work practice.

It's a thing where you build it and then you watch what happens. Sometimes they use it the way you thought. Often they use it not the way you thought. Rather than trying to like run an empirical study that's a controlled experiment in a lab where you've got a little stopwatch telling people do this task, instead you're like, here, here is this tool, let me see what happens.

And they're often verified through these case studies of like, well, first they tried it on something they already understood. And look, it took them 15 minutes to figure out something that maybe would have taken them 10 hours without this tool. And then they used it to do different things that they had not already anticipated.

Often it's describing the kinds of things they could do with this visualization tool, that they would not have been able to do without it or that would have taken them much, much longer. So, you don't have to invent new techniques to do design studies. Yeah, that's really interesting because that's obviously close to the practitioners work, right? And like figuring out what people need and then building it and then seeing if it worked.

And yeah, I think traditionally people might have said, well, that's just anecdotal if just do it once for a specific client and tailor something to their needs, you know, where's the big principle here? But I think your paper describes how you can generalize from single case explorations like this, right? Yeah, a big part of the paper is to say... You know what makes it research as opposed to simply excellent practice is the part where you reflect and you,

You say did you confirm existing guidelines? Did you refute them? Do you need to propose refinements of them? Do you need to propose entirely new? Ideas on how to do this work. And so it's that what did you learn? That is different than what the research world already knew about in terms of guidelines is what I think is the part that makes it the research. But it's definitely true that a lot of practitioners very much follow these methods.

Yeah. Yeah, but it's nice. It has so many facets to it. Like to a practitioner, it could be sort of the paper is almost like a guideline of how you could document your projects like, the how-to or this is how the project was done. It provides a chapter structure basically almost to that. So it has a lot of nice facets. Well deserved. Thank you. That's the final word. Yeah, I think one thing that I like about that paper is that it seems like, so you

have a set of recommendations or pitfalls to avoid, right? And I like the way you describe it at the beginning saying, look, the three of us together, we have done like X numbers of these projects. So we have learned something about what all the mistakes that we have done. And I like this kind of premise. I think it's an interesting way to frame a paper.

I just want to quickly mention about the awards. I wanted to highlight the fact that a lifetime achievement award has been awarded to Colin Ware, and this makes me really happy, because we don't talk about the work of Colin much, but I think he had a huge impact on visualization. So I'm really happy that he got this award. I think he really deserves it, especially not only for the excellent research work that he has done over the years, but of course for his super influential book.

I think pretty much everyone here has gone through the book maybe multiple times. And yeah, I think it's really deserved. Yeah, and if you haven't, you should. Information visualization, perception for design, read it. It's still amazing. Yeah. I think it's what, the fourth or fifth edition now? Something. And I think it's the only book I can remember reading at the beginning of my PhD that now, is what, I don't know, many years ago. I don't want to know.

And I'm still sifting through the same paper, the new edition, same book. So it's pretty remarkable. I think there are many gems scattered throughout the book. Yes. So I think we can maybe start talking about a few papers. Of course, this is a tiny, tiny portion of them. So Tamara, you wanted to highlight a few sessions maybe and maybe, specific papers within the sessions? Yeah. I mean, what I really want to do is talk about all the papers, but I can't because I haven't even read them all.

And I will encourage for folks who did register online, there's this fabulous browse by serendipity where you can keep hitting the show me a new serendipitous set. And it's just a way to really get a sense of the breadth of the papers because there's hundreds and we are only covering a tiny few. Having said that, there was this session that made me super excited, which was on transforming tabular data and grammars.

And there were several papers that I really liked in there. There was Animated Vega Light. So that was from Arvind Satyaranian's group. So first author Jonathan Zong and Josh Pollock and Dylan Wooten. And they had this really nice kind of framing abstraction where you think about time both as a way to visually encode data as an encoding channel, but also as an event stream and really reflecting on that dual nature.

And so I found that to be a sort of interesting paper that, you know, for those who don't know VegaLite is sort of built on top of D3 and there have been a number of projects originally coming out of Jeff Harris Group and Arvind was a former PhD student with Jeff and is now an assistant professor at MIT. And so continuing this line of work of how it is that you can sort of think systematically about these things I think is really one of the strengths of visualization.

Another one that I liked in the very same session was from Andrew McNutt on No One Grammar to Rule the Mall, a survey of static visualization DSLs, where DSL is the acronym for domain-specific languages. And it included, I mean, there's the paper itself which sort of goes deep into a nice analysis of all these grammars, but he's got this great accompanying site, you know, visjsonDSLs.netify.app, where you can actually start playing around with these hundreds of systems yourself.

And so it's been more and more these days the thing in visualization that when there's a survey paper, people build these interactive faceted browsing sites underneath it that, lets you take all the data that they painstakingly gathered and browse it yourself for your purposes. And it's such a great thing that it's a thing in the field. This was not the only paper. There was someone like ML. There were several of these survey papers that let you browse underneath.

Also he had a zine. He made a zine for this. I think somewhere there's also a survey of survey. Yeah, and there is even a survey of surveys. And Andrew actually made physical paper zines, one of which I have, which I love. He had a paper last year on zines and he didn't get to hand them out because it was virtual. So he brought that paper zine and this one.

The other one that I really liked and that I noticed in my gigantic tweet stream has gotten just a huge amount of retweets and attention is this Hightailor system, interactive transformation and visualization for hierarchical tabular data. And so that's Goh Jung Lee, Runfen Li, Zhe Cheng Wang, Chi Harold Liu, Min Liu, and Guoran Wang. And what they did in this paper is they really, like, you know, everyone in their pet cat

things about tabular data, right? Like tables, tables, tables. But they specifically thought about the fact that there's this hierarchical structure in tabular data in ways that you might really want to make use of and ponder. And so I liked their sort of system for thinking systematically about these hierarchical tables. So another great one.

Nice. In general, I'm happy to see when there are new papers in this that talk about more about how to specify something rather than how to visualize something. I think understandably in visualization, we have a really strong focus on how do you visualize this, right? But we tend to forget that being able to specify what you want to visualize and how is at least equally important.

And I think in general, I would love to see more research in that direction, because effectively somebody has to specify what they want to see and maybe also how they want to see it. And I think this is a huge... This is an area where we can make a lot of improvements, and being it specific... And it's something that gets reinvented all the time. All the time, yeah, exactly. Hundreds of grammars. Yes. This will keep going. I think that's the crazy part.

But something like Vega-Lite really, or also ggplot really changed how everything is much more systematic. It seems to me like in describing visualizations than even five years ago. And I think the whole grammar trend has been super helpful in creating order. So, how we describe things. Yeah, totally. It's gonna be interesting how this develops. If there will be like one underlying, well, the paper says there is no one grammar.

No one grammar to rule them all. The paper says, yeah, no one to rule them all. Come on, haven't you read Lord of the Wings? Maybe a handful of them, maybe, maybe. Who knows? Three grammars to bind them. Yeah, the real challenge is that you need something that is simple enough for a human, to create meaningful and expressive specifications, but at the same time it's readable by the machine. So I think that's what is really challenging there.

Yeah. Who knows? Maybe it's going to be all natural language at some point. No, I'm not going to hold my breath. No. I don't think it will be. And if so, which language? Which language, right? Esperanto. There's more than English. Esperanto. There's your next year's paper, Esperanto for Whiz. There you go. Timing. Okay. So what's next? Well, there was another one.

Speaking of people that do these kind of great websites underlying papers, you know, there's been this trend to have, you know, more and more supplemental materials. It's not just this like 10 page PDF. Benjamin Bach is always doing these great papers like this. And so this year's favorite dashboard design patterns, it was Benjamin and Ewan Freeman and Alfie Abdul-Rahman, Chetai Cherkay, Seiful Khan, Yilei Fan, and Min Chen. DashboardDesignPatterns.github.io.

It's this thoughtful content incredibly beautifully and thoroughly presented. So they went through and they did all this analysis of dashboard designs and they have these 23 patterns that are hierarchically grouped into content and composition and these seven genres. And I think this is a paper that's gonna be super useful for a whole lot of practitioners and also advances the state of the art of treating dashboards as a research question.

I feel like five years ago, dashboards were like the poor cousin and it was like, ah, dashboards. Don't ever say the word dashboard, right? The D word. And you know, more and more they're just, they're getting studied as things in and of themselves. You know, there was the dashboard conspiracy paper a few years ago. Yeah. What do we talk about when we talk about dashboards? And so this continues in that vein, and I thought it was a great paper.

Yeah, and the site is, as you say, beautifully done and immediately useful. I can attest to that. Like, because it breaks down all the decisions you can make or all the variations you could do on a dashboard. And that's immediately helpful when you want to design one to think about. How many of these strategies to support or do I just take one of each in each category or are there different modes for it what what are the pros and cons of different approaches and as you.

Say I like how they structure the content also so they do something that good dashboards also do and which they describe is to provide multiple like zoom levels and there's the cheat sheet and there's the quick summary and then there's a page for each important topic and then there's the full paper, And then there's the full table with all the dashboards they have been looked at and every single thing they have considered. And I think that's exemplary, really, really nice.

Yeah, maybe old papers should be like that. It should be a new format. Who knows? I think it's a really good genre of paper to say, like, well, here's a domain of things that people do. Let's try and describe it and do some analysis on it. I think it's just awesome. I have to say that, yeah, I want to spend a few good words about the BIS community.

I think that's been a strong trend over the years, trying to make research not only meaningful also for, say, practitioners or meaningful in the real world, but also translating the research in a way that people who are not writing papers can actually consume it. And I think it's been getting better and better over the years. And I'm really happy to see this trend going. I think it's very, very relevant. And yeah, who knows? Maybe data stories played a little role over there.

I like to think that's the case. Or maybe also that medium publication that you have also really wanted. Exactly. I think it's a good trend. It's somewhat to blame for that trend.

Yeah. Yeah, you know, there's a lot of open science and open data and there's even making the PDFs available publicly and what's, Changed dramatically in recent years is it went from only a few PDFs on archive to like, you know, Three-quarters of them on our archive in part because it's allowed and encouraged and described in the submission materials of how to do that, You know, there's more and more open data where people say, you know, not just the study results,

But you know the the study stimuli let's document everything and code, analysis scripts. I think this trend is really super heartening. Yeah. Another thing that comes to mind, Tamara, I think that's true also for your book, that. It's clearly coming as an academic angle, but it has huge impact also for practitioners. I felt that maybe it's a good time to talk about your book series.

I think it's another, I don't know how to say, something that is happening that I think can be described as things going between academia and... In the real world, so to speak, or practitioners, right? Maybe you want to spend a couple of words about that. I think that would be useful. Yeah, I would love to. So there's the book series. Alberto, Cairo, and I are co-editors, of the AK Peters visualization series, which is like a CRC route ledge imprint.

And every year we have either a physical or a virtual table at Viz. So this year it was virtual. But hot off the presses are a few new ones coming out. So Neil Richards has a book, Questions in DataViz.

He's somebody who comes out of this Heblot community, very much a practitioner, a lovely book of questions with some tentative answers and then even more questions, unpacking, all kinds of stuff about when to break the rules, when to go beyond standard chart types, what there is in life beyond a bar chart. And then there's this other one that's gonna be coming out within a month, I believe. Making with data. It's all about data physicalization. So physical designing

craft in a data-driven world. It's this kind of all-star cast between the editors and the authors. So Samuel Huron and Till Nagel and Nora Ulberg and Wes Willett, and then a bazillion people that contributed. And even the book cover itself is a data physicalization of the structure of the book. So very meta because we all love meta. So super excited about that. I can't wait to get.

My physical copy. And Nigel Holmes, we have a book from on joyful infographics. And he's someone who definitely comes out of the practitioner community, did a lot of extremely high impact work. And it's all about humor and joy. And Jen Christiansen is having a book coming out soon, Building Science Graphics. She's been very active in things like Scientific American

and many, many other places. And Ali Torban actually did the covers to both Jen's book and Neil's book in a lovely circular way that the community all has ties to each other. We've got even more that are becoming out, but those are the ones that are coming out just in the next two to three months. So we are very, I mean, we want books from everyone.

We want academics, we want practitioners, we want textbooks, we want beautiful books, we want everything in between, we want edited collections, we want single author manifestos. So if you want to write a book, we want to talk to you. Data sketches was in the series. That was a Shirley Wu and Nadi Bremer utterly fantastic one. Mobile data visualization came out recently, another all-star cast that grew out of a dog to a workshop. So I think books are great and we need more. I totally agree.

Especially from such great people, you know. Richard Brath, Visualizing with Text, that's another favorite. Yeah, yeah, yeah. There was a visual analytics textbook from Christian Taminsky and Heidi Schuman, Interactive Visual Data Analysis. So... Maybe we need a separate episode just to talk about all these books. Okay, so what is next, Tamara? I got lost. Yeah.

I mean, there's so many papers. We don't have time to really get into them, but I will just point out that probably particularly interested practitioners are making stuff go fast. So there was this Vega fusion paper from Nicholas Tristan, John Meason, Dom Moritz, and also Plotly resampler making large time series go fast and Plotly from Jonas and Jerwin van der Donk and Ernie L. de Prost and Sophie van Hook. So, like, just, you know, my jaw kind of dropped at the demos.

I'm like, wait, that was really fast. That was a lot of data. How did you get some second response for that data set? Holy moly. So that was cool. And, well, I should, I mean, we also had some papers. I, of course, think they're fabulous and lovely. We worked on them. And actually both of them were originally published through through TVCG and then presented at Viz because that's a channel that a lot of work gets more, and more coming in.

There's a, the Viz proceedings are a special issue of TVCG, but then also any TVCG papers on Viz that want to be can be presented at IEEE Viz. So there's a lovely relationship between journal and conference. So TVCG is the journal, right? Transactions on Visualization and Computer Graphics. Yeah, it's the journal. So we had one which was sort of fun with deep graph models or rather graph neural networks, GNNs. So our paper is called CORGi and it has various dog-related menu items.

And corresponding a graph to its embedding is where we came up with that acronym. And so it's all about trying to see the... Basically, it's... You know how when you go on a car trip with the kids in the back of the seat and they keep saying, Are we there yet? Are we there yet? It's answering the are we there yet question for people trying to train these ML models, where you're like, has the model actually learned everything it's supposed to?

Let's check, visually. So understanding the sort of topological structure of the original graph that supposedly is getting reflected in this, you know, high dimensional embedding in a latent space that you could then use to understand, some kind of topology space and this latent feature space beats on the graph itself. How do you actually see what the heck is going on?

Because of course, as you might guess, it's complicated and you know, any of these super high dimensional spaces that you're doing ML on, it really helps to try to see what's going on. So that was a fun project. That was first author is Zipeng Liu. He's now assistant professor at Beihang in Beijing, China. Yang Wang, who started out at Uber and is now at Facebook. Bernard Yergin, who's now an assistant professor at Zurich and me. Yeah. So that was a fun to present.

And we had another one which is called Gevit Rec. So data reconnaissance through recommendation using a domain-specific visualization prevalence design space. First on that was Anakrishan, who's now at Tableau Research, and Shana Fisher, who was an undergrad at UBC at the time, Jen Gardy, who at the time was at UBC School of Population and Public Health and BC Center Center for Disease Control, but is now working for the Gates Foundation on malaria stuff, and me.

Ana's got a great blog post on the Tableau Engineering blog that's the super easy to access and think about version, and then of course there's the paper itself. We were happy to have those both presented to the world. Gevitrec is all about how is it that you could recommend automatically? Let's say someone flings a pile of datasets in your lap, what is going on? You don't want to spend hours, weeks, days, months kind of carefully designing a visualization.

You just want to get a quick sense of what's there. So this world of automatic recommendation where given datasets, you try to figure out, well, how are they connected up? How could we help you see where they intersect and overlap? And then after you get that quick look, you could decide where you want to go deeper.

But having the system that's built in R that just lets you type, show me what is there, and have that be something that you get in a second or two is super useful for this thing we call data reconnaissance of what is going on in these datasets. I like the term data reconnaissance. It really communicates what it is about. And yeah, there's been quite a few. Quite a few papers lately about this idea of recommending plots.

And I agree that it's especially, it seems to be especially useful at the beginning. If you want to make sense of, you just want to know what's in here exactly. Right. And yeah, I like that. I think it's really useful. What is next? So there were 10 zillion other papers, but we should probably talk about some of the the other stuff as well, like maybe a few panels. Yeah. So one of the ones I liked a lot, so Vis in Practice is one of the workshops.

That really tries to connect practitioners and researchers and often bring in a bunch of the folks who kind of straddle that line between. And so they had a whole panel on integrating research and products. And I actually, unfortunately didn't make it to the whole panel, but I came in like right as Richard Brath was giving this really fun talk about all the places where what you think you know what you're doing, if you just read the research, it just falls short.

You know, things about 3D versus 2D, how many annotations could you cram into a single display. Just a lot of really nice concrete examples of where he found he had to sort of adapt and refine and not just take for granted the advice you get from the research community. So I liked that talk. Yeah, I think Richard has been in industry for many years, I guess, a lot in finance, or something like that. So yes, a lot of really... So at what used to be called Oculus, but it's now Uncharted. Yeah.

Is the consultancy. And then as a researcher, I cared a lot about this panel on how is it that we review papers at IEEEV is and realizing that that might be a bit of insider pool for the non-researchers. But for those who are interested in the review process, which I do care about, because I'll be one of the overall papers chairs next year. So I suddenly care quite a bit about what's going on on the purview and thinking a lot about what various folks think.

Um, lots of people, Cody Dunn, Alex Lex, Torsten Miller, Abita Otley, Melanie Tori were the speakers. It was organized by Bob Laramie and Petra Eisenberg and Tobias Eisenberg. Um, I'll be only, so I think if you are interested in those things, it's definitely worth, um, seeing some of that.

The part I'm going to tell you is that Melanie Tori had the most fantastic extended metaphor about what kind of a dog you should adopt as a pet compared to what kind of paper types you are reviewing as a viz reviewer. So cute pictures of dogs, pointing out that you want the best of both worlds, that a lab doodle is both friendly and doesn't make you sneeze, and how does that connect up to the different kinds of hybrid paper types.

It shouldn't just be a monolithic do this and nothing else. don't have to be like pure red dogs. They can be hybrids, they can be mutts. So I like that one. I like that. I sympathize because sometimes my students come to me and say, is this a technique paper or is a system paper? Is this? And I'm like, I don't know. Let's just do the work. But they also like the fact that there's some guidance. I think it helps

helps kind of like reduce their anxiety. Fitting in a specific box, especially at the beginning, is useful. Yeah. I mean, guidance is always a trade-off. There's always this question of if you have no idea what to do, a path through the wilderness is great. Once you get more deep into things, you might find the guidance to be a bit more of a straight jacket. So I think guidance depends a lot on where the field is at.

Types were, you know, in 2003, I'm like, we need paper types, because it shouldn't all just be technique papers. But it's 2022. You know, we already have these very fine-grained idea of contributions in the call for papers now. It's not just monolithic types. But it's still, it's good to know what they are. So you can break the rules deliberately knowing

what the rules are. That was the thing Sheila talked about in her keynote at at Vizcom, which is you should know what the rules are so you know how and when to break them. How does that work practically? Like how do... Is there like any debate before the Vist Conference, like among the organizers about if there will be like... A change in what formats will be or should be part of the conference? Is there any curation

on that sense going on? Or do you more react to, oh, we seem to be getting more papers that do, something we haven't seen before. How do we deal with that? Or is that not even an issue? AMT – Well, there was this giant spasm of introspection because the revised committee, which I was on for a while and then actually chaired for a while, was all about, should we integrate what used to be InfoViz, Visual Analytics, VAST, and SciViz together.

So, as part of that whole revised work and of deciding exactly how to integrate them, we actually went super deep into this question of, well, what kinds of papers should there be and how could you define the contributions? So, we did actually think quite about that. We had a whole consultative process and we, you know, got the community together.

So, as a community, we already had a whole lot of discussion about that kind of a few years ago, pre-pandemic, and then there is a group whose job it is to sort of analyze that data and look at it and see whether there are new ideas we should keep track of.

We're probably not going to change instantly because we want to have a bit of stability, but we sort of tried to build into the new system this idea that you do go off and sync and analyze and try to understand where the field's going and be out ahead of it and not lag behind like a dinosaur. So the short answer is, yup, lots of thinking. As you say, if certain papers just don't fit certain categories, they might just not land as well.

Well, this is why there's now this really fine-grained idea of contribution. It's not just like, here's six paper types. It's what kinds of things do you contribute? Did you propose a new technique? Did you propose a new abstraction? Are you providing a data set? So a lot of things that were sort of difficult to get academic credit for, you know, like providing open benchmark data sets or something you can really explicitly claim these days. So there is a lot more nuance there now.

Mm-hmm. Yeah, that sounds like a good approach to think about the outcomes or the contribution, as you say, and not the packaging, basically. Yeah. I think you reminded me that that's the first episode we record about this, that they don't have to explain what is is size is vast and info-based and it feels really good. Yep. It's unified now, all kinds of work. There's now these multiple different areas. Most are not exactly any of the old categories, but mashups of many.

Okay. I think you wanted to briefly talk about AltViz as well, Tamara? Yeah. AltViz is this new workshop. For those who know CHI, there's AltCHI, which is sort, of a bunch of unusual papers that don't fit the standard academic mold, speaking of paper, types. And so a lot of what they're looking for is what's provocative, what's interesting. You know, last year's ELF is had this great version of like heavy metal logos, which was,

remarkably scholarly despite being hilarious. And so this year, my favorite ELF is paper is from Joe Wood and it's called Beyond the Walled Garden, a visual essay in five chapters. And it sort of fits into this theme that actually Marty Hurst had in the keynote, which is about, What do these new models allow you to do? I do love this sort of extended metaphor of the walled garden, where if you work in the walled garden, you feel protected and you know your place and you belong.

But what could be beyond that? The early work in visualization with McKinley and Bertin was all about the atomization of design, reductionism, and really thinking about this question of visual variables. It, but what about expressivity and how this new technology could allow us to do things and how to even, you know, many people don't think of that as being visualization. They

think about it as being graphics, but maybe not viz. Joe has actually made me rethink my hatred for Chernoff faces because of a point in that essay, which is interesting. I'm still not in love with, but at least I'm pondering. But his sort of central idea of Our walled garden remains home, but those distant lands beckon. It was a sort of poetic and evocative visual essay, which is very different from a standard research paper.

This reminds me more of Moritz's Eurovis keynote, where he was also talking about. What we call this vis or not. I think that was inspiring. What is actually vis? Yeah, I think that's... Yeah, and also this anti-reductionism, minimalism, and it goes more for. Sensual and metaphoric and semantic maximalism that is often a bit, Undefined, but really interesting, right? And exactly, as you say, maybe it's outside of what we should even be looking at, or it's right at the core, you know, who knows?

Who knows, yeah. So that makes it exciting, I guess. Yeah. That reminds me also of this effective, this taxonomy of effective data visualization. Oh, affective, yeah, it was one of the best papers. Yeah, yeah. Not effective, but affective. There was another cross connection to even look at, what are non-analytic goals of data visualizations that are more on the effective or emotional level, and how could we characterize those, what's a good language to talk about those.

I thought that was also very timely on that whole arc. And that was a paper from several folks, including Lais Padilla. Who's that you seem to have said? Okay, so I couldn't identify any specific trend this year.

Any other closing thoughts or maybe? Yeah, I think the goofiest two things I saw in Oklahoma City were that there's this human-made canal and if you keep walking and you go way out and get a little bit lost, there's this part where there's giant horse sculptures that go diving into the canal and then coming back out. And they're like three times life size And it's quite surreal. What is going on with the giant cavalcade of horses?

And then they have a skeleton museum with hundreds and hundreds of animal skeletons, which I had fun visiting on Friday, skeletonmuseum.com. It's well worth your time if you're in Oklahoma City. Yes. Okay, well, I think that's it for today. Thanks so much, Tamara. Thanks for helping us go through lots of exciting new research. Thanks for having me. It's always fun to have you on. So yeah, we'll try and collect all the PDFs and all the talks. Some of the talks should be up on YouTube.

Yeah. Hopefully you'll find a way to follow up on all that. Make sure to go to the show notes. Show notes that would be plenty of links over there. Okay, thank you. All right, and see everyone in Melbourne. Viz 2023, Melbourne, Australia. Yes, we forgot to mention that. Yep, see you in Melbourne next year. Bye-bye. All right, bye-bye. Hey folks, thanks for listening to Data Stories again. Before you leave, a few last notes.

This show is crowdfunded and you can support us on Patreon at patreon.com slash data stories, where we publish monthly previews of upcoming episodes for our supporters. Or you can also send us a one-time donation via PayPal at paypal.me.data.stories. Or as a free way to support the show, if you can spend a couple of minutes rating us on iTunes, that would be very helpful as well. And here's some information on the many ways you can get news directly from us.

We are on Twitter, Facebook and Instagram, so follow us there for the latest updates. We have also a Slack channel where you can chat with us directly. And to sign up, go to our own page at datastory.es and there you'll find a button at the bottom of the page. And there you can also subscribe to our email newsletter if you want to get news directly into your inbox and be notified whenever we publish a new episode. That's right, and we love to get in touch with our listeners.

So let us know if you want to suggest a way to improve the show or know any amazing people you want us to invite, or even have any project you want us to talk about. Yeah, absolutely. Don't hesitate to get in touch. Just send us an email at mail at datastory.es. That's all for now. I'll see you next time and thanks for listening to Data Stories. Music.

Transcript source: Provided by creator in RSS feed: download file