#151 Certified! It works on my machine - podcast episode cover

#151 Certified! It works on my machine

Oct 10, 201926 minEp. 151
--:--
--:--
Listen in podcast apps:

Episode description

Topics covered in this episode:
See the full show notes for this episode on the website at pythonbytes.fm/151

Transcript

Hello and welcome to Python Bytes, where we deliver Python news and headlines directly to your earbuds. This is episode 151, recorded October 2nd, 2019. I'm Michael Kennedy. And I'm Brian Okken. And this episode is brought to you by DigitalOcean. Check them out at pythonbytes.fm/digitalocean. Get $50 credit for new users. More on that later. Brian, how are you doing? I'm really good. Super. We are on PDX Python Meetup West, PDX West Python Meetup, whatever you're calling it. It's Eve, aren't we?

Yeah, it's like just right around the corner. Yeah. Although it'll probably already have happened by the time this comes out. Yeah, it may. It's going to be tight. So, well, I'm sure it went wonderfully. So, got all the speakers lined up and everything. I'm really excited to have that as part of our community around here. Who knows? Maybe we can put it online somehow, but no plans for that,

it sounds like. You know, I hear a lot about containers. And when I first heard of Docker Docker and things like that, I thought Docker was, I thought they had created containers, right? Like they had come up with this idea of this other kind of isolation relative to, say, a virtual machine. And looking into it, it turns out that's not the only option. And they actually didn't even come up with it, right? They just bundled it up and packaged it nicely.

I think what Docker has done is very cool, but I have yet to find a use case where I needed it so far. And there are other options. And one of our friends of the show, Matt Lehman, has written an article called Python Alternative to Docker. The article uses a tool called Shiv from LinkedIn, which is a hilarious name. We actually did mention it briefly on episode 114, but I didn't know much about it. And so this is kind of a neat, neat article. It talks about

that Shiv uses a zip app. And I think we've talked about that before that you can, and I think there's a few tools that utilize that. Right. Things like pecs and stuff. Yeah. Yeah. You can zip up a Python application with all its dependencies. And I don't think that at that point it includes Python, does it? I don't think so. And that was actually super disappointing to me that it didn't just deliver

a thing that ran. You still have to have Python, but it's like, you don't have to install the app. As long as you have Python, you can run it, was my understanding. This article is talking about, and this totally makes sense. You've got a continuous integration build pipeline where you're making changes and maybe you're updating dependencies, but on the deployment machine, you're not deploying, you don't have to update Python all the time.

You're just updating the thing that you package. And that's, it's kind of a neat thing about Shiv is you're, all your dependencies, maybe you're updating dependencies, or maybe you're updating your own application or fixing bugs or whatever that can all be put together and tested as one deliverable unit. And then that just that can be updated to a server machine, but then all the steps of like, okay, that sounds in concept. That sounds great. But what are all the steps I have

to go through to actually get that done? And Matt has done that. He just basically, walked through all the steps to try to get that done. And I thought it was really great. He does talk about packaging a little bit because you kind of have to have to talk about packaging when you're talking about deployment. So this is kind of fun. He does do a shout out to,

if you didn't use Shiv or Docker, what would you do? What were, what are all the little steps you'd have to do, which just makes it sound like something that you'd either hire somebody else to do? Like that's not, I'm not doing that. Yeah. But he also does shout out to platform as a service, things like Heroku and Python anywhere. And I think that's cool that there's other options. Like if you

want to just throw money at it, you just use one of these. Yeah. And these actually aren't that expensive. If you're trying to just save time, that's a fine trade-off. But if you want to do this, this is a neat idea. Also Docker is a big hammer. I mean, a lot of people love it and that's great, but bring up with a lot of people that you want to try to put Docker, deploy with Docker. And if it's a thing like this, like you're just deploying an application, that might be overkill.

It seems like it's a middle ground, right? Like I'm packaging up not just my Python code, but my Python dependencies and a lot of the setup around that. You still have to have Python there, but beyond that, right? It's like not really much to it. Yeah. If you need to depend on other things, like I need Redis running on this port and I need Postgres like that. And then I also want this other thing in Linux doing something like all of a sudden, then you're like, you've outgrown,

you know, bundling Python more cleverly. But if you don't need that, it sounds like a pretty cool way to do it. Yeah. Like in this example, the example that Matt gives is a Django application that uses a Gunicorn or G Unicorn so that all those dependencies are bundled together. It's not like a special kind of Django app with a huge backend that's necessary. But although, I mean, Django uses a database, so I'm not sure. Yeah. Yeah. Interesting. While you're talking about Docker,

I'm a fan of Docker in general, but kind of like you, I don't do a ton with it day to day. I just, the stuff I do is just not that complicated. But I recently read an article in ZDNet, that site actually still exists apparently. And it actually says, this is surprising to me, sort of kind of like I was surprised when GitHub got acquired. I realized afterwards, like, oh, they're actually financially in deep trouble. And maybe this is good rather than just going away.

The article says Docker is in deep trouble. A leaked internal memo, the Docker CEO reveals that the troubled company is looking for more money. And they talk about like pushing through with perseverance despite lack of clarity on the future and stuff like that. So anyway, just something to think about. Definitely. And I read that and I forgot that Docker was a company. Yeah, exactly. And I don't even know if it matters if Docker as a company goes away. I mean,

there are parts of Docker that are important, like the registry and stuff like that. But you could totally use Docker if you just had the tools. But still, it's, I don't know, it's just always interesting to see these, how this is playing out and something you want to consider if you're going to bet your project on it, right? Yeah, definitely. So I have a bit of a theme for this week as well. And it has to do with helping scientists do Python stuff a little bit better. So this first thing I want

to talk about is an article that's kind of a how-to from this woman named Anna Noagrodsky. I'm going to go with that. Sorry if I messed up your name, Anna. And it comes to us by a recommendation from a listener named Jason Thomas. Thanks, Jason. And the title is How to Support Open Source Software and Stay Sane. Nice. And this is referring to scientists who develop libraries like NumPy, or some form of scikit-learn or

something like that. And then they release it to the world. And then all of a sudden, they've got all this software development work and, you know, 100 GitHub issues. They got a review and they're like, I was just trying to give back a little bit. What just happened, right? So the article has some interesting quotes. So I'm going to do a little bit of quoting from it because it's, I think it's, it's pretty

revealing. Anyway, it says, the premise of the idol is that releasing lab-built open source software involves mountains of unforeseen work for the developers, right? Like you think you're done when you release it and it kind of starts. But it opens up saying, on the 10th of April, astrophysicists announced that they had captured the first ever image of a black hole. This was exhilarating news, but none of the giddy headlines other than us mentioned that the image would not have been

possible without open source software. And the image was created by Matplotlib, a Python library for graphing data, as well as other components of the open source Python ecosystem. Here's where it gets interesting. Just five days later, the US National Science Foundation, the NSF, rejected a grant proposal to support that ecosystem, saying that the software lacked sufficient impact. What even? What even, right? So like I said, it's kind of a how-to. It talks about where you're, as a scientist,

if you're creating software like Matplotlib or something, where do you go to get funding? Where have people been turned down? What has worked? And so on. It also talks a lot about using some software software development, software engineering techniques for having successful open source projects that grow without killing you. It's pretty good. It talks about the challenges of not knowing, as a scientist, how you might not have fluency with Git. You might not have a continuous integration.

Unit testing might not be a thing that you are super skilled in. And it mentions places like the eScience Institute, the University of Washington, the software carpentry workshops, places like that, where folks can go get that training. And then also talks a little bit about when do you keep your software going? And when do you decide like, it's obsolete, we're not going to work on it anymore?

What else are we going to go build? So if you're a scientist out there, or you work with scientists, you want to help them get rolling with Python and releasing their projects in a way that they'll stay sane, you know, check that out. I think it's pretty good. It's a continuous problem. Yeah, it is. And some of the recommendations are things like, make sure you have continuous

integration, make sure you have version control, make sure you have testing. And one that I really liked was that you should focus on code readability over peak performance. No, definitely. Always. Right. Yeah. Because you're gonna have to fix bugs. And you had a friend who had this saying that was really interesting that debugging software is harder than writing software, generally speaking. So if you

write software at the edge of the complexity that you can handle, you probably can debug it. So you need to step it back a notch, take it a little bit simpler. And then when it goes wrong, you could still deal with it. Yeah, I like that saying. So before we get to the next one, though, let me tell you about DigitalOcean. So DigitalOcean, all of our infrastructure runs there. When you get your MP3s that you hear our voices on, those came through a whole bunch of stuff that's happening

over at DigitalOcean. And they have two new services that I've been telling you all about for a little while. But when I keep telling you about it, because they're good. If you want to use MySQL or Redis, they now have those as a service, right? There's go flip a switch and turn them on. That's in addition to Postgres as a service as well. So definitely coming along. Speaking of the software, as a service over at the DigitalOcean side, I'm still a fan of fire me up a VM and install some

stuff and let it roar. But you know, that's all good. Check them out at pythonbytes.fm slash DigitalOcean, get $50 credit for a new user. And they're doing good work. So we can recommend them. Nice. So you found another license in addition to MIT or GNU? Yes. I don't know what to make of this. So I wanted to bring it up just to hear your thoughts as well. So there's a license. I can't remember where I ran across it. But some I think probably Twitter or

something. I spend way too much time on Twitter. It's a license called the Hippocratic license, like the doctors have the Hippocratic. Right, right. First promise to do no harm, things like that. Yeah. I mean, I know it's way more than that, but that's all average people remember, like me. So this is interesting. It derives from the MIT license and it adds a restriction. And the restriction reads, Interesting. Yeah.

That's an interesting thing. And I like, I kind of like it. My first reaction is, wow, that's great. You can say, I'm going to let everybody use my, use this software, but I don't want it to be used to harm people. Completely fair. But I don't know if anybody, the people that are like intentionally harming people, they don't care.

Yeah. Do you really think people that are in this are really going to go, oh, yeah, we really can't use this software to guide our, like, you know, does this cover military? I don't know. Also, a lot of this has to do with the human wellbeing. That's up for debate and like a perspective, right? Like, well, we had to start the war to help people or, you know, but obviously, if you're on the other side of that, like that, you're not helping us. This is not helping. Thank

you for not, for blowing us up. It's really hard to interpret. And I think you're right that it is very challenging because the folks who are most likely to abuse it, it's like the hackers are like, yeah, well, I was going to steal that, steal that $10 million from the bank, but it's got the wrong license. Yeah. It does still send a message that the person who works in the project cares and it's not necessarily a negative, even if it's not super enforceable. It's enforceable, right? I mean,

I was sure that things about like the MIT license people break. I don't even know what you've got to do to break it, but you probably could. I actually think it's an interesting thing to have around. I could definitely see people coming up with like different sorts of alternatives to the MIT license also, like, like not necessarily

maybe this plus other stuff. Like I know people that intentionally would never work for a military, work for the military or work for a government or a military contractor because of they don't want to. And I think it'd be completely legitimate for somebody to say, to have a license that said, my open source software doesn't want to, I don't want that to work for these people either. So, but I haven't seen that.

Yeah. Not putting any judgment on those things like working for Boeing or whatever. If you wanted a license like that, I think you could, you'd have to be more explicit. It's a slippery slope, but the things that I kind of agree with, I'm like, oh yeah, well, that's completely reasonable. Yes, I know. What if people like release software that said, this is okay as long as you don't live in Oregon because I don't like Oregonians. That would be lame, but people can do that.

It would definitely be lame. Yeah. That's why you set up a server in California and you just remote in. I'm vacationing permanently in Oregon. Exactly. Exactly. It's an interesting find. Yeah. So the next one, like I said, I'm on a bit of a theme here. And this next one is also about helping scientists do more, but more computational people. So this one is a real Python how to on why and how to make the switch from MATLAB to Python. Oh, this is great.

Yeah. It's really, really detailed, like standard real Python stuff. I'm not entirely sure how many pages it is, but it's, it's quite a few. So it's a good thorough, it's almost like a mini book on this as Dan and his crew are used to doing. So it basically compares the philosophy of MATLAB to Python, closed source to open source, things like that. Then it walks people through how to set up Python. And then just, you use these computational libraries in MATLAB. Here's your Python ones.

You did these operations like graphing. Here's how you do it in Python. And so if you're out there working on a team that uses MATLAB, and I know there are a ton of them out there. I've worked with some even around here, a great team who was moving from MATLAB to Python, who I worked with for a few days, like helping, helping to speed that transition and things like that. There's a ton of these science like groups, data science, not quite maybe data science, but on their way to that world,

who I think would really benefit from that. Like I long ago even use MATLAB for a little bit of work at this research lab that spun off into a company and did like wavelet decomposition of real time eye measurements. 250 hertz per second, you know, 250 samples a second. Like wavelet decomposition is like a way harder Fourier transform, but it's funky stuff. Anyway, to do that, it was fine. But the little toolbox, I think it's called, the wavelet toolbox that you got was $2,000 per user.

Yeah. That's a lot of money when you can pip install something that also does that. So there's MATLAB used a lot in our company as well, or there used to be. I don't know what it is now, but similar sorts of things. There's engineering. I'm more familiar with communication algorithms using FFTs and other communication transforms being mocked up in MATLAB before they go to like

ASICs or something. Yeah, this is very common. And I'm really glad to see more and more people moving to Python from MATLAB. Yeah, absolutely. I think it's just another one of those slices of the technology ecosystem where tons of people are leaving that space and coming over to Python because the great computational support that's developing things like JupyterLab and all the NumPy, SciPy type things.

Also in this article, I learned that there apparently is this thing called GNU Octave, which is a free and open source clone of MATLAB that looks pretty good, actually. Oh, interesting. Yeah. Not that we want to encourage people to just go do that. They should come do Python, but this is still pretty interesting. I didn't know that exists. Yeah, definitely. Have we spoken about GUIs? Yeah, quite a bit. I feel like we should actually cover that sometime.

Yeah. So I almost felt guilty about covering this, but it came up in episode 143 already. But Nicholas Tollerby has got a project called PiperCard. I mean, I remember HyperCard. I never really used it back in the day, the dark ages. But PiperCard is kind of this neat little thing. And did you use HyperCard ever? No. I have some friends who did, but I've never used it.

It was a Mac only thing. And it was this thing where you kind of have a whole, it's designed as if there's a whole bunch of different cards and a GUI window was just a card idea. And then you could print whatever you wanted on the card. And then there would possibly be buttons or something on the card. And if you push one of the buttons, you go to a different card. And so you can think of an application as a deck of cards or a choose your own adventure

type game. And that's where a lot of a lot of really complicated choose your own adventure games were made on HyperCard. And so PiperCard is a similar thing just in Python. And it started out with Adafruit, but it's now gone. Nicholas has made it. You can just run it on your anywhere and you can just pip install it. And I was playing with it the other day. And at first I dismissed it. But when I had him on testing code on episode 89, and we talked about a little bit and I didn't realize

this. So one of the ideas is you have basically a simple list of what's in the card and then what card to go to next with different buttons and inputs and stuff like that. But if you, instead of saying which card to go to, you give it a function to go to, you can run any Python code as long as that function returns a string to tell you which card to go to next. So that makes it actually pretty

powerful to build up some fairly complex logic very simply. Anyway, it's fun to work with. I can even think, I was even thinking of some work business uses that I could use this for and get it mocked up really quick. That's cool. It looks interesting. It feels a little bit like a visual workflow executor, right? Like I go down this step, then I do this step, then there's a decision, and then I do this step or that step, and it's all represented by these cards. It's pretty cool.

He has an example of a Celsius to Fahrenheit converter, but also like, I mean, there's a lot of little simple conversion math stuff applications in businesses that would be a good use for it. And it's also uses that user interface thing that I always forget, but it was like, nevermind, I can't remember what it is. It's one that you don't see very many examples of. Cool. Yeah. And it looks like a nice project. Cool. You know, something we don't talk a whole lot about on this show is Node.js.

I don't know if we ever have. Have we? You know, I think the extent to which we talked about Node.js was maybe some security issue in like NPM and how it is an analogous thing over at PyPI. Maybe. Like that's the extent of it. Okay. So Node.js obviously is a pretty big server side framework for running JavaScript code on the server. Right. And you know, the story is like, Hey, if I got to write JavaScript on the front end,

I might as well write it in the back end. That's not my philosophy. Mine is more like, how do I minimize the amount of JavaScript I write? Let's not also do that on the back end. I mean, not that I hate JavaScript, but I don't want to overdo it. Anyway, there's a cool project that brings some integration between Python and Node. So imagine you're working on some project,

or you're on some team, and they're like, we've chosen Node. You're like, sigh, sad face. But you still want to use some Python, or maybe you want to use some kind of Python library, like, I don't know, NumPy, TensorFlow, whatever, something that doesn't have a JavaScript equivalent, right? Yeah. So you can use this thing called PyNode. And what it does is it's a bridge between Node.js and Python that allows you to call Python code directly from JavaScript. Oh, wow. So this is on

the server, right? Like if it was on the client side, then who knows, we'd have to use something like Sculpt or Brython, and then it would convert the Python to JavaScript. Like that's a different thing. This is the CPython interpreter with everything that Python can do on the server. But you can interact with it, call it and get return values from those functions in Node.js. You call into Python

and you get your answer back. Nice. So that's pretty cool. And the API for working with it's not too bad. You have to basically, you know, in Node.js, you require stuff to load it up. So you require PyNode. And then you go to get your PyNode, you can say start the interpreter, which will, I'm guessing,

embed CPython into the node process. And then you just call functions. The way you directly call them is kind of crummy, you say like PyNode.call, and you would say quote add, comma one, comma two to call the function add by passing one and two to it. And then you get a callback from that. I think you could do some interesting stuff by wrapping this up into a JavaScript class. So you just say, you know,

create it. And then you just start working. You just it has a dot add that takes a one and a two. And you could put like a facade that makes this much simpler. Do a little async and await to get rid of the callback potentially things like that. But yeah, it's, it's definitely got some potential there. And if you're working in Node.js space, you're like, I really wish I had Python. Well,

PyNode. It's great. I mean, there's more and more projects where they really aren't one language, you're trying to do lots of different things, or you're trying to migrate from one thing to another, use the language that works the best for wherever you're at. And similar to the last one that I covered, not trying to send people away to that thing, not to Node or to Octave or whatever. But, you know, just acknowledging, right? Like just what

you said that there are these environments where you don't always get to pick Python. Here's another environment where you can now blend these together. Yeah, there's lots of languages that are good. None of them are as good as Python, but just saying. That's right. All right. Well, those are the major topics that we're covering for the week. Do you have anything extra to cover? No. Although like putting together a meetup is more work than I expected.

I appreciate you doing that. I'm sure that it is. I did submit. There's like call for proposals out and I did submit to a couple of conferences. So we'll see. Awesome. Good luck on that. The call for proposals for PyCascades, right? And also I think main PyCon, right? Yeah. PyCon's open. Tennessee's open. A couple others are open. Super. I actually have nothing extra to share except for I have a joke. Oh, good. I didn't bring a joke.

It's not exactly a joke, but it's definitely something that's very funny. So have you ever heard the excuse that, man, this works on my machine? Yeah. It's closing a defect with cannot reproduce. Exactly. Like, dude, this is working. Like, why does this not work for you? I just, who knows what it is? It could be the version of some library that's installed. It could be like permissions. Who knows? But there is a, it works on my machine certification program that I want to tell you

about. Okay. Okay. This is a real program and it has like an honors badge. It says works on my machine. It's kind of golden. You can put that on like your wall when you get certified and there are like real steps. This is not new. This front, this is a pretty old certification, but I thought it'd be fun. We haven't talked about it before. So there's some specific steps that you got to follow. And the steps here are talking about like a compiled language. So maybe like it

slightly varies with Python, but it's still, I think it's funny. So step one is you have to compile your application, getting the latest version from any recent or any recent changes from other developers is purely optional and not required for certification. Okay. Step two is you launch the application or website that's just been compiled. Step three is you cause, you're going to like this part as a person who's into pie tests and stuff. You cause one

code path in the code that you're checking to be executed. The preferred way to do this is with ad hoc manual testing of the simplest possible case for the feature in question. You may omit this stuff if the code change was less than five lines, or if in the developer's professional opinion, the code change could not possibly result in an error. Yeah. And finally, you now check your code into version control. Congratulations. You're certified. That's nice, right? Yeah.

There's probably a lot of certified people. They just need to like claim credit. They've done this. I'm sure. Yeah. They just don't know. Well, they can now get the star and put it on their wall. Yeah. All right. Well, thanks for being here as always, Brian, and sharing all this news with everyone. Well, thank you. Bye. You bet. Bye. Thank you for listening to Python Bytes. Follow the show on Twitter via at Python Bytes. That's

Python Bytes as in B-Y-T-E-S. And get the full show notes at pythonbytes.fm. If you have a news item you want featured, just visit pythonbytes.fm and send it our way. We're always on the lookout for sharing something cool. On behalf of myself and Brian Okken, this is Michael Kennedy. Thank you for listening and sharing this podcast with your friends and colleagues.

Transcript source: Provided by creator in RSS feed: download file