
EP 449: Can Claude’s AI Agent Simplify Your Work? A Live Test Drive

Jan 29, 2025 · 41 min · Ep. 449

Episode description

Wondering if Claude's latest agentic AI is worth it? Computer Use is an agentic AI system that allows you to operate a virtual computer simply by speaking with Claude. We dive in and explain how it works.

Newsletter: Sign up for our free daily newsletter
More on this Episode: Episode Page
Join the discussion: Ask Jordan questions on Claude AI

Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineup
Website: YourEverydayAI.com
Email The Show: info@youreverydayai.com
Connect with Jordan on LinkedIn

Topics Covered in This Episode:
1. Overview of Anthropic Claude
2. How to Use Claude Computer Use
3. Critiques of Anthropic's Tools
4. Future of AI Agents

Timestamps:
00:00 AI agents essential in businesses by 2025.
04:48 Google developing AI agent 'Jarvis'; competition intensifies.
10:01 Using an API key; GitHub shares code.
11:22 Docker is a versatile containerization tool for developers.
15:36 Claude Sonnet 3.5 limits commands despite plans.
17:08 Replace placeholder with copied API key.
23:17 Demonstrating computer vision on a virtual desktop.
25:33 Claude retained information without website visit.
29:31 Experiencing repeated errors toggling between applications.
30:49 Visit youreverydayai.com, list latest 3 episodes.
35:10 Word document created with AI episode summaries.
37:12 Direct AI with simple code; needs improvement.

Keywords:
Jordan Wilson, Claude AI, language model, Everyday AI Podcast, podcast summaries, document formatting, model interaction, AI errors, AI execution challenges, API key, Docker usage, virtual desktop, Word document creation, live stream, Anthropic updates, Claude free plan, API key security, Docker installation, Service tier levels, GitHub repositories, AI in Business, Claude's updates, Google Project Jarvis, OpenAI, Microsoft, Salesforce Agentforce, Amazon Bedrock, Google Cloud's Vertex AI, AI agents, Application Programming Interfaces.

Send Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info)

Ready for ROI on GenAI? Go to youreverydayai.com/partner 

Transcript

This is the Everyday AI Show, the everyday podcast where we simplify AI and bring its power to your fingertips. Listen daily for practical advice to boost your career, business, and everyday life. Whether you know it or not, you're going to be using AI agents in your business in 2025. Let me repeat that. In 2025, you will be using AI agents at your company, especially if you live in the U.S., whether you know it or not.

All right, so you might as well get ahead, understand what's going on in the space, and stick around for the next 20-ish minutes, and you're going to see it live. And hey, if you're following along on the live stream, you can even go ahead and use an AI agent yourself. So we're going to be talking today about how Claude's agent works. It's called computer use. And like I said, give me about 20 minutes and we'll go ahead and do it together live. All right, I'm excited for this.

If you're new here, thanks for tuning in. My name is Jordan Wilson, and welcome to Everyday AI. We're a daily live stream, podcast, and free daily newsletter helping everyday people leverage generative AI and learn how you can actually use it to grow your company and your career. So maybe you're listening on the podcast. Thank you for tuning in. Please, if you haven't already, go to youreverydayai.com and sign up for the free daily newsletter. For our live stream audience, thank you for tuning in as well. We're technically debuting this live, but it is prerecorded. So if you want the daily AI news, don't worry, it's going to be in the newsletter today.

Also, if you are listening on the podcast, this is going to be a little bit more of a visual episode. So if you normally listen in your car, at the gym, walking your dog, whatever it is, thank you, number one, but this might be one where you go check out today's newsletter and watch this if you want to learn along, because I'm going to break down how to use this new computer use tool from Anthropic's Claude. I know some people were asking for this when we covered the news last week. So hey, if you ask for it, we're going to do it. All right. Without further ado, let's get straight into it.

So last week, Anthropic announced a couple of things. They announced a 3.5 Sonnet update. So it's kind of weird. They already had a 3.5 Sonnet. I kind of just call it 3.6 or, you know, maybe we refer to the old Sonnet as 3.4. But anyways, they released an update to Claude 3.5 Sonnet, which is out, and then they announced 3.5 Haiku, which should be out any day. No word yet on whether their big model, Opus, which is still on version 3, is going to get any love or any updates. We will see. But probably, from Anthropic's announcement, what got everyone talking, including us here on the Everyday AI Show, was its new computer use tool.

All right. And if you do want more on this, today's is going to be a little bit more of a demo. Right. So if you want more on what was announced and what our takes on it were, make sure to go check out episode 386. It'll be in your show notes as well. All right, so here's what this new computer use is. Well, according to Anthropic, they said it is a new capability in public beta. So it is available now. It's called computer use, and you are going to use your API key. Don't worry. You don't need to be a developer, even though it says developers can direct Claude to use computers the way people do. Anyone can, right? Everyday AI is for everyday non-technical people. So I'm going to give you the walkthrough. Don't worry.

Couldn't agree more. They're saying it is still experimental, at times cumbersome and error-prone. The thing that is most cumbersome is you can barely use it. Yes. If anyone at Anthropic is listening, and I always love it. I had some people from OpenAI reach out to me recently and say everyone here is listening to the show. Thanks. But hey, Anthropic, you should listen too, because one thing I always do here is, I literally have trained thousands of business leaders in the US on different AI tools. And Claude is very difficult. It's the limits. The usage rates, even in the API, are so limited unless you are on a higher tier. So Anthropic really needs to get their program together, if I'm being honest. Otherwise, they are going to get smoked by everyone else, because it is so hard to even use their tool and experiment and to see if it's going to work for your business.

Anyways, let's just jump a little bit into the landscape. So not only do we have the computer use tool from Anthropic, but we just got news that Google is reportedly developing its own version of this, its own computer-using agent, called Project Jarvis. We'll see if that name actually makes it to production, mainly because some other company had tried to use Jarvis in an AI product before. We've actually been using it now for, I don't know, three and a half, four years since it came out. And they had to change their name to Jasper. So we'll see if Google can actually come out with this Project Jarvis. They're not the only ones with a computer use tool. It's been rumored now for probably 10 months that OpenAI is working on agents. We saw that Microsoft should be out literally any day with its new Copilot Studio agentic AI, as well as Salesforce with its Agentforce. So that's why I said at the beginning of the show, you're going to be using AI agents whether you want to or not. Like I said, it's coming to Microsoft Windows, all right? Inside Copilot Studio. It's coming. If you are a company that uses Salesforce, which is the most dominant CRM player around, you're probably going to be using Agentforce in 2025, all right?

So, currently, it's available. So, one thing about Anthropic: even though I don't think computer use is that good, and it is extremely limited and hard to try out, hey, they shipped it. Hats off to Anthropic. They shipped, right? They didn't launch a blog post and a wait list, which sometimes Google and OpenAI kind of do. Microsoft a little bit as well. So they shipped it.

It may be too early, but I guess it's better than not shipping. But right now you can only use it via an API. So you're either using Anthropic's API, or you can access it inside of Amazon Bedrock or Google Cloud's Vertex AI platform. So yeah, this isn't something where you log on to claude.ai or download a desktop application from Claude. That's not how it works. So we're going to do it live, and you have to actually download a separate program.

You know, if you actually want to take the simplest route, this is what Anthropic recommends. All right. A couple of things. Let's first go over some terminology, because when I'm doing this live, I'm probably not going to be giving you definitions. So, all right. Especially if you're listening on the podcast, let me break down all of these terms that I'm going to be talking about. So first of all, Anthropic's API. All right. So if you don't know, an API is an application programming interface, right? With all of the large language models, you essentially have the option to either use them on the front end, right? So you can go to chatgpt.com or claude.ai and chat with the chatbot. Or you can use them on the back end, and this is generally what developers will do. All right.

When you're fine-tuning a model, bringing your company's data in with RAG, or building third-party applications, for the most part you're using either OpenAI's API, Claude's API, Google Gemini's API, et cetera. So with all these programs, you can go get an API key and build an application. So that's kind of what we're going to be doing here. So the only thing you need to know about this is you can still use Claude's free plan. So even if you're not on their paid plan (I forget if it's $20 or $30; I think it's $20 a month), you can still use their API, but it is pay-as-you-go. So you will pay for actual usage. All right.

So the other thing when we're talking about API keys: I'm going to show you mine on the screen, and then when I'm done, I'm going to delete it, right? So you always want to keep your API key secret, because if anyone gets it, they can essentially use it in their programs and run up a bill on you until you notice it. All right.

When you are doing this, you do need to have a credit card on file for the Claude API, and you need to preload some money in there. If you want to do this and follow along, don't worry, I'm probably going to blab on for another three minutes before we do this live. So if you want to, go ahead and do it now. You can log into the back end of the Claude system, and I'm going to go ahead and announce the address: it's console.anthropic.com. All right. And here's the bad part.

It is so limited. All right. So I would say most people, you're going to be on tier one. So essentially, Claude gives you a tier depending on how much you use their product. And I'd say even people that I know in the AI space, even they're all on tier one. So unless your company has already been using Anthropic's API for a very long time, or if you're an individual, you're probably going to have to start on tier one, which is extremely limited. You can barely do anything in this computer use unless you're on a higher tier. All right. So that's terminology number one: using your Anthropic API key.

Terminology number two: GitHub and a GitHub repo. All right. So GitHub is essentially a website where developers, but even everyday people, can go ahead and store and share the code that they write and projects that they work on. Then other people can go download them, fork them, and edit and add to them. So a GitHub repository, or a repo, is kind of like a folder with a bunch of code in it, and it holds all of the files and information related to a specific project. And the reason why this is important, and we're going to be doing this all live as well: Anthropic released computer use in a GitHub repo. So everyone can go on there and they can kind of build off it. But that's how you use it. So like I said, you don't access this via a website, right? You don't go to claude.ai. You have to actually grab the information from Anthropic's GitHub repo. All right. So again, if you don't already know, think of that as a place where everyone kind of stores their code, and you can go download it. People can modify it, not there, but they can create versions of it, right? So think of it like templates, right?

So people put their code up there. You can go look at the code, improve it. If you don't know GitHub, it's a great place. And then last but not least, Docker. So Docker is a program. It's one of the ways that Anthropic recommends that you use the computer use. So Docker is essentially a tool that helps developers package their applications and everything they need to run them into a small portable container that can work anywhere.

Kind of the way I like to describe Docker is it's a closed environment where you can essentially run programs. So it's kind of similar to Terminal, right? If you use Terminal on a Mac, Docker kind of has its own terminal. It just helps you run everything in a contained way, and it can work from anywhere that you download it. All right, so...

Enough chitchat, y'all. Let's do this live. I said we would try to do this in 20-ish minutes. Let's see if that actually works. All right. So this is going to be a fun one here, y'all. So let's go ahead. I'm going to share my screen. Let's see if we can get this going. All right. Now, if you're listening on the podcast, I'm going to try to do my best to walk you through exactly what I just told you. So first, we are now on Anthropic's GitHub repo, okay?

Essentially, like I said, there's a lot of different files in here, and there's different ways that you can use this. The way that we're going to do it is, if you scroll down here, it's going to give you some directions, and it essentially gives you this little piece of code.

Okay, so I'm going to copy and paste this code. I'm actually putting it up in my browser so it's all going to be flat, and then I'm going to... oh, let me share that tab. There we go. All right. So now I have it pasted in here. Okay. I have a Word doc. So all I did is I went on Anthropic's GitHub repo. I went down here. It gives you the code, essentially, that you need to run. Again, I'm simplifying this. And so what we're going to do next is, there is a placeholder where you're going to need to enter your own API key, okay? And then we're going to combine those two things, and then we're going to put them in Docker and run it, okay?

So let's look at Docker here quick. You need to go to Docker and download Docker for your desktop. I've already downloaded this, but you're going to go ahead and download this, whether you're on a Mac with an Intel chip or a Mac with an Apple chip. I'm on a Mac with an Apple chip. I think my computer has like an M1 or M2. So you essentially need to download it for your operating system as well as the chip architecture you run on, right? So for Mac, whether you're on an Intel or an M chip, and then on Windows, whether you're on an AMD or an ARM chip, and then Linux.
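If you're not sure which download applies to your machine, a few lines of Python can report your operating system and chip architecture. This is only a sketch: the function name and the labels it returns are made up for illustration and are not the exact wording on Docker's download page.

```python
import platform


def docker_desktop_build(system: str, machine: str) -> str:
    """Map an OS name and CPU architecture to the kind of download to look for.

    The returned labels are illustrative, not Docker's official wording.
    """
    is_arm = machine.lower() in {"arm64", "aarch64"}
    if system == "Darwin":  # macOS reports itself as "Darwin"
        return "Mac with Apple chip" if is_arm else "Mac with Intel chip"
    if system == "Windows":
        return "Windows on ARM" if is_arm else "Windows on AMD64"
    if system == "Linux":
        return "Linux"
    return "unknown"


if __name__ == "__main__":
    # platform.system() and platform.machine() describe the current computer.
    print(docker_desktop_build(platform.system(), platform.machine()))
```

Running the file prints the label for the computer it runs on; the function can also be called with values you type in yourself.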

All right. So you're going to download Docker and then install it. All right. I already did that step. I think everyone out there, if you're listening to this podcast, you've gone to a website before you've downloaded a program and installed it. So very simple. Step one, we copied and pasted that code from Anthropic. Step two is we've downloaded the Docker desktop and installed it and opened it as well. Now, let's see.

I'm going to have to share my whole screen here in a second. So I'm also on my computer, FYI. You won't see this, live stream audience, but I've opened Docker, and then when we're ready to get Docker going, I will share my whole screen. So we'll be jumping around a lot. All right. And then our last step, well, not our last step, but our last ingredient, so to speak, is we need to get that API key.

Okay, so here's the thing when I was talking about limits. So you have different limits. It says that you have a 50-requests-per-minute limit if you are on the tier one plan. I'll let you guys be the judge. I would say that's not true, all right? Because it's very hard to run any commands, even though I am on a tier one plan. And it says that's for this new Claude Sonnet 3.5, the 1022 version. Yes, I wish they just called it 3.6 so we didn't have to call it Claude 3.5. But it does say you get 50 requests per minute. I don't think that's actually the case, or who knows, maybe it just takes so many tokens because you're technically using computer vision every step of the way. So that is actually probably how many requests you get. But to do even the simplest thing, you'll see we're going to time out a lot. All right. So you can check your limits, but you can go into API keys, and we're going to create an API key. Okay.

So like I said, after I use this API key... or, you're not going to use mine. All right. So I'm going to go in, I'm going to copy and paste my API key, and I'm going to be deleting it right after this, so no one can run up my bill. Right. So first you need to give it a name. I'm just going to call it test-computer-use. Okay. Then you need to select a workspace. I'm going to put it in my default workspace and I'm going to click add. All right. And then from there, I have an API key.

All right. So I'm going to go ahead and copy that API key. And then I am going back to this document. Okay. So now here's what I'm doing. I know there's probably a lot of things going on on my screen. I always like to do this just to make it a little simpler. All right, so in the original command, there is essentially a placeholder, okay, where it says API key equals, and then it says, you know, dollar sign, ANTHROPIC underscore API underscore KEY.

Okay, so I'm copying my API key that I just created, and I am going to place my cursor and start with the dollar sign. So it says API key equals dollar sign, and I'm going to highlight through KEY. All right. Just through the Y in KEY, not an extra space, or else you're going to run into some issues. And then all I'm going to do while that's highlighted is paste my key in.
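The highlight-and-paste step above is really a find-and-replace, and a small Python sketch makes the "no extra space" warning concrete. Everything here is illustrative: `TEMPLATE` is a shortened stand-in for the quick-start command (the word `IMAGE` is a hypothetical placeholder, not the real image name from the repo), `fill_api_key` is a made-up helper, and the key in the usage line is fake.

```python
PLACEHOLDER = "$ANTHROPIC_API_KEY"

# Shortened stand-in for the quick-start command. IMAGE is hypothetical;
# copy the real command from Anthropic's GitHub repo instead.
TEMPLATE = "docker run -e ANTHROPIC_API_KEY=$ANTHROPIC_API_KEY -it IMAGE"


def fill_api_key(command: str, api_key: str) -> str:
    """Swap the $ANTHROPIC_API_KEY placeholder for a real key."""
    key = api_key.strip()  # drop stray spaces or newlines picked up while copying
    if not key:
        raise ValueError("API key is empty")
    if any(ch.isspace() for ch in key):
        raise ValueError("API key must not contain whitespace")
    if PLACEHOLDER not in command:
        raise ValueError("placeholder not found in command")
    # Replace only the placeholder, including its leading dollar sign.
    return command.replace(PLACEHOLDER, key, 1)


if __name__ == "__main__":
    print(fill_api_key(TEMPLATE, "  fake-key-123 \n"))
```

The same idea explains the advice to delete the key afterwards: once the key is pasted into the command, anyone who sees that text can use it.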

And then from there, it should hopefully be pretty simple. I did one test on this before. Sometimes it's a little buggy. So now all I'm going to do is copy this. All right, so essentially now I have this command that I'm going to put into Docker, which I got from Anthropic's quick start guide on their GitHub repo. I got my API key, I copied it, and then I pasted my API key into this Docker command that we're going to then go ahead and put into the Docker program. All right. So I hope that makes sense. So now we're going to get a little wild here, because now I'm going to share my whole screen. And hopefully this won't be too wild. All right, let's go ahead and share my whole entire screen here.

All right, let me close some other programs so we're not too distracted here. All right, we should be good. I'm going to share my whole screen and let's get into it. All right. So this is the Docker desktop program. So like I said, you download this, you install it, you launch it.

All right. Now, here's where it's kind of hidden. Remember how I said this is kind of like the Terminal program? So you're going to want to click the terminal at the bottom, and it says a terminal directly within Docker Desktop. All right.

So from there, the first time you run it, when you click that little terminal, you will probably get a button. I believe it says something like enable terminal. I can't replicate that now, but there will be one little button there that essentially says, you know, enable terminal. All right. So now I have essentially what looks like a normal terminal. And I'm going to zoom this up a little bit, so hopefully everyone can see it. All right. So I have a normal terminal here. Now, I don't have to do anything else. Remember, I copied that combination of the command from the GitHub repo and the API key, where I inserted mine. And I'm going to paste it. And I'm going to hit enter. And it should take just a minute to run, right?

And so you'll see it says starting. So it's essentially at the bottom of my screen. I can see it's kind of running through. I might actually run into an error here. Again, it's very, very buggy.

So, okay, it looks like it worked. So yeah, sometimes if it doesn't work, just try it again. But what are you looking for? You might run this and be like, okay, well, what happened? Well, there's just a little link at the bottom. It essentially says open, and then it gives you a localhost address. All right. And think of it this way: it is essentially a local version of a website that is technically running through something else. All right. So in this case, we are technically running a local website through all this code that we just put into Docker. All right. So now I'm going to click this. It's probably going to open in a separate window and I'm going to have to drag it onto my screen. Don't worry. All right. So let me go ahead and click that. It did open in a different window. Now I'm dragging it over. All right. There we go.
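If the link doesn't seem to do anything, one way to check whether that local website is actually up is to see if anything is listening on the port in the localhost address. The sketch below uses only Python's standard library; the port number in the usage lines is an assumption for illustration, so use whatever port your terminal output shows.

```python
import socket


def is_port_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if something accepts a TCP connection on host:port."""
    try:
        # create_connection raises OSError if nothing is listening
        # or the attempt times out.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    # 8080 is an assumed example port, not necessarily the one the demo prints.
    print(is_port_open("localhost", 8080))
```

A result of `False` usually just means the container has not finished starting, which fits the "try it again" advice above.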

So it is working. All right. Now let me explain what we actually have here. So we are on this localhost page, and it says Claude Computer Use Demo. It says: security alert, never provide access to sensitive accounts or data, as malicious web content can hijack Claude's behavior. All right. And then you can do chat, or you can look at the exchange logs. All we're going to do is chat. All right, I'm going to move this a little bit so we're not blocking the screen, and so hopefully we can see as much as possible.

All right. So essentially on the left side, we are going to be talking to this Claude computer use. And then on the right side, there is a virtual desktop. All right. This looks straight out of 1990, maybe 1995 if we're being nice. So for our podcast audience, it's a split screen. On the left side, I'm going to be able to talk to a version of Claude in this computer use demo. On the right side, if everything works, it is going to execute things in a virtual environment.

All right. So here's what we're going to do. I'm going to go ahead and paste this in. So here's what I'm saying. I'm saying, please find the largest American companies by market capitalization. Save the top three in a spreadsheet. Include their rank, name, symbol. I should probably spell symbol correctly, even though Claude will probably understand it, right? Because again, I am still talking to Claude, the large language model that's very smart and can understand human language, right? And that's the key here, y'all. You are still getting the power of Claude, but you are just adding to it the ability to use a computer, right? So it's using a digital computer.

All right. So I'm saying, please find the largest American companies by market capitalization, save the top three in a spreadsheet, include their rank, name, symbol, market cap, and then their CEO. This part might be a little tricky, so we'll see how it handles it. And then I'm saying, and add that in there as well. I should probably say, add that in the spreadsheet as well. All right. So I'm going to say, add that in the spreadsheet as well.

All right. So on the virtual desktop, there is absolutely nothing. So I'm going to put my hands in the air once I do this, because, you know, I'm sure our live stream audience, or maybe if you're watching this later on YouTube, you might not believe it or understand it. All right. So I'm going to click go. And then on the left side, how it actually works is it takes a bunch of screenshots. It uses computer vision, and then it maps out what it wants to do on this virtual desktop. All right. So I'm going to go ahead and hit enter. We're probably going to get a bunch of errors, but here we go.

Are you still running in circles trying to figure out how to actually grow your business with AI?

Maybe your company has been tinkering with large language models for a year or more, but can't really get traction to find ROI on Gen AI. Hey, this is Jordan Wilson, host of this very podcast. Companies like Adobe, Microsoft, and Nvidia have partnered with us because they trust our expertise in educating the masses around generative AI to get ahead.

And some of the most innovative companies in the country hire us to help with their AI strategy and to train hundreds of their employees on how to use Gen AI. So whether you're looking for ChatGPT training for thousands or just need help building your front-end AI strategy, you can partner with us too, just like some of the biggest companies in the world do. Go to youreverydayai.com slash partner to get in contact with our team, or you can just click on the partner section of our website. We'll help you stop running in those AI circles, help get your team ahead, and build a straight path to ROI on Gen AI.

All right, so for our podcast audience, it's saying, okay, I'm going to click on Firefox and search for this information. So it brought up Firefox, and now let's see. So on the left side, I'm seeing that each and every time, it's screenshotting something. So now it's searching, and it says largest companies by market cap 2024.
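The screenshot-then-act cycle described here can be sketched as a simple loop. This is not Anthropic's code: `Action`, `run_agent`, and the three callables passed in are hypothetical stand-ins, and the real demo does far more. It only shows the shape of the idea: look at the screen, let the model pick one action, carry it out, and repeat.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Action:
    """One step on the virtual desktop (field names are illustrative)."""
    kind: str      # for example "click" or "type"
    x: int = 0     # screen coordinates for a click
    y: int = 0
    text: str = "" # text to type, if any


def run_agent(
    take_screenshot: Callable[[], bytes],
    decide: Callable[[bytes, List[Action]], Optional[Action]],
    execute: Callable[[Action], None],
    max_steps: int = 20,
) -> List[Action]:
    """Repeat screenshot -> decide -> act until the model says it is done."""
    history: List[Action] = []
    for _ in range(max_steps):
        shot = take_screenshot()        # what the desktop looks like right now
        action = decide(shot, history)  # the model chooses the next step
        if action is None:              # None means the task is finished
            break
        execute(action)                 # click or type on the virtual desktop
        history.append(action)
    return history
```

Because every pass through the loop sends a fresh screenshot to the model, even a small task can add up to a lot of requests, which is one plausible reason the rate limits feel so tight.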

So it brought up a Google search result. So we're going to see if it's going to try to grab this information, because it kind of brought in an AI overview. So I don't know if it's going to go in there or go to a website. So it looks like it's not even going to go into a website. It doesn't look like it. All right. So let's see. It's still running.

All right. So interesting. So now I got a new error that I haven't seen before. It said warning. All right. So now I'm just scrolling up to see, but it's still doing everything by itself. It said: warning, failed to launch javaldx, Java may not function correctly. All right. But it looks like, if you see at the top here, it's still running, and it's doing some things to counteract this. All right. So I can still move my mouse, but I'm not taking over anything on the screen. So it's already typing, right? So it says rank, company name, symbol, market cap, CEO. Okay.

So again, podcast audience, my hands are folded on screen. And now Claude is doing all of these things in the computer use tool. All right. So I'm going to scroll down here because I'm guessing we're going to run into an error pretty quickly. All right. So again, when it Googled these things, it didn't even click into a website. It grabbed information from the AI overview, right? So, you know, sometimes you put in some information and you don't even have to go to a website. So Claude didn't even need to click on the website or visit it to get this information. I'm actually going to scroll up here. I want to see how it recorded it. Okay, so it looks like right here it did just scrape all of this information. I'm not sure if that's where it was. Yeah. So it looks like it did not even go into a website for the information, and it grabbed the top three U.S. companies by market cap. All right.

And you'll see, I'm pretty sure, let's see. Okay, I didn't run into an error yet. It's still just a little slow. I would have assumed that I would have already hit a token limit here, because let's see how many screenshots it did: one, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen. Yeah, so it already did way more screenshots than it did previously in my other testing, and it looks like it just stopped for whatever reason. Again, Anthropic said this is very buggy. As you can see, it's buggy. All right. So now it's going again. So let's see what it said. Here's why: that original information did not have the CEO. All it had was the company name, the symbol, their rank, and their market cap.

This is funny. It kind of failed here. So it said, let me search for Apple's CEO in Firefox. All right. It didn't bring it up on my screen. All right. And then, let's see. It looks like it just put in the CEO column on the spreadsheet, and it's using LibreOffice, which is essentially an open-source, free alternative to Microsoft Office. So it got some things wrong here. It didn't do it very well, because now it's saying, again, let me switch to Firefox and search for the current CEOs of these companies. So it didn't do it very well here. I'm scrolling down here. It looks like there is an error. All right. So now I finally hit the rate limit.

So let's look at the spreadsheet here. So when you run this, there's a toggle screen button in the upper right-hand corner. It says toggle screen control, off. So I can click it, and now it is on. So now I can actually go into the actual spreadsheet, right? So I'm clicking on an empty cell, and you'll see I can type in here if I want. I just typed the word type. So it looks like this failed because instead of actually finding this information, it just typed in, who is Apple CEO 2024? Instead of typing that into Firefox, it actually just typed that into the spreadsheet. So it ran into an error there and did not complete this. All right. And then you'll see it says essentially retry after a minute and 27 seconds.

All right. So in theory, I'm talking right now because I'm biding my time. Sometimes you don't even have to wait that long. I'm going to try it again. I'm going to toggle the screen control off. Actually, I need to toggle it back on and get off that cell I was on. And I'm just going to say, please continue. So I'm not going to guide it. Please continue. All right, we'll give it a second. A lot of times it takes a second for your message to show up. So now it's saying running agent. All right, so let's see if it can pick up and see where it went wrong. Those little sounds, which you may not hear, happen essentially with each message. I might just have to mute this so it doesn't bug me. There we go. All right.
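The wait-then-retry routine above is the kind of thing a program would handle automatically. Here is a minimal sketch, assuming a hypothetical `RateLimited` error that carries the number of seconds to wait; neither that class nor `call_with_retry` comes from Anthropic's libraries, and the 87 seconds in the usage example is simply the minute and 27 seconds from the demo.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class RateLimited(Exception):
    """Hypothetical error meaning: too many requests, wait before retrying."""

    def __init__(self, retry_after: float):
        super().__init__(f"rate limited; retry after {retry_after} seconds")
        self.retry_after = retry_after


def call_with_retry(
    request: Callable[[], T],
    max_attempts: int = 3,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Run request(); on RateLimited, wait the suggested time and try again."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        try:
            return request()
        except RateLimited as err:
            if attempt == max_attempts:
                raise              # out of attempts, so surface the error
            sleep(err.retry_after)  # e.g. 87 seconds for "a minute and 27"
    raise AssertionError("unreachable")
```

Passing `sleep` in as a parameter is a design choice for testing: a fake sleep function can record the waits without actually pausing.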

Not doing the best here. So again, we're just running into some repeated errors. It looks like, for whatever reason, it's actually struggling to go back to Firefox, right? So it looks like it's struggling to toggle between the two, at least I'm not seeing anything on the left-hand side. It's showing me all of these screenshots that it's taking. And really, every time it updates something in the spreadsheet, it's just taking a photo, right? So it didn't do a good job here. I'm going to go ahead and say it failed this task. All right. So I'm going to toggle screen control on. I'm going to go ahead and exit out of this document here. All right. I'm going to go ahead and kind of clear this. All right. We're going to give it a minute. We're going to give it one more run here. I'm going to clear the cache. So I'm essentially clearing everything, right? I'm also going to go ahead and reload this host. There we go. All right. So I have a blank screen here.

So now all I'm saying is, please go to the website youreverydayai.com, with the https, you have to include that, and find the latest episodes. Instead of saying create a spreadsheet, because I just did that and it was struggling, I'm going to say create a Word doc and write basic info for the last five episodes in the doc. Include the episode number, title, and a description. And I'm going to say to do this for three, because with five we're probably going to run into a bunch of errors. And I'm going to give a command that you would maybe give a large language model, and I'm going to say, please write a witty intro that will catch people's eyes.

All right. So this is the last demo that we're going to do. So now I'm clicking this again. We're starting from a blank doc here. If everything works right, computer use is going to click on Firefox. There we go. Presumably now it's going to go to youreverydayai.com. At each step, it's sending a screenshot, right? So it knows what to do, and then it gives it the coordinates.

All right. It looks like it's having a problem rendering something on our website. It made an emoji, like, size a trillion, but that shouldn't keep it from using this. So let's see. It ran into an issue. It said it couldn't open a directory file. Finally, it got it. All right. Also, I didn't see it go to the episodes page. Let's see. It looks like it only went to the home page.
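
The screenshot-then-coordinates cycle described above can be pictured as a simple loop: take a screenshot, hand it to the model, get back an action such as a click at some coordinates, apply it, and repeat. Here is a rough sketch of that loop under heavy assumptions. `FakeDesktop`, `scripted_model`, `Action`, and the coordinates are all invented for illustration, and the "model" just replays a fixed plan rather than looking at any pixels. It is not how the actual demo or any real API is implemented.

```python
from dataclasses import dataclass


@dataclass
class Action:
    kind: str       # "click", "type", or "done"
    x: int = 0      # screen coordinates, used by "click"
    y: int = 0
    text: str = ""  # text to enter, used by "type"


class FakeDesktop:
    """Stand-in for the virtual desktop; it only records what was done."""

    def __init__(self):
        self.log = []

    def screenshot(self):
        # A real desktop would return image bytes; a label is enough here.
        return f"screenshot-{len(self.log)}"

    def apply(self, action):
        self.log.append(action)


def scripted_model(task, screenshot, step):
    """Hypothetical model that replays a fixed plan, ignoring the screenshot."""
    plan = [
        Action("click", x=40, y=300),  # made-up position of the Firefox icon
        Action("type", text="https://youreverydayai.com"),
        Action("done"),
    ]
    return plan[step]


def run_agent(task, desktop, model, max_steps=10):
    """Screenshot, ask the model for the next action, apply it, repeat."""
    for step in range(max_steps):
        shot = desktop.screenshot()
        action = model(task, shot, step)
        if action.kind == "done":
            return step  # number of actions applied before finishing
        desktop.apply(action)
    return max_steps


desktop = FakeDesktop()
steps = run_agent("Find the latest episodes", desktop, scripted_model)
print(steps, [a.kind for a in desktop.log])
```

The `max_steps` cap is a design choice worth keeping in any version of this loop, since an agent that keeps toggling between windows without progress, as in the spreadsheet attempt, would otherwise run indefinitely.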

So let's see if it realizes that. So now it says, let me scroll through the episode page to gather the information about the latest episodes. All right. But again, when I did a demo of this earlier, right? Live demos are the worst. It did fine kind of toggling between spreadsheets and Word documents and Firefox. For whatever reason, this time around, it looks like it is struggling, right? So it's still running here at the top. So I just have to give it a second.

Hey, live stream audience, I know this one is going on a little bit, but let me know. Are you going to try this? Are you going to use it? Or have you seen enough, right? And you're like, ah, this doesn't really look that good. It looks buggy. But I will tell you this. I know that the Anthropic team can ship, right, really quickly. So, all right, now let's see what it's doing. So now it is back in the document here.

It wrote some quick recaps. I'm going to see it here in a second. It looks like it went to the save dialog, so it's going to save this. It saved it as Everyday AI episodes. Technically, it kept the untitled information in there. Let's see if this is finishing. Let's see. Okay, so it didn't tell me that it finished yet, but it did what I asked it to, right? It didn't format it great, but it says...

Let's see. It says, Your Everyday AI serves up the freshest, most digestible AI insights that'll make you sound like a tech guru at your next coffee break, right? So this is all content that Claude wrote. This isn't from our website. So it went to our website, looked at the homepage, looked at the episode page. So then it wrote a quick recap for the three. So it says, episode three... you know, and I did tell it, I think I said, hey, be witty. Let's see. What did I say?

What did I say to the large language model? I said, please write a witty intro that will catch people's eyes. All right. So it said, episode 388, the duality of AI productivity, the fascinating exploration of how AI can both enhance and challenge our traditional notions of productivity. Discover the sweet spot between AI assistance and human creativity. All right. So it looks like it did the job there.

Fine. Wrote a quick recap. And then it told me here, essentially, yo, I finished. It said, I have created a Word document with a catchy intro and information about the latest three episodes from Your Everyday AI. The episode has been saved as...

And then it gave me the name, everydayaiepisodes.odt, in your home directory. The document includes, then it tells me, a witty introduction, information about the three latest episodes, and brief descriptions. And then it says, would you like me to make changes to the document? Or would you like to see it in a different format? So I'm going to try one more thing. I'm going to say, please format the document and add paragraph breaks between the descriptions.

And I'm going to say, add more engaging content. All right. I'm going to leave that one open-ended. I may or may not even let this one finish because I know this video is dragging on a little bit. I haven't even tried this yet to see how well it does at modifying documents that you may have it make. So again, podcast audience, I am just typing with Claude in real time. Okay. It highlighted all of that content.

And it's essentially rewriting it. It looks like it's failing again. So instead of a paragraph break, it just added what looks like a little symbol that would, in theory, denote where you would want to have a paragraph break. But it did write a lot more content. Looks like a little bit more engaging as well. So nothing here that is going to, you know... nothing's ready for production. Let me just say that.

We're going to wrap this one up. I'm going to keep an eye on it on my screen here as we wrap. But this is nothing right now that is going to change the way that we all work. But it's laying the groundwork, right? The fact that this technology is available right now, and I just walked you through it. Yeah, it took me longer than 20 minutes. You should know that now, right? But you can go through and follow this. It doesn't take long.

Like I said, I'm going to leave that kind of little piece of code in the episode description, and if you are listening here on LinkedIn, that information should be there as well. But with very little developer savvy, right? A little bit of copying and pasting. You can literally direct an AI agent, right? It's not good right now. Don't get me wrong. I'll say it's downright mediocre, but it works, right? It's buggy, but it works right now.

Anyone out there with a computer and a credit card, and it doesn't cost a lot, right? I'll have to go check my usage. Anyone can go do this, and you can have a language model, a very capable one in Claude 3.6. We'll just call it 3.6 Sonnet. Come on, Anthropic. All right. So with 3.5 Sonnet (new), fine. You can go have it use a computer.

And this is not the endpoint, right? This demo, this virtual machine, this is just to showcase the capabilities. This is not the end goal, right? This is just... wait, because any day now, we are going to see developers create fully functioning, robust, polished tools. That's what this is all about. It's giving businesses, developers, third-party software providers access to this technology, right?

This clunky demo is just a clunky demo, right? This isn't the end use, but y'all, AI agents are coming. I can't even say they are coming. They are here. I literally just walked and talked you through it on a live stream. And you don't need to have a technical background. You don't need to be a coder. All you need to do is to be able to type to a large language model. It's pretty exciting.

All right. I hope this was helpful. If so, please go to youreverydayai.com. Sign up for the free daily newsletter. Also, tell me, what else do you want to see? What else do you want to hear? I did this because a lot of y'all, after we covered this last week, you said, hey, Jordan, I know it might be a little more technical, but do a demo, show us how to use computer use.

So here you go. You want it, you get it. What do you want to hear next? Thank you for tuning in. We hope to see you back tomorrow and every day for more Everyday AI. Thanks, y'all. And that's a wrap for today's edition of Everyday AI. Thanks for joining us. If you enjoyed this episode, please subscribe and leave us a rating. It helps keep us going.

For a little more AI magic, visit youreverydayai.com and sign up to our daily newsletter so you don't get left behind. Go break some barriers and we'll see you next time.

This transcript was generated by Metacast using AI and may contain inaccuracies.