AI Agents Full Course 59 Minutes (for beginners) - podcast episode cover

AI Agents Full Course 59 Minutes (for beginners)

Mar 17, 202659 min
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Summary

Greg Isenberg and Remy Gaskell demystify AI agents, explaining how they differ from chat models by focusing on goal-to-result execution through an "observe, think, act" loop. The discussion dives into practical applications, showing how to create self-improving agents using markdown-based context files (agents.md, memory.md), integrate everyday tools via MCP, and develop reusable "skills" (SOPs for AI) to automate complex tasks and entire departmental workflows. By the end, listeners learn to drive any agent platform and build an AI operating system.

Episode description

I sit down with Remy Gaskell to break down how anyone can build AI agents to run entire departments of their business. Remy walks through the core concepts: agent loops, context files, memory, MCP tool connections, and skills. We put everything together by building a fully functional executive assistant live on screen. This is a beginner-friendly crash course that covers Claude Code, Codex, Cowork, Antigravity, Manus, and OpenClaw, showing that once you understand how to "drive," you can jump into any agent platform. By the end, listeners know exactly how to set up markdown-based context files, connect their everyday tools, and create reusable skills that compound over weeks and months.

Timestamps

00:00 – Intro

01:35 – Agents vs Chat

03:22 – The Agent Loop

05:46 – How Agents work

06:39 – Demoing Agents (Claude Code, Codex, Antigravity)

08:52 – Security and Agent Permissions

10:43 – Comparing Results Across Three Platforms

13:57 – Startup Idea: Cold Email Website Offer

14:50 – Folder Structure and Department-Based Agents

15:52 – Onboarding an Agent Like a Real Employee

17:05 – Voice-to-Text With Monologue and WhisperFlow

18:04 – Chat Memory vs. Agent Memory

19:34 – Building the agents md

22:20 – Context Engineering Over Prompt Engineering

24:29 – How Memory Compounds and Reduces Errors

30:27 – How Big Can memory md Get?

31:43 – Connecting Tools via MCP (Model Context Protocol)

34:49 – Working in Claude Code for High-Value Tasks

37:09 – Why the Real Value Is in Stacking, Not Summarizing

40:04 – What Are Skills? (SOPs for AI)

43:08 – Creating Skills

48:36 – Real-World Example: Ads Analyst Skill: 4-Hour Process in Minutes

50:37 – Chaining Skills together

52:01 – Real-World Example: Automated Car Search

53:34 – OpenClaw and Migrating Agents to More Autonomous Platforms

55:19 – Which Platform Should Beginners Start With?

56:28 – Global vs. Project-Level Skills, Context, and MCPs

Key Points

  • Agent platforms (Claude Code, Codex, Cowork, Antigravity, Manus, OpenClaw) are all running the same observe-think-act loop under the hood — learning one means you can use any of them.

  • The shift from chat to agents requires moving from prompt engineering to context engineering: load the agent with rich context so simple prompts produce excellent results.

  • A memory md file creates a self-improving loop where the agent learns preferences across sessions and makes fewer errors over time.

  • MCP (Model Context Protocol), built by Anthropic, acts as a universal translator between your agent and every tool it needs — Gmail, Calendar, Stripe, Notion, and more.

  • Skills are reusable SOPs packaged as markdown files; once you explain a process once, you can invoke it repeatedly, and they compound as you add three to five per week.

  • Scheduled tasks turn skills into automated workflows — morning briefs, car searches, ad library analyses — that run on a cron without any manual trigger.

The #1 tool to find startup ideas/trends - https://www.ideabrowser.com

LCA helps Fortune 500s and fast-growing startups build their future - from Warner Music to Fortnite to Dropbox. We turn 'what if' into reality with AI, apps, and next-gen products https://latecheckout.agency/

The Vibe Marketer - Resources for people into vibe marketing/marketing with AI: https://www.thevibemarketer.com/

FIND ME ON SOCIAL

X/Twitter: https://twitter.com/gregisenberg

Instagram: https://instagram.com/gregisenberg/

LinkedIn: https://www.linkedin.com/in/gisenberg/

FIND REMY ON SOCIAL

X:https://x.com/remy_gaskell

Youtube: https://www.youtube.com/@aiwithremy

Instagram: https://www.instagram.com/aiwithremy/

Transcript

Intro

I think AI is confusing. There, I said it. I think there's a lot of terms, skills, MCPs, agent harnesses that are Difficult concept to understand. So I had my friend Remy come on the podcast. and explain it in the most simple terms possible. In this free course on how to master AI agents, he breaks down exactly what each piece is. connect together and the simplest ways beginners could start using them today. Enjoy the episode.

I beg them to come on. Remy Gaskills on the pod. You've structured your company where you basically have these folders and dot MD files that Run your company. And what I want to do today is I want you to teach people in a beginner-friendly fashion. This is only for beginners. how they could do the same thing, how they can set up their own executive assistant, head of marketing, chief financial officer. Basically, I want you to basic to tell us

the concepts behind all this. By the end of this episode, Rami, do you think you can do that? Hundred percent, Greg. We're gonna go through all the concepts that make up an AI agent. And by the end of this video, you will know exactly how you can build up um agents to run complete departments of your life and your company within any agent platform you choose, whether it's Claude Code, Codex, OpenCore, MANIS. All of'em. All right, let's do it.

Agents vs Chat

Sweet. So one of the reasons why I really wanted to make this episode is because I feel like the AI landscape is moving into like stage two. From chat to agents. And most people are getting left behind right now, just using the chat models. And uh the founders and employees that are utilizing agents. Are like no word of a lie, 10 to 20 times more productive in their day. And when you stack that up over days, weeks, years,

You're going to just be miles ahead of the competition. So I really want to make this episode today to help bring everyone up to where the AI landscape is at the moment and to start using agents to manage every department of your business. So the key thing to understand here is chat models versus agents because the word agent is thrown around lots online. I'm sure you've seen it, Greg, like Ay, agents this, agents this, use this agent for this and it's kind of lost a lot of meaning.

So I wanted to give start by giving a really clear definition of what an agent actually is. So the way I think of it is a chat model is question to answer. But then an agent is goal to result. So moving from just like uh you asking AI replies, then you do the work, to you giving the agent a task. It planning out the task and then executing and then delivering you a result. Does that make sense?

Crystal clear. I mean the way I think about it is chat is kinda like ping pong back and forth, back and forth. Yep. And agent is Uh, you know, you're giving it it's a goal. I mean the best way yeah, you're giving it a goal and you're hoping that over time it gets better and closer to that goal. Exactly. Yeah, that's exactly it. And I just think that's a nice way to lay it out in your head is chat is question to answer, agent is goal to result. So when you chat to an agent,

The Agent Loop

You might give it a task like, build me a website for XYZ, and then it goes away, it does its work, and outputs this wonderful website to you. But it's really important to understand what's actually happening in this step here. So inside this agent step, we have what's called the agent loop. So you give it your prompt or task, and it goes through these three steps here, which is observe, think, and act.

So let's just say for example, um we're actually going to do this demo after this, but if we gave the agent a simple task like Build me a minimalist portfolio site for Greg Eisenberg. It's gonna start by like you've loaded in that prompt. It's gonna check if there's any files in the workspace that it can work with. Like maybe you've got some information on Greg Eisenberg. Um and then it's going to think about what to do next.

It's gonna act and then it just keeps going through this loop. So for that actual example of building the portfolio site for Greg Eisenberg, let's just say it was a blank agent, we hadn't given it any context. The first thing Is it's received this prompt to build the website. And the first thing it's going to be thinking about is okay, well, I need to build this website about Greg.

Who the hell is Greg Eisenberg? So it's gonna then decide to do some research into Greg Eisenberg. It's gonna research everything about Greg and then feed it back into this observe step. So then it's gonna think to itself, Okay, so I've got this prompt to build a website. I've now got my research here, so I know exactly who Greg Eisenberg is, and then it's gonna start thinking, what is the next step?

And the next step is probably to write up a plan to build the website. So it might write up that plan, feed that back in. Now it's got the research, the prompt, the plan, and it will think, all right, what next? I should probably write the code. Did I write the code, feed it back in, and it just keeps going through this loop as many times as it needs until it can conclude that the task is complete.

And how it concludes that the task is complete is based on the parameters that you set in your prompt. So if you're giving it a research task, you might say compile 10 sources and then create a report as a PowerPoint. And then once it's compiled 10 sources and built the report as a PowerPoint, it can conclude that the task is complete and then give you the output as the user. The agent itself is made up of these four components.

How Agents work

So it's the LLM, which is the brain behind it. So think like, you know, Claude Opus four point six or GPT five point four or Gemini three. It's the model. Uh it's got the loop, which means it just keeps going until the task is done and doesn't stop after one response. So you're going from ping-pong to like it continuing to go rather than you having to sit there babysitting it.

It connects in all your tools and then it connects in all the context. And a platform that facilitates this process and basically facilitates this loop to happen is known as an agent harness. And all of the popular AI agent platforms on the market that you'd be familiar with are just agent harnesses. They're just applications where this loop is facilitated.

Demoing Agents (Claude Code, Codex, Antigravity)

And I wanna actually run this little prompt I prepared earlier. I wanna open up Codex, Claude Code, and Anti-Gravity, and I'm gonna show you this loop uh actually happening in action. So I've uh nicely prepared before the episode these three demo folders which we're gonna run in. So I'm gonna open up demo one to work in enclawed code.

And the way these folders work is if you've used if you're familiar with like any of the chat models like Claude and ChatGPT, there's a projects feature, which is where um if I open it up actually. Try not to get dizzy with me switching tabs so much. But you know, if we create a a project here, it contains all your chats in one place. It allows you to upload all your sources here, which is your context. And then you can even add custom instructions.

which tells it how to behave within this project. And that's also known as a system prompt. Which we're gonna dive into how to do this with agents as well later. But it's a similar concept that you'd be familiar with if you've used projects before. But instead of the project being here on the cloud, we're actually working within projects that are local and our computer. So I've just selected this demo one for now.

Then we're gonna run build a minimalist portfolio site for Greg Eisenberg. And then this little bit here just tells it to actually spin it up, like to publish it on the web uh in a preview mode so we can see what it's done. So I'm gonna run that. So so this is this is clo um Claude Code. Yes. Yeah, right now we're in Claude Code. And this is just accessing it through the desktop app for Claude. So I'm just gonna run that.

And then I'm also gonna give the same prompt to Codex here. So this is the Codex app. And you can see same concept. It says let's build we can choose a folder on our computer to work in, like demo two. And then we're gonna give that a prompt as well and we're gonna tell it to host it on a different one. And then also in anti-gravity. So you can see same concept. We're going in, selecting a folder, and then we will

Give it the prompt as well. How should people think about security and these different products?

Security and Agent Permissions

I like to think of security as in just like Scoping what they have access to. So by default, anti-gravity, cloud code, and codex, they're very, very secure because they're built by these massive companies that have a lot on the line to protect.

And I just, you know, if you're if you're building out these agents to manage different elements of your business, like the other week I built one that does manages meta ads. And obviously that's quite a risky thing to give an agent control over managing ad budget. So it's just comes down to like what you feel comfortable giving the agent and also you can control what privileges or you can control what um

tool permissions as access to so that if it was compromised for whatever reason, the worst case like isn't that bad. And that means, you know, just giving it like read-only access to certain important platforms and stuff like that. Does that make sense? Yeah, totally. I mean comparing it to like OpenClaw, which is like way. Which I wanna touch on at the end as well,'cause that's the same thing, just another harness, but it's just like the Wild West. Cool.

Uh and one thing like a nice little analogy to think about these harnesses is What we're gonna learn today is we're gonna learn to drive. So we're going to learn about how to, you know, steer the car, like how the pedals, the brakes work, the accelerator works, the handbrake.

But then once you know how to drive, you can kind of jump in any car, whether it's like a old Toyota, a Range Rover, and you inherently sort of know what to do. And that just comes down to understanding all these key concepts that we're gonna go through today. And you can think of the agent harnesses like different cars. And some of them will have better features like seat warmers and cruise control, but it's all once you know how to drive, you can pretty much jump in any of them and use them.

Comparing Results Across Three Platforms

So we've just got our thing over here building tar, uh building the website for Greg. And it's going through this agent loop right now. So you can see here it's actually decided that it's gonna launch um an agent to go and research Greg Eisenberg. And I've connected it up to perplexity, so it's now using perplexity to research Greg. So it's going through its first step of the loop. And I imagine that uh codex has also done something similar here. You can see it's still working, but it's gone.

Started to build this out through the loop. I think Claude Code does the best job of actually displaying that loop. Um and allowing you to see what it's thought about compared to anti-gravity and codex. But it's all just going through the same sort of loop process that I described earlier. And I think that when you say you hooked it up to perplexity, didn't it's not like you asked it to hook it up, right? It just sort of did it?

Yeah, because I've um I've given clawed code perplexity as a tool via MCP, which we're gonna get into um very, very shortly, all about MCPs, which is just connecting tools up. So we can see that in anti-gravity, it's gone, you can see this thinking process, it's gone um I'm now examining the current directory to figure out if there's an existing project or if I build one from scratch. It's then going, um, I'm now gonna start to build this thing.

And then it's built the the website and it's given us a little localhost preview here. So it's created this nice little portfolio site for for you, Greg. What's interesting is like it's super minimalist and I mean It it did its job, right? Like it This is a I would

Totally launch something like this. It actually looks really nice. Did it did it scrape your email address correct? That's that's not my email address. And I don't live in Canada anymore. So but yeah, so there's a few copy things, but other than that, uh Yeah. It's done a pretty job. It did.

And if we go so that was anti gravity, um if we go into codex as well, you can see here it's finished doing its website, which is somewhat similar. I think I prefer Gemini's. Yeah, I agree. And if we check out Claude as well. Um it's still going. But you can see this loop, right? It's gone, okay, first off, who is Greg Eisenberg? It's gone and researched Greg, then fed it back into that observe step, and it's gone, all right, what next?

Now I need to create the HTML file. So it's written the f the code. And then now it's gone, okay, so he wanted it spun up on this local server. So now I'm gonna spin it up on the server. And then the last iteration of the loop is to check that it's actually done and can conclude the task is complete.

It's opening it up and screenshotting the website and then reviewing the screenshots to check that the website is complete. And you can see here it's done another pretty good job. This one's very similar to the Gemini one, hey. That's true. Um but yeah, that's just like demoing how that loop is actually working in real time. Yeah. I mean what comes to mind just by watching this is like

Startup Idea: Cold Email Website Offer

How many people on the planet would benefit from a very clean website? And like how how do you set up these agents so that like You know, maybe it's like a cold email loop, right? Like you're sending cold emails, hey, I built you this website, so and so business. Do you want it? It's gonna cost two hundred fifty dollars.

Yeah, yeah, that's actually a great idea. Um, pre-making websites for companies. And it's like an off-the-shelf thing. It's like, hey, I I made you this website if you want it, like if you want to own it, it's$250. You can just do a mass called email thing.

Uh cool. So I think that's like pretty much illustrated that agent loop example. So I'm just gonna go um back to our trusty board over here. But you can understand that it's just like all of these apps are just different flavors of the same thing.

Folder Structure and Department-Based Agents

And then what we're going to be working up to today is my workspace looks something like this is I have, you know, a big like a a folder for each company or or client that I'm working in. And then I'll have folders underneath with all my heads of departments. And then um within those heads of departments, I'll have skills and MCPs, which we'll get into, and context.

And then I've got like an overarching one at the top to just sort of manage them all. But we're gonna be focusing today on building out this executive assistant to take care of just your manual day-to-day tasks and free up at least one to two hours extra per day. Cool. So to build this out, like we did uh with our demos, it's running off your local files. So we're going to create a folder here called Executive Assistant.

And also through building out this assistant, it's going to allow us to clearly explain each of the concepts of building an agent in real time.

Onboarding an Agent Like a Real Employee

And the way I like to think about building agents is onboarding them like a real employee. So if you took on a real executive assistant, you couldn't expect just for them to come into the office and you'd give them a task with out explaining your business first, your clients, what you do, the tools. Um because it w they just would not be a very good executive assistant. So that's the first step that we need to go through when we're building out this agent. So uh I'm actually gonna work

Uh in cowork at the beginning. So cowork is just another agent harness to do the pretty much the same thing as all the others, just that loop connecting in your tools and the context. So you can see here, um this was my little uh previous session where I was building some diagrams.

But we can go um and you can follow along in Claude Code or Codex or Anti-Gravity or whatever Asian harness that you want to work in. But I just think that cowork has really nice simple UI for people to just understand really well what's actually going on. So we're going to open up this executive assistant folder. And you can see here that if we ask it. Write me a cold email. And send that off.

Voice-to-Text With Monologue and WhisperFlow

So p so people are gonna ask how how did you transcribe? You did like a voice to text Yeah. So that is um I use one called Monologue, but there's a lot out there on the market. Whisperflow is another popular one. And it just allows you to hold a little button on your computer and Just yap away and it will just transcribe it neatly into text. And I find that looks good. My log looks really good. Yeah, I think it's built by the team at Every. Every.

What it's asking so it's it's straight away we we've it's got no context here. So it's working out of uh that folder here on our computer, but there's nothing in the folder. And it has no memory of our previous sessions. Um and it's asking like what like what do you even sell? Um and then we've got to kind of give it like who do you target, what tone do you want? This is all things that our executive assistant should know. Um so I'm just gonna stop the response there.

Chat Memory vs. Agent Memory

And one thing that's really important to know, which might be a bit of a shock moving from chat to agents, is that these agents memory work a little bit different. So if you're used to using chat models like ChatGPT and Claude, if you open up a fresh session in one of these chats, You don't give it any context, you don't upload any files, and you just say Who am I and what do I do?

it's gonna know a scary amount about you. And that's because with these chat models, they have memory built in automatically. So every time you sort of say things that are important, the chat model saves it to its memory in the cloud that you can't see and you can't control. And with agents, you have to set up memory and control exactly what you give it.

And I think that's actually a a benefit, not a limitation, because what happens is If you're using Chat GPT and it's got the auto memory, you're having conversations about three different companies, maybe you're asking for relationship advice, and then all of a sudden um when you ask it to write a landing page copy. It's pulling in context from all these other places that you don't really want in there.

So with these agents, um you need to actually set up that context and memory. So as you can see, when we asked it to write a cold email, it just had no idea about anything. So we need to give it a context file. And the way you do this, right, so you can see this example here, it doesn't know anything about us. And that's because we haven't populated what's called an agents.md file.

Building the agents md

And an agents.md file is just like a system prompt. Just like if you've created any custom GPTs before, you have that field for custom instructions. Or in the project like I just showed before, you've got that field for custom instructions. And it just gives it this context that's kind of always there, always on. And you put in there things like its role, context about you, um, your preferences for working.

And then what happens is every new session, before it answers your your query or task, it loads in all this context to its brain as part of that observe step in the loop. So I have pre prepared. So I've pre-prepared a agents.md file here. So if we drag this in over here.

This uh when you're working within Claude Code, it's called a Claude.md. When you're working within Gemini, it's called a Gemini.md. But when you're working within Codex or OpenClaw, it's an agents.md. But it's all the same concept. So we can drag this into our folder here. And if we open up this file for a little preview.

We can see here, I've got in here all about me, what my business does, my working preferences, like the tools that I use and what for, like Notion Project Management, Stripe, um We've got, you know, all the information, my item customized, loaded with context here. And I pre prepared this but and if you want to make one of those you can just use ClaudeChat or cowork whatever and you can ask it to help you build out this uh

The agents.md file and to just ask you interview style questions to extract all the context from you and then build the file. So if I jump back in, now if I go to a new task. same folder and we say write me a cold email, it's gonna have all that context. Yeah. That's what we hope. That's what we hope. There we go. And it knows you can see this files over here. It knows automatically to load in this file if you title it correctly. Yeah.

It's basically just like a remo a reminder file. Yeah, pretty much. It's just like loading it in so it has all this set context before you even start working. And one of the other big shifts to make, which comes with moving from chat to agents. Is prompt engineering used to be the big thing. It was like, here's the ultimate prompt for going viral on social media, or use this prompt for this.

And now it's all about context engineering. It's about how well can you load up your agent with all the information about your business so that your prompts can be stupidly simple, like write me a cold email, and you're still going to get an amazing result.

Context Engineering Over Prompt Engineering

Um you can see already here, it's already asking like um is it a brand or sponsor, potential partner or consulting client? So it's already got that context, um, book a call. Um you know, it's it's it's loaded in everything that we've given it from that agents.md file. And then now we've got a a pretty decent cold email there, ready to go.

So that's basically agents.md files for you. And you want to create one of those to onboard your agent with all the context it needs. And if you have lots of context, Without getting into too many advanced concepts here, sometimes what I will do is I will create like a um a folder called context. Load that in and in here it's got different files, about me, brand voice, idle customer profile, et cetera, et cetera.

And then in order to keep this smaller, I will then just say in this claude.md file, before answering any questions or before doing any tasks, read my context folder to understand about myself and my business. Because by default, if you just have this context file in here but no clawed.md, it won't load all that into the session by default. But if you tell it in this file that it always loads in to then check this file, you can start to like string all your context together.

And a lot of people have done that with obsidian. So they'll have like in their claw.md file, they'll tell it to go check their obsidian vault for their second brain to go and find contact. So that is agents.md files explain. So that's how you actually, when you're onboarding your agent, like our executive assistant, you can train it up on who you are and your business.

And then as you can see here, you know, I've got folders for all these different roles in my business. And in the head of marketing, that claude.md file would look Somewhat similar, but in the top it would say instead like you are my head of marketing. You speak like this. These are your tasks. These are your roles. And then the second thing here is about memory and the self-improving loop.

How Memory Compounds and Reduces Errors

So we've solved the problem now. Um try not to get too dizzy with me switching tabs. But we've solved the problem of our executive assistant not knowing anything about us or our business. But now we have a new problem, which is it doesn't really remember the intricate details or your preferences across sessions unless you're manually going and updating that clawed.md file. So you can see here if we go. Um my favorite colour is lavender.

It'll probably say something like, Got it, noted. Yeah, that makes sense, right?'Cause it's And it's it's adding where where's that adding it? Well it's not adding it, that's the thing. So we can tell it my favorite colour's lavender and it's gone, the users just shared you that's that thinking step. It's like the users just shared this. N like no, nothing needed. Good to know, I'll keep that in mind. But then if we go into a new session, same folder, and we go, what is my

Fave colour. Mind my spelling. it's gonna say, I have no idea what your favorite colour is, even though we just told it. And that um is an issue, you know,'cause if you're working, you know, in uh you've got like a head of sales or something and it keeps it signs off your emails wrong. And you tell it, you correct it, and you say, Never sign off emails with cheers. Say warm regards.

And it will go, okay, got it, noted. But then the next day you start working and it does the same thing again. It's like, well, like my agent's broken. But really it's not, it's running off those context files in the back. And unless you are manually updating it. It won't know to save that preference. So what I like to do is I like to add in. Something like this to my agents.md file. So this is just a little simple thing. You can pause the video and copy it.

But I like to I'm just gonna remove that context file for now. That was just to illustrate that example of adding more. But we're just working with this one file for now. So I'm just gonna open this up so I can edit it. And I will quite often add something on the bottom, like that little snippet. And this basically just says, actually you know what? I might just add it at the top. Just so it's their top of mind for my agent.'Cause I think this is really important.

So you can see I've just added this in. It just says read all files in context. Read memory.md. This is what you've learned over time. And then when I correct you or you learn something new, update the relevant section in memory.md. And it's just got a couple little things here. And it just says keep memory.md current. When something changes, update it in place and replace outdated info.

So we can do command s to save that. And then I'm gonna add another file here. We can actually just duplicate this. And this one I'm going to call memory.md. And then we can open up this one and I'm just gonna remove all this context here Except I'm just gonna keep those sections. So memory.md is basically, I mean it just It's what it sounds like, right? It's basically like you want to

You know, if the goal is to build, you know, AI employees that do things for us, they're gonna need to need to remember our preferences, right? A good employee remembers preferences and learns over time and not c that compound. So memory.md is just a place that you can just make sure that

uh over time it you know, your whatever you're using, cowork or whatever, it ends up it ends up getting compounded, getting smarter. So ultimately You might be trying things like cowork and and you're you're not getting good results. And a big part of that is you don't have cl uh c uh clo dot M D and memory.md sort of set up. Ex Exactly. Exactly. And now the thing is some of these agent harnesses

have started to add in this memory system that we're doing manually, telling it to update. Some of them have have got that built in automatically, like OpenClaw and I believe like Manus and some of the others have that built in automatically. But it's still important to understand because it's just doing the same thing under the hood, except they've just set this up for you. Um so we've got this here now. We've got our memory.md, our clawed.md, and then now if we go back into

work. If we do a new session in that same folder and we say my favorite color is lavender. Better remember. For the sake of the demo, I I hope that it that it does what it's told. It it's gonna remember it. You can see here. Perfect. It's gone. Good. I'll remember that. Let me save it to memory. And now you've got this big memory file that builds up over time.

And whether, for example, this is your executive assistant, so it might be saving preferences like how to sign off emails or don't connect with clients on Slack, I always want to keep client comms on email. But if you're building out like a head of marketing, it might be preferences about how you like your ads structured in Facebook Manager. If you're building if you've got a folder where you're working on a website or an app.

It might be things like don't use dark mode and then it will update so it'll never use dark mode again. And they just compound over time. So as you start to build up these rules, the amount of errors go down. And and this just compounds and compounds over weeks and months.

How Big Can memory md Get?

Remy, have you seen some of these memory dot MD files get so big that at a certain point it's just ineffective? Great question. I personally haven't happened had that happen to me yet. I haven't hit that threshold. Yeah. Um but A best practice for those clawed.md files is to keep it around like no more than two hundred lines.

And yeah, I could imagine if you started to build this up over years and years, you'd eventually hit a point where all the the rules are stepping on each other's toes. Um and then you you know, you could probably go through and do a bit of like a manual clear. But I haven't hit that threshold yet. Cool. So people don't need to worry about... cluttering their memory dot M D

I wouldn't worry too much. I mean if it's saving like this like the silliest little things, like the tiniest corrections, you can maybe update that claw.md to say only save like substantial corrections, you know? And then you can have a bit more control about what it's saving. So that's probably yeah yeah what I would do there. But once you've set this up, now when you say something like quit writing so formally, it's gonna

do the task, then update its agents.md, or in this case claw.md, to keep tone casual, never formal. And then now in any new sessions it's going to keep that preference over time, which is pretty cool.

Connecting Tools via MCP (Model Context Protocol)

So now we've got our executive assistant set up with uh memory and we've given given him his role. We now need to connect our tools, because by default most of these agent harnesses they just um have web search baked in. But if you want to actually start linking it up to your tools like Gmail, Calendar and everything else, which is where the real productivity gains are made, you need to do so uh via what's called MCP.

And I actually got Greg, I got this MCP explanation from when you had on um is it Rossmike? Rossmake, yeah. Yeah. So this he did a great explanation and it just dropped into my head really nicely. And it's basically that before MCPs Your agent or your LLM, in order to speak to tools, it had to kind of learn their language. Because Claude speaks English, Notion speaks Spanish, Gmail French, your browser speaks Japanese, and Slack speaks Chinese.

And it was capable of connecting to those tools, but it like required these extensive custom developments that took a long time. But then uh Anthropic actually created MCP, is that right? Yep, that's right. Yeah. Yeah. Anthropic built M C P to basically sit as this translator. in between your tools so that Claude can still just speak English and your tools can just speak their languages. And this MCP speaks every language and then just translates your calls from your

uh agent to the tool and then from the tool back to your agent. So just set a really easy standardized way to connect tools up. And that's what we're going to be using to connect all of the tools to our executive assistant. So if we go back into cowork here, you can see that Claude make it really, really easy to connect up your tools. You can just go to connectors, browse connectors, and they've got like hundreds of all the like biggest apps that you probably use.

And you can just, you know, add them, sign in, pretty self-explanatory. But I believe codecs would be the exact same. You know, you can go um skills or Um if we go settings, they probably have like uh and then like manus is the same. For example, if you go into Manaus, we can see uh we can go and connect our tools.

Very very similar process and then same with perplexity computer. You know, you've got your connectors and you can connect all your tools in here. It's just all using that um model context protocol, MCP. So I've already, before the episode, gone and connected all of the tools that I use most, like Gmail, Google Calendar, Granola, Notion. They're all set up already as MCPs.

And what I'm going to do now is I'm actually going to open up this executive assistant folder in clawed code to sort of demonstrate how these harnesses are all the same and they work off your local files. And the real future proof AI stack is just having those markdown files on your computer.

And the reason why I like to work in markdown files is because it's just the easiest sort of format for your LLM, for your agent to actually digest and understand, compared to if you were to give it your files as like a docs or a PDF file.

Working in Claude Code for High-Value Tasks

So I like to use clawed code within Visual Studio Code. Um so you can see here I'm just gonna it looks very similar to anti-gravity. I'm just gonna open up our executive assistant folder here. And the way that I see the future of this all going, Greg, is I think that everyone's gonna have their what I call an AI OS, like an operating system.

And this will just compound over time, like you saw with adding the rules and getting less errors. But with adding your tools and then skills, which we'll get into, which is basically just training AI on your processes. And I think that everyone's gonna have like an AI operating system they work in and everyone will just have personal agents and agents to manage each department of their company.

And people won't actually use these apps anymore. Like I've connected up Gmail, uh, Google Drive, Calendar, Granola for my meeting notes. Stripe for payments, notion for project management. And I don't even enter these tools anymore. I just sit in Claude Code as one central place. And an example here is I sent myself before the episode, I sent myself an email from a fake prospect.

And I also entered in granola a fake meeting with this prospect. So now I can say things like um Summarize my inbox from today. Um, you know, so someone might ask like, well, how important is that really? you know, like Is that such a high value task? Like are what are high value tasks that you're actually getting done here? So

Mm I'm or Or maybe you get a lot of emails, you know? Well if you know, emails is a big thing if you do get a lot of emails, but Just having like all those tools connected in one place and not having to switch and and copy paste context.

So you'll see an example here, right? So we've got summarized my inbox from today, which is like one of the most basic agent tasks ever. But we can see this is one I sent earlier. We've got this one email here, like our call today. Excited after your call, wants next step. So I might just say here um

Okay, great. Uh review my meeting notes with Maltoshi from today, and then draft up the email, sending the proposal and creating the Stripe Payment link, and then go into Notion and set up the project.

Why the Real Value Is in Stacking, Not Summarizing

And where this starts to compound even more is when you start to build out skills for each of your processes. Because every time I do a process, even like this, manually prompting it, um, and I know I'm gonna do it again at some point, I'll then just turn that into a skill. Uh and then you eventually end up if you automate like three to five tiny manual processes each week with skills, you eventually end up um automating like your entire life with these aids.

Right. So it's it's not so much in like summarize my my emails where it's super, super valuable. It's like That's where the starting point is and then we wanna like manipulate it and use it and go deeper and stuff like that. That's when Exactly think these tools really, really are valuable. And you can see here it's now connecting all my tools. So it's gone into granola and found the the full meeting of what we went through today.

It's now going into Stripe to create the product link. Yep. And then it's going into Notion to set up the project. And then it will it should create the draft ready for us to go to send out. That this is really like a new way of working, right? Yeah, it is. It is. It it it it it's so new. And I and I even this task, it's really simple, right? Just sending an email based on a call with a proposal link and stuff.

But like, even if you can just do something like seven times faster without having to go into all these tools, copy the meeting notes into the page to give it context on your meeting. It really starts to compound. Then you start to fit like a week in a day and then seven weeks in a week. Um and you stack that up over a year and you're gonna be miles ahead of everyone else. Uh and when we get into skills, you're gonna see how this continues to get even better.

But um you can see here it's drafted the email, it's pulled in all these insights from our call in granola, which is like where I do my meeting notes, and then it's created the stripe payment. Here, ready to go. That's cool. And then now I can just go um send this email. And it will use my Gmail integration to g to go and send it. Uh that's really

It is, hey. I think this is the new way of working. And Cody Schneider, who he had on the poll the other week, I saw a tweet from him and he said Everyone's gonna have like an AI operating system like this and you're gonna have like the hundred X employee because everyone will come into their role with a pre existing AI operating system.

And then build out skills for all their manual processes, similar to how I was describing, and just keep building skills each week for anything manual that comes up until eventually their entire life and work life is automated. Great, as you can see here, it's now created the draft here in Gmail, ready for us to go. And if we're happy with it in the platform, we I could also just ask. Called to send it there and then.

What Are Skills? (SOPs for AI)

Uh but then what like also gets really cool is I'm gonna demonstrate now how I actually build out skills for these processes. So I know I've talked a lot about skills so far. I wanna just give a little overview on what skills actually are. Yep. So the easiest way to think about skills is SOPs for AI. So standing operated A uh standing operated oh my god. Standard operating procedures for AI. So it means you never it means once you explain something once, you never have to explain it ever again.

An example of this is without skills. If you are creating a proposal for a client and you're sitting in your Claude chat or whatever agent harness you're using, And you ask it to create this proposal, you're probably gonna go back and forth a bunch of times. Uh remove change the formatting here, use this color blue for this part, put the price at the bottom instead of at the top. And eventually, maybe after fifteen minutes, half an hour, you land on a proposal that you're really happy with.

and you send it And then next week you want another proposal written. But unless you're going and finding the same session and working in that same session, it's gonna have completely forgotten all of these preferences. And even if you have that memory system set up, these kind of things you don't really want clogging up your memory. They're better off as skills, which is basically it packages up that process.

into a dot skill file. And in that dot skill file, it's basically just a a markdown file that explains the exact process that you went through. So you could create a proposal skill. And then every time you now you need a proposal written, it just takes that skill, knows exactly what to do, and then you can have that uh proposal the same way every single time. So is a skill like a memory file? Like what's the difference between essentially a memory.md

And a skill. Like is it just is a s is um is um is it just like a memory dot md file for a particular job to be done? That's pretty much like exactly it. And all of these agent harnesses pretty much now have skills as a feature. So you can see here, like if we go into um Codex, for example, they've got skills here.

Same with Claude as well. And these when you're working with these agent harnesses that operate like mostly locally off your computer, you can see here it actually operates out of this. Hidden file called a dot claude folder. Skills. And these are all of the skills that I've created. There's tons. And if we open one up, for example, um like this one here, let's find a good one. For example, I've got this one here for writing viral hook.

And in this skill, we have a.skill file, which is basically like your memory.md, which explains the exact process for writing viral hook. And then it's also got packaged in here some references like hook formula. Okay, so wait, so how did you create that skill? Okay, so there's two ways that I find useful to create skills. Is one you can have an idea of a skill you want to create off the bat.

Creating Skills

So like viral hooks, for example, I had this course on viral hooks, which I transcribed, put it into Claude, and Claude has this by default. It has a skill creator skill added into it. Same with all of the uh major agent harnesses. They'll have a skill creator skill. So it's kinda like skillception. You use the skill creator skill and you say, Hey, take this course on viral hooks and create a viral hook skill and it can create it like that. That's one way.

Uh and then it will package it up nicely with that skill.md. It'll do the whole thing for you. Wait. You asked you asked it to take the course. Yeah, I I uploaded the course, like the full transcript of the course. Literally. And I said, uh yeah, based on this course on viral hooks, build me a viral hook skill.

Um and then now I use that for my content team. So you'll just like we're building the executive assistant folder, I've got a folder called content team and I've got that that uses that skill for me. And the second way to create skills is going through a process manually once with Claude. And then if you know you're gonna have to do it again, like that proposal example.

You can just say once you've done the task, hey, create a skill for what we just did, and it will package up that process you went through. And that's the second main way that you can create skills. So in your viral hook example, if you go into that folder again. Yeah. So you have a references folder. Yeah. So that is Probably like was that yeah, let's open it up. I'm just curious. See here it's got a full thing about like screwed. Yeah, this was basically from like a a a course I put into it.

And did you ask it to create a reference folder? Like how should people think about No, it just did it. It just did it. So I think what would be great is if we could actually um demonstrate Building a skill live. Let's do it. Um and so for example, like this process here. Um, I might I could create a skill called like a daily brief skill, you know, that goes through and summarizes like your calendar.

your inbox and your projects in motion and plans out your day for you in the morning. And then you can run that on a scheduled task because a lot of these agent harnesses now are starting to introduce scheduled tasks. So you can just run it on nine AM every morning, use my daily brief skill to prepare me for my day.

But I think another cool one here, just to show you an example, right, of how minute like how intricate I make these skills, is let's just say for this fictional meeting I had with this person. I might say, um Can you draft up an email? I want to refer Maltoshi to my good friend Sebastian, who has an AI automation agency and can help them out better with their needs. And then we can just go um Sebastian's email is And we can just say that, right? And then now it's gonna be able to

take the notes from granola, all the context, and then draft an email connecting these two, um, a prospect with a friend. And you know, I have different like referral things set up like that with people in marketing agencies. And that's just like a little manual process there, tiny. It maybe takes 15 minutes out of my day. But then I can just go.

I want you to use your skill creator skill and create a Sebastian refer skill so that whenever I ask you to refer someone to Sebastian, you know exactly what to do and you know his email address. And then that'll build out that l tiny skill for us, tiny process. But it means I like I know in the future I'm gonna have to refer someone to SEB again. And even if this skill now saves me fifteen minutes another five or six times.

They start to compound when you create skills for every single little process in your business. Yeah. I guess it's like We should just be asking ourselves, like, you know, in our day to day life, like what are all the jobs to be done? Yeah and what are all skills that w we need? Like what are the repet repetitive processes or SOPs as you talked about? And then just Setting up as many as possible. Right? To make our lives easier. Exactly. And just to give you a little demo here. So

I sort of ill I alluded to that folder structure at the start of the video. And this is it here. So I've got workspaces, AI with Remy. And for example, I can open up my content team. And within this folder, I this is just like an a more elaborate version of our executive assistant, but we've got our claw.md in here which explains. Um you are like the main orchestrator, you have these sub agents. It's just a more elaborate version of that claw.md.

But I've got a skill within this for like a meta ads analysis. So that was a process. For example, if you're a marketing agency owner, this is probably like the kind of stuff that you can get inspiration from.

Real-World Example: Ads Analyst Skill: 4-Hour Process in Minutes

Um like ads analyzing, you know, taking competitors ads libraries, breaking down all the creatives um and their landing pages. So I built out this ads analyst skill where I literally just do ads analyst and then I paste in like the ads library URL like that. And I'll click run. And I did an example yesterday with the UDI, which is a super large e-com brand, and it ran through and basically scraped all of it took screenshots of all the landing pages.

It went and scraped all of the ads that they are running, all like 220. It then did a full deep dive here on all the ads. Visual analysis, copy analysis, why did this work? What could be improved? It basically did a breakdown of all the landing pages with screenshots.

And it did a master report here about everything that's going on. So did it just did a full breakdown. And that was like a manual process that I would have gone through when I used to run like my marketing agency. And that probably would have taken me like three or four hours. And then I went through to build out this skill, I went through the process once with Claude. Like I started a fresh session and I was like, all right, go to this ads library URL.

Scrape this, do this, do this, do this for like two hours. And then after I'd done the entire process, I just said, use your skill creator skill to make a skill. for ads analyzing and package up the entire process we just went through as a skill. And then now whenever I want to do that process again, I can just invoke the skill and it and it knows what to do. She's pretty cool. Crazy. Crazy. Absolutely crazy. So ev so now the the refer Sebastian skill is live. And

Now whenever I want to refer someone to Sebastian again, I can just say, um, yeah, refer to Sebastian and it will just start to use it will use that skill. And it's example, they're tiniest process, but you build like those up for all these little tasks you do day to day. And then it just compounds and compounds and compounds. And I can already think of an idea here where you could then you can chain skills together.

Chaining Skills together

So you could example have like a a meeting prep skill. that prepares you for a meeting by researching the guest um and compiling some talking points. You might have like a podcast research skill, for example, Greg, for a guest that's coming on. And you might also create a a morning brief skill. And in the morning brief skill, you can say if there's any meetings coming up or podcasts in my day, use the podcast research skill.

um to research the the the guest. And you can like chain them together and build like some really, really cool workflows. Yeah, and you can have it so it sends you an email, right? Yeah, exactly. And then now these harnesses are starting to get more and more autonomous. Like they're starting to add like you know, in the car, they're starting to add like cruise control and stuff. Like now within most of these harnesses, you can schedule tasks.

Um like in cowork or clawed code now. And you can like for example this one here, you can go new task and I could say like uh run my morning briefing skill. And then set that to go every every morning at nine A. M. And then now it's like an automated workflow that like you've just got running every morning now, which is pretty cool. Yeah. I'm I'm doing this right now, like for example

Real-World Example: Automated Car Search

I'm I'm buying a new car right now and it's like a particularly unique like color that I want and feature set and there's just none none really available. So you know, every three hours I I ha I'm scraping all the different car marketplaces and and then I'm getting a notification that, you know, w when something comes up and it

It's crazy, right? Like it saves me. I'm one of those people that like If I didn't have this, I would be spending an hour of my day just like checking religiously, every single, you know, CarMax and Cars dot com and Auto Trader and all these websites and refreshing like a insane person. Um so yeah. Great great example there. But you know, this like the this is a skill that would be relevant for my executive assistant. Same with that car one. That could be a good executive assistant skill.

But then I've got those more elaborate skills built out for like, you know, my um content team, that ads library scraping one. And then um I've got like, you know, re a research, weekly research skill for my newsletter team and that runs on a schedule every Thursday morning to go and scrape like I've built the skill out so it goes and scrapes Twitter and Reddit to find what's new in AI.

Um but yeah, skills are so so powerful. Um combine them with like your MCP so it can use your tools. And then you can start to just train up your agent on all the processes in your business. And I did a um a build out on open claw for a agent to manage meta ads and it went pretty viral.

OpenClaw and Migrating Agents to More Autonomous Platforms

And the the way I built this was with all these key concepts. So OpenClaw functions the exact same way. So I prov I hope it hasn't timed out, but I've remote accessed into my OpenClaw dashboard here. And you can see. It's just operating off an agents.md file in the back end. But instead of the.clawed folder, it's in a.openclaw folder. And then it's got a couple of these other ones here. It's got a memory.md.

It's got some of these other ones that it's added on, like a soul which tells it its personality and an identity which tells it who it is. But it's that same concept of markdown context files. Connecting your tools and then creating skills. So that uh meta ads manager one that went pretty viral, I just planned it out with Claude. I was like, I want to have this open claw manage my meta ads.

Helped me write the agents.md file to tell it you are my meta ads media buyer, you do these processes. And then I created skills. So I created a Add creative skill. So it knew to go look in the Dropbox folder and create creatives. I created a copywriting skill. So it knew how to write good copy for the business. Um And I just built out there's probably maybe 15 different skills. And then I would combine scheduled tasks, which is cron jobs.

with skills and the context files and then just give it all the tools it needed. Um and just following the same process as we just went through to build the executive assistant, I had an open claw meta ads media buyer, which was sick.

Which Platform Should Beginners Start With?

I love it. And for the beginner, are you like did you recommend people, you know, use OpenClaw or should they be using Cowork or Manis and some of the ones you showed? So great question. Uh I would say that open claw is probably like one of the hardest to learn and set up of these harnesses. I would say cowork is probably the easiest. I think Perplexity Computer, you did a video on it. Um it's pretty simple too. Same with manas. Um but I would definitely learn um and get comfortable using

like clawed code or um one of these other ones before I started to play around with open claw. And I would also uh have all the processes built out in clawed code first. So for example that executive assistant over the next Uh two weeks, I might build out a bunch of skills like the Sebastian Refer skill, like a daily brief, meeting prep.

et cetera, et cetera. And then once I'm happy with how it's all functioning in clawed code, then I could look to migrate that into open core where it has that more autonomous nature to it. So that's kind of how I think about using open claw and those other harnesses. Yeah.

Global vs. Project-Level Skills, Context, and MCPs

Cool. All right. Anything else you wanted to um cover? I mean, like really there's no right or wrong way to to run these. Like that was the executive assistant. I've got ones built out for all the other departments in my business and then other businesses I work on I have the same. Um and you can just kind of build out that structure with what works for you.

Um, you've got like one other thing to mention is global versus project level, which I'll just go over super quick. So, like those skills, for example. Um you can add them at a global level, which means they apply to every single project you work in, whether it's the executive assistant, your head of marketing, and some skills you want globally, because you might use them in every chat.

um truncate skill that I created, which just makes whenever I want to make something shorter, it makes it shorter without compressing the sentences, but just removing sentences that don't need to be there. And that's something I want in every session. So I've got that in global, but you can have project level skills like that. Um Sebastian refers skill. I would not want that with my marketing, head of marketing, because it's just like clogs up the context and you don't need it there.

Um so I would have that as a project level for example. And you can have Um global skills versus project skills, global uh claude.md versus project claude.md and same with MCPs, you can have global MCPs and project MCPs. That's probably the other concept. Um to go over. But look, other than that, that's pretty much the entire agent's crash course. So it's just that loop running in the back end to complete your task. and connecting in your tools, your context, and the LLM all in one place.

Uh and I would just say to to work out what roles you want to start to build out an agent for, go into Claw or your favorite chat model and get it to help you build out those uh context files through an interview style process, just say ask me questions to build this out. I would connect all the tools that you need and then start building out the skills through daily use.

And then pretty soon you're gonna have like pretty powerful AI agents built for every single um aspect and department of your business. Remy, thank you so much. I'll include links uh in the show notes in the description, where you can go follow him, get to know him a little bit better, and uh I appreciate you coming on, dropping some sauce. Thank you, man. Thank you so much for having me on, Greg. It's been a pleasure.

This transcript was generated by Metacast using AI and may contain inaccuracies. Learn more about transcripts.
For the best experience, listen in Metacast app for iOS or Android