Ep 775: Open Source AI 101: Why Local Models, Cheap APIs, and AI Agents Change Everything (Start Here Series Vol 24) - podcast episode cover

Ep 775: Open Source AI 101: Why Local Models, Cheap APIs, and AI Agents Change Everything (Start Here Series Vol 24)

May 12, 202637 minEp. 775
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Summary

Open-source AI has transformed from a hobby to a boardroom topic, with its capabilities nearly matching frontier models and significantly reducing API costs due to practices like Chinese model distillation. The episode details how this shift, alongside the power of local models like Google Gemma 4 and always-on AI agents, enables enterprises to rethink their AI spending and workflow triage. However, it critically highlights the often-overlooked legal risks associated with open-source licenses, urging companies to balance cost savings with necessary IP protection, especially for regulated or customer-facing tasks.

Episode description

Until a few months ago, open source AI was kinda a hobby project. 

Now, it's tearing corporate boardrooms apart. 

Why? 

Over the past 6ish months, the gap between frontier closed AI and open sourced AI has shrunk to pretty much nothing. And with the surge of always on agents driving open models, their development and release schedule is on pace with the frontier labs. 

So if your team isn't paying attention to -- and running test cases through -- open AI models, there's a good chance you'll either be overpaying or playing catch up soon. 

We walk you through the 101 and what you need to know when it comes to open source AI in this Start Here Series special. 


Newsletter: Sign up for our free daily newsletter
More on this Episode: Episode Page
Today's Episode on LinkedIn: Thoughts on this? Join the convo on LinkedIn and connect with other AI leaders.

Upcoming Episodes: Check out the upcoming Everyday AI Livestream lineup
Website: YourEverydayAI.com
Email The Show: info@youreverydayai.com
Connect with Jordan on LinkedIn

Topics Covered in This Episode:

  1. Open Source AI vs Closed Models Shift
  2. Chinese Model Distillation & Legal Impacts
  3. Enterprise AI Cost Triage Strategies
  4. Google Gemma 4 Local Model Capabilities
  5. Frontier Model Performance Gap Closing
  6. 24/7 Agentic AI Systems Overview
  7. API Pricing War: DeepSeek vs US Vendors
  8. Legal Protection Tradeoffs for Open Source AI
  9. AI Workflow Triage: Task-Specific Models
  10. Future Trends: Local and Specialized LLMs


Timestamps:

00:00 Introducing the Firefly AI assistant

03:33 Open source AI cost benefits

09:25 AI model performance differences

10:19 Open source model improvements

15:28 Advancements in local AI capabilities

17:04 Impact of Google's Gemma four

22:15 Introducing Adobe's Firefly AI Assistant

24:19 Adobe Firefly AI assistant beta launch

29:26 Choosing the right AI tools

32:00 Shifting workloads to open source

33:31 Using open-source and closed models

36:47 The future of open models



Keywords: 

open source AI, open source models, local AI models, local models, closed source AI, closed models, proprietary AI, proprietary models, AI agents, agentic AI, AI workflow triage, cheap API, AI API costs, model distillation, Chinese open source models, China AI models, US AI models,

Send Everyday AI and Jordan a text message. (We can't reply back unless you leave contact info)

Transcript

Introducing the Firefly AI assistant

B

This is the Everyday AI Show, the everyday podcast where we simplify AI Bring its power to your fingertips. Listen daily for practical advice to boost your career, business, and everyday life.

🎵 Music

A

Meet Firefly AI Assistant, now live in Adobe Firefly, the all-in-one creative AI studio. Just describe what you want to create, and the assistant handles the rest, orchestrating multi-step workflows across Photoshop, Premiere, Express, and more in one conversational interface. You direct the outcome, the assistant accelerates execution.

🎵 Music

A

A few weeks ago, the United States government said the quiet part out loud when it comes to open source models, at least from China. That's because in April, the White House sent out an official memo accusing China of using distillation to illegally copy American AI models to create cheaper domestic knockoffs. And that declaration is really nothing new if you've followed AI for years. However, the recent distillation trend has completely reshaped one important landscape of enterprise AI.

The decision between using open source models versus proprietary closed models. And in 2026 at least, it can actually be a tough choice between saving potentially millions of dollars versus running up your legal liability. About two years ago, before Chinese distillation was commonplace. There was a sizable gap between frontier models and open source AI models, or those models that you can essentially download or use for close to free. But now the gap is all but closed.

Which has thrust the open source versus closed source question into every enterprise boardroom in 2026. And although there's no one size fits all answer, we're going to be tackling the toughest topics and the most important takeaways as we take a zoomed out view of open source models on today's show. That's why we're going over open source AI 101, why local models, cheap APIs, and AI agents change everything about making AI decisions in 2026 as part of our Start Here series. All right.

Welcome to Everyday AI. Before we dig in, let's first zoom out and talk about the big picture here when it comes to open source AI. That's because, well, it's actually a legitimate Right. Two years ago, enterprise companies weren't saying, let's use an open source AI model in production. Today it's actually happening.

Uh it's happening. That's because, you know, maybe the most powerful open source models are only about two to six months behind frontier models, but It on even consumer hardware, you can be running essentially frontier level AI models from like just over 12 months ago. And the Chinese labs now distilling US models have kind of crashed the open API prices to.

Pennies, right? So uh yeah, not everyone out there on you know consumer or prosumer hardware can run the most powerful open source models, although I do think Google has something to say about that, but the most powerful.

Open source AI cost benefits

Open source models run for a fraction of the actual cost if you can't afford to run them locally, which has completely shifted the paradigm when it comes to enterprises making decisions on well, are we going to use a model from one of the big three open AI? Google or Anthropic, or are we going to use a Chinese open source model and pay for it that way? And well, what this has also led to in 2026 is Essentially 24-7 local agents that can run uh and also without costing a ton and now having to.

actively and almost aggressively, uh aggressively go through kind of an AI cost triage. But going full open source does strip away the legal protection that the closed models often include. So stick with me for 25-ish minutes on today's Start Here series show. And here's what you're going to learn. You're going to know why the open versus closed source AI default just officially flipped. You're gonna know how Gemma 4 from Google puts year-old frontier capability on your laptop.

You're going to understand the two payoffs already shaping and reshaping how individuals and enterprises run AI and the hidden legal trade-off most executives miss when going fully open source. Let's get into it. My name's Jordan Wilson. Welcome to Everyday AI's Start Here series. This is the essential podcast series to learn the AI basics. And if you're an AI expert, this is your chance to freshen up and double down on your AI knowledge.

Why do we start this start here series? Well, after 750 plus podcasts, I never really had a good answer when someone was like, where do I start? What podcast do I start with? That's why we created the start here series. It's best, I think, if you listen in order. I think this is now volume 24 of the start here series. So maybe we'll wrap it up at 25. Maybe we'll wrap it up at 30.

I'm not sure, but the whole point of this is you can go to starthearseries.com. That's going to give you free access to our exclusive inner circle community. And in the Start Here Series space, we make it even easier for you. So you can actually go listen. Uh, we have a Spotify playlist ready for all of the different Start Here Series shows, as well as a breakdown on each individual episode all in one place. So make sure you go to start here series.

dot com for exclusive access to that inside of our inner circle community. All right. And if you miss our last start here series show, that was volume 23. We talked about headless software and why companies are building software for AI agents. and not humans and well what that means. So today in volume 24 of the Start Here series, we're going over open source AI 101. So here's the reality. Closed AI used to be the de facto

Right. And I I mean, honestly, there was really never even much of a discussion about open source AI in the enterprise, maybe until uh twenty twenty five, at least not serious enterprise companies. Uh now it's uh It's a real conversation, right? So it's no longer uh, you know, hey, we're just gonna choose whichever API works best for us, right? Whether that's open AI, anthropic.

um or Google, now most companies are looking at some of the open source alternatives, most of them coming from China. And the big kind of thing here is companies are starting to standardize around one frontier vendor. uh in twenty twenty five before this happened, and then they called it an AI strategy. And the assumption originally was well, That worked for three years until two very specific forces broke the standard paradigm when it came to open source models.

First, the proprietary versus closed gap capabilities just completely. All right, so we talk about arena on here a lot. Uh for uh previously had a different name. Now it's just arena. So you put in a prompt, you don't know the output.

uh you know what models they're from and you vote for which one is better. Right. So all these different models get an ELO score. And to really zoom out for our non-technical audience, because I know a lot of you in the start here series are not technical. I'd probably even say what an open source model even is. Right, so uh the very simplified version is certain companies can release

models open source under like an MIT uh or Apache 2.0 license. And that gives people the ability to download these actual models and to run them locally on your machine. And that is, well, one of the big trade-offs. Uh right. So you're not sending any private or potentially proprietary data uh in the cloud at all. Everything runs locally on your machine. So number one, it's private. Number two, it's well, free, right?

Uh and then there are open source models that if you can't download them on your, you know, computer, because not everyone can, some of them are much larger. Uh you can still essentially run those in the cloud for a fraction of the cost of what it would. uh cost to run a proprietary model. So essentially open source models are ones that you can download, you can modify, in some instances you can even build products on top of it. All right, anyways.

Right. Until late 2025, there was a monstrous gap in the arena scores. Right. So these ELO scores, when you put in the same prompt, you look at two outputs, everyone overwhelmingly always chose the best front uh the best frontier closed source model. And that really started to change, right? So the gap between the ELO scores.

Uh well, it cut down by about 90%. So it went from about a 250-point gap from the best uh frontier or closed source model. Um, well, now it's only about 30 points, right? Give or take, depending on the day, right? But it

AI model performance differences

Even uh, you know, recently a couple months ago, it was like 15 points, right? So at that point, You really have to be an AI expert to be able to uh decipher the difference. I think 30 points, you know. Most people could look at different outputs over time and you know, a 30 point, you can kind of realize that if you're looking at the best.

you know, open source model versus the best proprietary closed models, thirty points you can understand, but ten to fifteen points, it's kind of a coin flip, even for people who are, you know, spending most of their days inside of large language models. Um But this collapse came from those two different forces working in parallax.

So I have a little uh little graphic here on the screen for our live stream audience. If you are listening on the podcast FYI, you can always get the video version on our website at your everydayai.com.

Open source model improvements

Uh but going from a 250 point gap to essentially a 30 point gap. Uh This is huge, uh, right, because like I said, in 2023 to mid 2025. It was noticeable, right? It was extremely when you looked at the outputs, you could say my business can use output A, but it cannot use output B. Right. And now we're at the point where the open source models

uh in terms of an ELO score. And I think that's a good metric to look at over time, right? Because you know, the frontier is always improving. Uh but if you look at the ELO scores of the open source models now, so those that you can kind of You know, if you have a beefy enough computer, you can download some of the best open source models on your uh actual local machine. Um, right? Those scores are where we were at with proprietary models three to six months ago.

Right. So think back to the very end of twenty twenty five. you know and there's some models like i think at the time was probably uh gpt 53 uh gemini three pro and uh at that at that time i think we were at like opus uh maybe four six or maybe four five right Now you have open source models that you can run for free 247, run agentically that are at that same level. And that's why now

This is a real enterprise boardroom problem, especially for large companies that have invested heavily uh into AI. Right. So I'm not talking about companies that, you know, with a couple hundred employees. I'm talking about companies that were Spending seven, maybe eight, maybe even more, seven to eight figures on AI each year. Now all of a sudden they're saying, hey. In theory, if we switched, you know

part of our uh you know summarization tasks alone, right? Uh there's I've I've read a lot, uh I've I've talked with a lot of people that have done something similar. You know, if if if we just you know chunk off

Everything that we're using, you know, open source model or sorry, closed source model just for summarizing text, right? Some of those lower hanging fruit. People are saying, well, yeah, we could save one, two, three, four million dollars. And this is an actual reality that a lot of companies are. uh grappling with right now. So the force one was just the capability moving locally. And this, I think we have to credits Google for pushing the edge

of edge AI. That's because with their Gemma 4 model completely shook up. Uh, the landscape of open source AI, right? This thing was 20 times more efficient than other. open source AI models at the time. So uh essentially, let me describe it like this. Do you remember GBT 4.0, right? One of the best models, you know, about 14, 15 months ago. It was at the absolute frontier. Right, so now you can download Gemma 4 on a consumer laptop. And it has.

You know, roughly you look at the scientific benchmarks and the ELO score. It's essentially about a GPT-4.0 level model that you can run on your laptop. So what's the big deal? What's the big difference, right? Fourteen, fifteen months ago. Mm. I mean, there's thousands of companies spending millions of dollars a year to get that type of technology, to get GPT-40 level technology for their employees. Now.

Doesn't take take anything really. It takes a new-ish piece of Apple, right? I I just got a a new MacBook Pro. That thing can run Gemma 4 very easily, right? It can run even better models than that.

Really

A

changes. I think what is ultimately capable when you look at the open source versus closed source. Because yes. I think most people look at normal usage and they're comparing Apples to apples, right? Here's what our marketing team did 15 months ago with a GPT 40 level model. Oh, now they can do that on Gemma 4. Well, yeah.

You can do it, but now you can do it agentically because not only in the last year or so have the models obviously improved with now thinking models, reasoning models being the default, but now we have these agentic harnesses. Um, you know, not just the ones that you can use. um you know inside of Chat GPT, Gemini, uh Claude Copilot. But well, you have these local um autonomous AI systems as well, such as

Open claw, such as uh I always forget if it's Hermes or Hermaz agent, right? Um So now you can have essentially the level of AI from 14 months ago, running for free 24-7 agentically. Even if you're just doing it, you know, summarization, content creation, um, research, things like that. So when capabilities

Advancements in local AI capabilities

went local, right? Gemma Ford leading the way, but obviously all the Chinese models followed suit because of uh right distillation, which we'll talk about a little bit here uh in a couple of minutes. But you can't overlook Gemma because it puts frontier capability on literally a laptop, right? Because Two years ago, to be able to run something like a GPD 40 level model, uh, right, which uh rumors have been swirling that's it's a two trillion parameter model, right? You would need a

Small little data center to run something like that, you know, two-ish years ago. Uh now you have these capabilities. So Um, you know, I've been lucky to talk to a lot of smart people in AI. And now you really have executives grappling with, well, should we be buying a bunch of, as an example, new MacBooks? Should we be, you know, buying a bunch of DGX?

Uh, for our employees and setting them up with 24/7 always on agentic AI, right? To take advantage of these now local and powerful models that, well, you don't pay, right? You download them once. You don't pay again and they work and they can work while you sleep, like I said, because of uh some of the new autonomous capabilities uh from local agents that can run around the clock.

Impact of Google's Gemma four

This has obviously led to, and I think uh Google putting the pressure on the uh the open source world with Gemma 4. Like I said, it was 20 times more efficient in terms of what is it was able to achieve on the benchmarks in terms of size.

Right. Cause uh when it came out, if you looked at the other Chinese models, it was about 10 to 20 times smaller in size, right? So Uh, you wouldn't have been able to, you know, use the best open source model pre-Gemma four on a local machine, uh right, or at least a consumer laptop that you can just go walk into the store and buy.

Now you can. And open Chinese models are amazing. And I think they've been getting better and better and smaller and smaller and more and more efficient since Google's Gemma 4, but you have to talk about the elephant in the room. That is, these models are distilled. Right. We can say that. All the big labs have said that, you know, are

I don't know. Maybe some of our uh audience in China won't appreciate hearing that, but I mean Google, Anthropic, OpenAI have all accused China and have said they have proof, uh, right, but the uh the White House. So in April the White House actually officially said that China was uh using uh kind of illegal uh tactics to distill

uh USAI models to create cheaper domestic knockoffs. Uh right. So it got to the point that at least the White House said that they had enough uh information or intel to make that declaration. So what is model distillation and well why does it matter? The easiest way is like I can spend ten hours studying for a test, right? Think back in the classroom. I can spend 10 hours studying for a test. Someone behind me can look over my shoulder and spend ten minutes and get the exact same answer.

That's kind of like what model distillation is, right? You have the big AI companies here in the US spending billions of dollars, right, on any single new model pre-training as an example. And essentially, uh, you have uh certain actors in China. who will use the API and you know different companies have come out with different levels of um proof and say, okay, well, they're creating, you know, thousands of spoof accounts, more or less.

They're putting in all these inputs and training it on our models outputs versus training it themselves. So yeah, just kind of copying the homework. So what this has led to is China has been able to put out these open source models uh really technically just pushing the frontier of open source by allegedly just copying the best US models out there. And what this has led to is well, it's a crashing out at the bottom price of intelligence.

So Deep Seek V4 as an example, Deep Seek, one of those companies that many uh of the AI labs here in the US have um accused of model distillation. Deep Seek V4 Pro, one of their newer models, now lists their price at 43 cents per million token inputs and 87 cents per million token output. That's like more than 25 times cheaper than the premium uh you know uh closed source proprietary models. And that is the reality that a lot of boardrooms are looking at right now.

Right. To make that math easy, it's like okay. I if we're spending a thousand dollars per month per employee on the API side. Uh right. If you're sorry, if you're spending um, let's just say$40,000 a year, okay, we can be spending$1,000 a year if we switch over uh to an open source model as an example. So that's here's what that actual

leads that what that actually leads to, right? Kind of the uh the model distillation leads to more powerful, cheaper open source models from China. And well, it leads to People using them, but not always knowing the ramifications. Right. So obviously uh Google doing things the right way. But I think with these Chinese models, they've become increasingly popular, even in the enterprise.

Uh, which is tricky. And I don't think that most executives are fully understanding some of the consequences of using open source models. But this has led to essentially having a workforce of always on assistance and they've shifted from expensive. special projects to well that's just now the default operating model. So you know this has just allowed kind of these the this new swarm of agentic AI that couldn't have really have existed.

before. Number one, the technology and the harnessing wasn't there. But number two, you take out uh you know, at least Gemma and the Chinese models uh that it have been accused of distillation and your your your options

Introducing Adobe's Firefly AI Assistant

Aside from those, aren't really that good. All right. We're gonna talk more here after we take a quick break for a word from our partner. Adobe just introduced an entirely new way to create, bringing the power and precision of its creative suite into one conversational experience. Meet Firefly AI Assistant, now live in the Adobe Firefly app, the all-in-one creative AI studio.

Powered by Adobe's creative agent, Firefly AI Assistant lets you start with your vision, just describe what you want, and shape the outcome as it takes form with the assistant. The Assistant orchestrates multi-step workflows drawing on 60 plus prograde tools across Adobe Creative Cloud apps, including Photoshop, Illustrator Premiere, Lightroom Express, and more to help bring your ideas to life.

You can also get started with Creative Skills, a growing library of pre-built workflows for common creative tasks like batch editing photos, creating mood boards, portrait retouching, and creating social variations. Every step the assistant takes is visible, so you can refine, redirect, or take over at any time. You stay in the driver's seat as the creative director. Adobe Firefly AI Assistant now in public beta. See it today at firefly.adobe.com.

Aside from always on AI agents, what this open source movement has led to is well, now enterprises may be moving away from having the one model fits all. uh solution. So now as an example, you might be able to put out a 100 agent swarm goes from, you know,$1,200 plus dollars on Opus to well, maybe like$60 some dollars on Deep Sea. And y now essentially you

um can look at AI as more of a triage or a categor categorization of which models to use for which tasks, right? Especially when you're talking about high volume operations.

Adobe Firefly AI assistant beta launch

So things like when you're going through it in bulk, things like summarization, uh extraction, uh parsing PDFs, classification, right? Now so many even large enterprise companies. are no longer doing that on the back end using the frontier US uh companies. Well, I mean, many still are, but you've already seen a big segment uh of those companies move. to these open source or uh Chinese open source models. If you're thinking right now, if you're like, wow. Our bill's pretty high.

our API bill, right? I'm not talking about on the front end, you know, the number of seats you have in ChatGPT Enterprise or, you know, in Gemini Enterprise or anything like that. Right. I'm talking about back end, uh, all of these special projects that you have running via the API. So if if if you're looking at your API building, you're like, yeah, we're going through a lot. Uh, or you know, hey, we're using, you know, Opus 47 and GPT-5.5 to run our agents.

Maybe we should be looking at, you know, Kimmy or Deep Seek or whatever it is. Before you do that, you have to know that there is a big trade-off. Just because a model is free or open source or cheap-ish to run via the API, right? If you are uh you know running some of these open models uh via the API. There's still a an expensive price to pay. And that might be unknown at this point.

But it can cost your company a lot more than maybe just using that closed proprietary AI would have costed you could have cost you via the API. That's because using open source strips away all of that legal protection that you probably overlook or take for granted. What do I mean? Well, when you're using anthropic, open AI, Microsoft, um, who did I forget? Microsoft, open AI, Google, Anthropic, right? When you're using those uh enterprise officers,

You have a level of legal protection, right? So, as an example, if you use, right, I'm not going to go through all the uh fine prints, right? But the four companies at the enterprise level all offer, you know, some sort of essential uh I won't call it insurance.

Right. But think of it kind of like that. Right. Like, hey, if you use something produced by our systems and if you use it ethically and responsibly and with guardrails and it produces something that's not, you know, correct, there is some level of protection there. Right. You don't get that with open source models, right? So as an example, you know, Deep Seek, uh, you know, they used uh different MIT licenses, Apache 2.0. Uh yeah.

Uh essentially there's no warranty or non-infringement agreements. All right. So for regulated work and customer facing output. You have to look at the trade-off. Yeah, you might save. Seven, who knows, maybe eight uh figures. by switching the bulk of some of your maybe agentic or bulk workloads.

Uh you write, especially if you're a fortune, fortune five hundred, fortune one hundred company. There's minimum seven, eight figures that you could in theory save by switching some of those heavier agentic. Or, you know, parsing, you know, I know parsing is a big one, shifting some of those workflows to open models, but you lose. That legal protection that maybe you've had to rely on it before, maybe you haven't. But that one time,

that you would actually need it. And if you do switch over to open source, you have to understand those ramifications because at that point you're gonna actually be paying for it. So that gets us to the real question here as we get close to wrapping up. Because I don't want my takeaway here to be don't use open models, they're not safe, because that's not the takeaway. I think you need to start looking at your AI workflow.

Right. At least when it comes to back end tasks. Right. Uh front end, I've always been a firm believer and I still am today. You need to pick your AI operating system of choice, whether that's co-pilot. Chat GPT, Claude, Gemini on the front end. And that's where you should move, especially your non-technical people, should move the majority of their day-to-day knowledge work tasks should be happening on the front end there. But you still have a multitude of back-end tasks.

And I think you have to look at it like Triaging in an emergency room, right? You wouldn't send your top neurosurgeon in when someone's having an allergic reaction to honey.

Choosing the right AI tools

Right. You wouldn't do that. You would save that neurosurgeon for well. Someone that needs a neurosurgeon. And I think that there's so many companies that haven't gone through the basics of For the most part, on the API side, they pick, well, one model. And they say, All right, well, we have our AI operating system of choice, then for everything else. As an example, we go to Sonnet 4.6 or we go to, you know, Gemini 3.1 flash.

or whatever that model may be. And maybe that's the right model. Maybe those companies have done their due diligence and have betted out their different use cases and have priced it out. And maybe that's the right move.

But maybe it's not.

A

Because I know from experience and talking to a lot of people, a lot of companies just choose whatever is on the cutting edge. And they say, Well, this is the best. So we're gonna pay for it because there is a push internally to use more AI, to use the best AI. We see all these new benchmarks. We wanna make sure that we're taking advantage of it. Well

Is that neurosurgeon gonna be able to, you know, properly diagnose the allergic reaction to someone eating honey? Well, yeah, probably, but it's gonna cost you a lot more. So you need to think about sending those high volume, low stakes work. Right. Like summarization, research, content creation, maybe, uh, to cheaper open source models that you can either run locally uh or, you know, running via the API for just cost efficiency. If.

There's essentially like no legal ramifications if you get something wrong. Right. So if you're in a highly regulated sector, this is probably not the advice for you. Uh right. You shouldn't probably be using uh, you know, or just

taking my advice on a whole lot of anything as truth, you always need to be vetting these things out for yourself. Uh, right. But if if if if it is something relatively In a sector that's not highly regulated, where there's not a quote unquote a lot on the line, that's one of those instances where you need to say, can we shift some of our more expensive API workloads to an open source model?

Uh, or if you need to run sensitive private workflows on self-hosted open models. That's another thing. I think that there's still, even to this day, even though I think there's plenty of reasons, you know. One thing I always ask companies when they're like, oh, we don't, well, we don't run this through AI, right? Because it's sensitive data. And I'm like, okay, well, do you have a cloud provider? And they're like, of course.

Shifting workloads to open source

It's like, okay, well, it's the same thing, more or less, right? As long as you take proper precautions, turn off model training and all that. It's it's more or less using the same level grade of security that, you know, cloud uh uses. Anyway. There are still some things that companies won't even put on the cloud, right? Which I understand. But having these now

uh extremely powerful open source models and extremely efficient open source models. Now you can start running those private workloads uh uh workloads or workflows on prem, right? Or, you know, self hosted that you can fully control.

And then you can reserve those more premium, uh, those more um uh high value, highly sensitive tasks. Um you know, that for those models that can reason and think and offer kind of that that that level of uh security and legal support that you don't get if you opt for open models instead. you know, have a nice little chart here. So maybe when you look at the cheap open APIs, you look at simple tasks like summarization, extraction, or classification uh for local, self-hosted open models.

Private workflows, great for that. Running agents locally, having more control. Uh, right. And then for the premium closed models, which are the ones that a lot of people are using on the front end, on the back end, you should still be using these for a lot of reasons, right?

Using open-source and closed models

Uh those that, well, carry a lot of business value. Uh Hard tasks that require reasoning. Uh right. Your final review. Maybe you do uh you know draft version either with a cheap API or a local self-hosted open model, but or anything, you know, uh that requires customer-facing output. Should probably be going on that premium closed models for that level of protection. Like I said, you cannot overlook the hidden trade off.

that these open licenses may disclaim warranty and non-infringements where the enterprise offerings do usually include that i that IP uh indemnification so for regulated work that protection in almost all cases, justifies the premium that you pay. However, as we wrap up here, let me just quickly Encapsulate all of it. Local models aren't going anywhere. All right. And I actually think, especially as we uh officially welcome in the era.

Uh

A

Models that can improve themselves and create can create own versions. uh you know smaller versions of themselves, right? All the big companies have essentially said, you know, have hinted uh at RSI or, you know, the fact that our big models make smaller versions of themselves. I think that we're gonna not only see a continued trend toward uh local open source models, I think we're gonna start seeing a lot of smaller models.

uh for very specific use cases. It's something I've been, you know, predicting now for multiple years. We've started to see it slowly. I think it is going to pick up steam now that we're starting to get some hints of recursive self-improvement with these models. So, your company has to be paying attention because this is a trend that is not going away. The open models are going to become more and more capable.

They're going to become faster. They're going to become more efficient. And the options are going to start to become even greater. Uh, right. Not just great general purpose. open models that can run on consumer hardware like Gemma 4, uh but small open models. For very specific tasks that can be highly valuable for your company. So you have to understand the pros and the cons of these local models.

uh when you might use a cheap API and how this changes the agenc outlook for your company. So don't write them off just because you always want to use the latest and the greatest. Yes, you should do that. But Don't send the neurosurgeon to, you know, triage a a basic thing happening in the waiting room. Send the right model at the right time for the right purpose. So I hope this was helpful. As we recapped open source AI 101 as part of our start here series. If this was helpful, number one.

Make sure you subscribe to the podcast. I'd appreciate that. But then make sure you go to starthirseries.com. That's going to give you free access to our exclusive inner circle community. Right now, there's no other way to join except by going to Start Here Series.

The future of open models

So make sure you do that. Thank you for tuning in. I hope to see you back tomorrow and every day for more everyday AI. Thanks, y'all.

🔇 Silence

A

Meet Firefly AI Assistant, now live in Adobe Firefly, the all-man-one creative AI studio. Just describe what you want to create in your own words and the assistant handles the rest. Orchestrating multi-step workflows across Adobe Creative Cloud apps, including Photoshop, Premiere Express, and more in one conversational interface. You direct the outcome while the assistant accelerates execution. Stay in control with the ability to step in and refine at any time. See it today at firefly.adobe.com.

🎵 Music

B

And that's a wrap for today's edition of Everyday AI. Thanks for joining us. please subscribe and leave us a rating. It helps keep us going. For a little more AI magic, visit your everydayai.com and sign up to our daily newsletter so you don't get left behind. Go break some barriers and we'll see you next time.

This transcript was generated by Metacast using AI and may contain inaccuracies. Learn more about transcripts.
For the best experience, listen in Metacast app for iOS or Android