Huntpedia: Your Threat Hunting Knowledge Compendium

Speaker 1

00:00

Welcome back to the deep Dive. Today we are we're doing something a little different. We're putting aside that comforting idea that your firewalls and your anti virus software are enough.

Speaker 2

00:11

Yeah, that's a tough pill to swallow for.

Speaker 1

00:13

A lot of people, it really is. I mean, we are opening a source document today called hunt Pedia, and the core premise right from the start is uncomfortable. It basically says, the adversary is likely already inside your network. Right, so the question isn't you know, how do we keep them out? It's how do we find them before they actually achieve whatever it is they're trying to do exactly.

Speaker 2

00:35

It's a fundamental shift in philosophy because for the longest time, the industry was focused on incident.

Speaker 1

00:42

Response, right, which is just waiting for an alarm.

Speaker 2

00:44

Yeah, incident response is essentially waiting for the fire alarm to ring and then scrambling. But thread hunting, which is what hunt pedia is all about, is walking the floor. You're actively sniffing the air for smoke before any sensor even registers a problem.

Speaker 1

00:58

And this document, hunt key, it's really a fascinating collection. It aggregates wisdom from some of the absolute heavy hitters in the industry.

Speaker 2

01:07

Oh yeah, you got Richard Batelitch, David Bianco, Chris Sanders.

Speaker 1

01:10

Right, and it standardizes what used to be considered this I don't know, almost a dark art in cybersecurity.

Speaker 2

01:16

It really does. And it all starts with the mindset. Baelitch actually traces the whole concept back to the Air Force.

Speaker 1

01:22

To hunter killer missions right, exactly.

Speaker 2

01:24

Friendly force projection. The idea is that you aren't just sitting behind a wall defending a perimeter. You are actively engaging within your own territory to flesh out the enemy.

Speaker 1

01:36

And I think that distinction is key for the listener because a lot of organizations out there believe they're.

Speaker 2

01:41

Hunting, but they're really not right.

Speaker 1

01:43

They just have a security operations center watching a dashboard waiting for red lights to blink.

Speaker 2

01:48

Which is entirely passive. That is monitoring. Real hunting is well, it's hypothesis driven. Danny A. Kacki puts it perfectly in chapter one. He defines hunting as finding ways for evil to do evil things.

Speaker 1

02:01

I love that phrasing.

Speaker 2

02:03

It's great. You aren't waiting for a piece of software to tell you something is wrong. You are operating on the assumption that something is already wrong and you're actively trying to prove it.

Speaker 1

02:13

Which brings up this whole man versus machine debate that runs through the entire text.

Speaker 2

02:18

Yeah, it's everywhere in the book.

Speaker 1

02:19

There's this great quote in the intro from the old TV show Airwolf.

Speaker 2

02:23

Oh I remember that show. Yeah.

Speaker 1

02:25

The quote is they haven't built a machine yet that could replace a good pilot, and Betglitch uses that to argue that attackers they can test their malware against all your automated tool Absolutely.

Speaker 2

02:36

They literally buy the exact same endpoint detection software.

Speaker 1

02:40

That you use, right, and they run it in their own labs until they figure out how to bypass it.

Speaker 2

02:44

Exactly, if you're relying solely on automation, you're fighting a completely static defense. The attacker replicates your defense, beats it, and then attacks. But the one thing they cannot test against in a lab is you. Is you not test against? A creative human analyst who I don't know wakes up one morning and decides to look for some highly specific, weird anomaly in the DNS.

Speaker 1

03:10

Logs that unpredictability is the ultimate defense.

Speaker 2

03:13

It is human creativity is the variable they can account.

Speaker 1

03:18

For But you know, we can't just rely on a gut feeling, right, We can't just wake up with a hunch every day. We need a framework to actually direct that creativity.

Speaker 2

03:26

You need structure.

Speaker 1

03:27

And that leads us to probably the most famous mental model in this entire document, the Pyramid.

Speaker 2

03:34

Of Pain, Ah David Bianco's masterpiece.

Speaker 1

03:37

It really is.

Speaker 2

03:38

It is absolutely essential for understanding modern defense.

Speaker 1

03:41

And I think people often misunderstand it at first glance, Like they see a pyramid and they automatically think it's a ranking of how bad the malware is.

Speaker 2

03:49

Yeah, that's a common misconception. Yeah, but it's actually a ranking of how much pain we cause the adversary when we detect them at different levels.

Speaker 1

03:56

It's an economic model, really exactly.

Speaker 2

03:58

It's an economic model for the attack. So at the bottom of pyramid, the wide base, you have things like hash values and IP addresses, the easy stuff, very easy for us to detect, but also incredibly easy for the attacker to change.

Speaker 1

04:11

Right because if I block a malicious IP address, they don't care. They don't they probably have a botnet of ten thousand other ips. They burn one and move to the next. In the literal milliseconds.

Speaker 2

04:23

It costs them absolutely nothing. It's a minor nuisance. You haven't disrupted their operation. You've just made a computer change a variable.

Speaker 1

04:29

But then you move up the pyramid, right, you.

Speaker 2

04:32

Move up through domain names, the network, artifacts than tools, and it gets progressively harder and more expensive for them to change those things until you hit the peak, the pinnacle.

Speaker 1

04:42

Yeah, TTPs, tactics, techniques and procedures. And this is behavior. This isn't what the malicious file is named or what IP it came from. This is how the attacker actually operates.

Speaker 2

04:53

Precisely. Let's say you detect that an attacker is using a technique like past the hash to move laterally through your network. Okay, if you can detect that specific behavior and block that technique, you haven't just burned a cheap IP address. You've burned their education. You've burned the entire methodology they might have spent six months developing and practicing.

Speaker 1

05:14

You force them to completely go back to the drawing board. You're taxing their resources.

Speaker 2

05:19

That is the pain in the pyramid of pain. It costs them real time and real money. And that is why the hunter's mindset has to focus on the top of the pyramid.

Speaker 1

05:28

We want behaviors, not just giant lists of bad IP addresses exactly. So okay, effective hunting is about understanding the behavior of the enemy. But to find that behavior you need a method. I mean, you can't just scroll through millions of log lines hoping to see the word evil.

Speaker 2

05:47

No, you go crazy. And that's what Jack Crook and sergiokel Tajerone argue in chapters three and four. They call that wandering wandering. To actually hunt, you need the scientific method. You need to start with a solid hypothesis.

Speaker 1

06:00

Have to think like the thief.

Speaker 2

06:01

To catch the thief, specifically, you have to think about the thief's needs.

Speaker 1

06:05

Right, because they have a job to do on your network.

Speaker 2

06:07

They do they need to execute code, they need to escalate their privileges, they need to package and move data. So a good hunter sits down and asks, if I were an attacker and I needed to steal the CEO's password, how would I do it?

Speaker 1

06:20

And that question becomes the hypothesis. So, for example, you might say, if an attacker is staging data to exfiltrate it, they might be compressing large files in a temporary directory.

Speaker 2

06:29

Perfect, that is a hunt. Now you go look for rare dot x or seven zip running in the seed drive Windows temp folder.

Speaker 1

06:38

You aren't looking for a virus signature.

Speaker 2

06:39

No, you're looking for the behavior of staging data.

Speaker 1

06:44

To help structure this, the text brings up the diamond model. It connects four points adversary capability, infrastructure, and victim.

Speaker 2

06:52

It seems simple on the surface, but the power is in how it lets you pivot right.

Speaker 1

06:57

It turns a single isolated data point into a whole web of intelligence.

Speaker 2

07:01

Because if you find a capability that's say, a specific piece of malware, you don't just delete it and stop there. You trace the line to infrastructure, where's this malware calling home to?

Speaker 1

07:10

And then you trace the line to victim. Who else in our network has this file?

Speaker 2

07:14

Exactly? It forces you to map the entire campaign.

Speaker 1

07:17

Let's get practical here, because the theory is great, but huntpedia is absolutely full of these specific technical hunts that really bring this mindset to life.

Speaker 2

07:25

Well, the real world examples are the best part.

Speaker 1

07:27

I want to break down three of them that really stood out to me. The first one is from Tyler Hudak, and it's all about DNS collisions.

Speaker 2

07:34

Ah. Yes, this is a classic OOPS vulnerability that attackers just absolutely love to exploit.

Speaker 1

07:40

So it starts with a configuration issue known as split brain DNS. Let's say, inside my company, internally we use the domaincorp dot example dot org. Okay, my work laptop knows that internet dot corp dot example dot org is a private internal server right down the.

Speaker 2

07:57

Hall, right. But then you take that laptop back to a coffee shop. Yep, you connect to the public Wi Fi, and your laptop, just trying to be helpful, shouts out to the local coffeeshop DNS server, Hey, where's interrnet dot corp dot example dot org.

Speaker 1

08:11

Because it's constantly looking for in the background, and since I'm not on the corporate network, that query goes out to the public Internet.

Speaker 2

08:16

Now here is a real danger if your company doesn't actually own the public registration for example dot org dot org, or if using an internal suffix that overlaps with a real public top level domain, an attacker can just register that domain.

Speaker 1

08:31

So the attacker sets up a server on the public Internet that simply says hey, I'm right here, and your.

Speaker 2

08:37

Laptop fully believes them. Hohodec specifically points out the danger of thewpad dot dot.

Speaker 1

08:43

File here WPAD the web proxy autodiscovery file. That's the file that basically tells the browser how to connect.

Speaker 2

08:50

To the Internet right exactly, it configures your proxysettings. So if the attacker serves their malicious WPAD file to your laptop because of this DNS collision.

Speaker 1

09:00

They designate themselves as your proxy.

Speaker 2

09:02

Yep, and suddenly every single bank password, every email, every session cookie you send flows directly through their server before it goes to the real Internet.

Speaker 1

09:11

That is terrifying efficiency. They don't even need to break into your laptop. They just raise their hand when your laptop asked for directions.

Speaker 2

09:18

It's a man in the middle attack handed to them on a silver platter.

Speaker 1

09:22

So the hunt here, bringing it back to the hunter mindset, isn't looking for malware. It's looking for your own internal assets, trying to authenticate to things that shouldn't exist on the public web.

Speaker 2

09:32

Precisely, you are hunting for the misconfiguration before the attacker finds it. You're looking for internal host names resolving to public eyeps.

Speaker 1

09:41

That's brilliant, okay. Hunt number two comes from Chris Sanders, and this one deals with proxy logs. Right now, Normally we rely on our security vendors to categorize the web for us. They tell us this site is sports, this site is gambling, this one is malicious.

Speaker 2

09:56

But the Internet is just too big. I mean, millions of new domain are registered every single day. The vendors simply can't categorize everything instantly, and.

Speaker 1

10:04

Attackers know this. They're registering fresh domains for their command and control servers, their C two's constantly, just to avoid those vendor blacklists.

Speaker 2

10:13

Right, So when an attacker spins up a brand new domain for a campaign today, the proxy vendor hasn't seen it yet, it has zero reputation, so.

Speaker 1

10:20

It just gets labeled uncategorized or unknown exactly. So Sanders says the hunt should focus on that uncategorized bucket. But isn't that incredibly noisy? I mean, legitimate news sites launch all the time, small blogs, local pop up shops.

Speaker 2

10:34

Oh, it could be very noisy. You definitely can't just block all uncategorized traffic, or you'll completely break the Internet for your users. But Sanders suggests correlating that uncategorized traffic with frequency or beaconing behavior.

Speaker 1

10:48

Ah right, because normal human web browsing is entirely sporadic. I read a page, I click a link, I walk away to get coffee exactly.

Speaker 2

10:55

Humans are random, but malware beacons a rhythmic where it needs to check in with the C two server for instructions automated right. So if you see a machine inside your network reaching out to an uncategorized domain every five minutes exactly, twenty four hours.

Speaker 1

11:11

A day, that's a heartbeat.

Speaker 2

11:12

That's C two traffic. That is the signal hidden in the noise. It's hiding in the blind spot of the categorization engine. But the behavior, the rhythm, gives it away completely.

Speaker 1

11:24

Okay, Hunt number three. This one is honestly my favorite because it feels exactly like running a spell checker. Because this is from David Bianco on process impersonation.

Speaker 2

11:31

It's so clever.

Speaker 1

11:32

You know the standard Windows processes right, like Airy Coast.

Speaker 2

11:35

Dot xa right or LSAs dot xe.

Speaker 1

11:38

Exactly, And attackers know that Sissigmand's just scan process lists visually.

Speaker 2

11:43

We read by pattern recognition.

Speaker 1

11:44

We just scan. So if an attacker names their malware SCVHOSD swapping the C in the V, yeah, your brain just instinctively autocorrect sit to system in. You skip right over it.

Speaker 2

11:56

You don't even notice.

Speaker 1

11:57

So how do you catch that without reading every single line of a us log like a lawyer proofreading a contract. Bianco suggests using the Levenstein distance algorithm.

Speaker 2

12:06

It's a brilliant application of a string metric. The Levenstein distance simply counts the number of edits, meaning insertions, deletions, or substitutions required to change one word into another word.

Speaker 1

12:18

So changing syspein to swistem is just swapping two letters, right.

Speaker 2

12:21

And depending on the specific variant of the algorithm you use, like the Damro Levenstein one, a swap of adjacent characters counts as a distance of exactly one.

Speaker 1

12:30

So if the distance is zero, it's a perfect match. It's the legitimate Windows file.

Speaker 2

12:35

And if the distance is say ten, it's a totally different word, entirely not suspicious in this context.

Speaker 1

12:41

But if the distance is one or two.

Speaker 2

12:43

That's the danger zone. That means someone is actively trying to trick.

Speaker 1

12:47

Your eyes exactly. So you just run a script that says, show me every process name running in my environment that has a Levenstein distance of one from a known system binary.

Speaker 2

12:56

It's mathematically identifying deception. Yeah, you don't have to rely on your tired eyes at two am. You let the math find the camouflage.

Speaker 1

13:04

It's using the attacker's desire to blend in against them.

Speaker 2

13:07

It really is.

Speaker 1

13:08

But let's play this out. Let's assume we use these methods. We found the typosquadded process where we found the beaconing proxy log. We actually found that the bad guy on the network.

Speaker 2

13:17

Okay.

Speaker 1

13:18

Segment four of our Deep Dive covers the strategy of the kill, and Scott Roberts introduces a genuinely controversial idea.

Speaker 2

13:25

Here, the Hamilton dilemma.

Speaker 1

13:26

Yes, he quotes the musical Hamilton regarding erinberr, I am not standing still. I am lying in wait.

Speaker 2

13:32

Because the natural instinct of every single security and certainly every manager is kill it right now, get them out right. You see a bad EP, you block it. You see an infected machine, you isolated and reimage it immediately. But Roberts argues that while that might be a tactical win, it's often a strategic.

Speaker 1

13:51

Loss because if you kill it immediately, you show your hand, you.

Speaker 2

13:54

Tell the attacker I see you.

Speaker 1

13:56

Yeah.

Speaker 2

13:56

What do they do? They disappear, they patch their tool, they change their EYEP, and they come back next week. Using a method, you don't know about.

Speaker 1

14:04

You've stopped the immediate bleeding, sure, but you've lost the intelligence. You have no idea who they actually are or what they were trying to steal exactly. But keeping them alive that's incredibly risky. You're knowingly letting a thread actor operate on your live network. How do you possibly justify that to the business.

Speaker 2

14:21

It's a highly calculated risk, and Roberts gives a checklist for it. The first, absolutely most important question is is the victim safe?

Speaker 1

14:29

Right?

Speaker 2

14:30

If the attacker is about to exfiltrate your entire customer database, or if they're staging ransomware to encrypt your servers, you kill it, yeah, immediately, game over.

Speaker 1

14:39

But if they're just doing reconnaissance, if they're just looking around the network.

Speaker 2

14:44

Then you watch, You lie and wait. You sit back, and you see what commands they type, You see what other internal ips they try to connect to. You map out their entire infrastructure.

Speaker 1

14:54

You wait until you can burn their entire operation to the ground, not just chop off one tentacle.

Speaker 2

14:59

Precisely, it fundamentally changes your role from being a digital janitor just constantly cleaning up messes to doing actual counterintelligence.

Speaker 1

15:09

You want to understand the human on the other side of the keyboard. If you kick them out too early, you never learn their objectives.

Speaker 2

15:14

And that really brings us full circle, doesn't it. We started with man versus machine. We talked about algorithms like Levenstein distance and automated tools like proxies, but ultimately Hunt PDIA keeps coming back to the fact that this is a human on human fight.

Speaker 1

15:28

It really is. Automation handles the known threats. It clears out that low hanging fruit at the bottom of the pyramid of pain. It blocks the bad EPs and the known hashes.

Speaker 2

15:38

But the top of the pyramid is creative. It's novel, and it takes a human mind to spot the anomaly that an algorithm simply ignores because it hasn't.

Speaker 1

15:47

Seen it before a machine sees data. A hunter Season.

Speaker 2

15:50

Ten well put the sources heavily emphasized that while AI and machine learning are great, they are not a replacement for human intuition. Is a human being, they will make mistakes, they will have observable patterns. Right.

Speaker 1

16:04

A machine might miss the typo in the process name because it doesn't understand the intent to deceive, But a human hunter, armed with the right hypothesis will catch it every time exactly, which brings me to a final thought for you to chew on. We talked a lot today about how automation handles the bottom of the pyramid for us, the defenders. But what happens when the attackers start using AI to automate the top of the pyramid?

Speaker 2

16:29

Ooh, that's a scary thought, right.

Speaker 1

16:31

What happens when they use large language models to dynamically rewrite their TTPs on the fly, so there is no consistent behavioral pattern for us to track. The top of the pyramid becomes completely fluid. That is the next frontier of hunting, and it's going to require even more human creativity to solve it.

Speaker 2

16:48

Absolutely will, So for.

Speaker 1

16:49

Everyone listening, don't just sit there waiting for the red light on your dashboard to blink. That's the old way. The challenge that hunt Pedia leaves us with is to be proactive. Ask yourself today, if I were trying to hide in my own network, where would I go?

Speaker 2

17:03

And then go? Look there? Happy hunting.

Speaker 1

17:05

Thanks for joining us. We'll catch you on the next deep dive.

Transcript source: Provided by creator in RSS feed: download file

Episode description

Transcript