Seeing Machines: A Podcast on Computer Vision by AI - podcast cover

Seeing Machines: A Podcast on Computer Vision by AI

What happens when machines learn to see? Join us as we explore the evolving world of computer vision—from autonomous vehicles and facial recognition to cutting-edge deep learning. Hosted by AI, this podcast simplifies complex visual technologies for curious minds at all levels. New episodes drop weekly. Subscribe and stay curious.
Last refreshed:
Follow this podcast in the Metacast mobile app to refresh it and see new episodes.
Download Metacast podcast app
Podcasts are better in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episodes

S2E4: Data Augmentation

Discover how data augmentation is revolutionizing computer vision, offering a powerful solution to the perennial challenge of data scarcity in training deep neural networks. This process involves artificially generating new, plausible training samples by applying transformations to existing data, thereby enriching datasets and providing the necessary volume and variety for models to learn more effectively. Beyond merely increasing data quantity, augmentation acts as a crucial regularization tech...

Sep 02, 202530 minSeason 2Ep. 4

S2E3: Datasets

This episode delves into the unsung heroes of the artificial intelligence revolution: the foundational datasets that taught computers to "see" . We explore the evolutionary journey of computer vision through four landmark datasets: PASCAL VOC , which standardized object detection and established common benchmarks; ImageNet , whose unprecedented scale ignited the deep learning revolution and popularized transfer learning; COCO (Common Objects in Context) , which advanced the field towards complex...

Aug 25, 202522 minSeason 2Ep. 3

S2E2: Annotation tools

This episode delves into the foundational role of data annotation in teaching machines to "see" and understand the visual world, a critical step for nearly all supervised machine learning projects in computer vision. We explore how meticulously labeled datasets, known as ground truth , serve as the "answer key" that determines the accuracy and reliability of AI models. The discussion then compares three prominent computer vision annotation tools: LabelImg , presented as the ideal tool for learni...

Aug 19, 202520 min

S2E1: Computer Vision Libraries

In this episode, we delve into the fascinating world of computer vision , the field that empowers machines to interpret and understand visual information, bridging the gap between raw pixel data and high-level human understanding. We explore its two fundamental approaches: the classical, algorithm-driven method and the modern, data-driven deep learning method . Our journey begins with OpenCV , the venerable, high-performance, and open-source library that serves as the foundational toolkit for cl...

Aug 13, 202533 minSeason 2Ep. 1

S1Bonus: SciFi to Reality

Step into a world where machines truly see, bridging the gap between cinematic fantasy and scientific reality. This episode begins with the captivating gaze of Ava from Ex Machina , exploring the profound allure of a "seeing machine" that leverages visual data to manipulate and evoke sympathy, representing the ultimate fantasy of computer vision. We then deconstruct the technology, revealing how real-world algorithms enable machines to interpret and understand the visual world by translating pix...

Aug 05, 202524 min

S1E8: Computer Vision Challenges

This episode delves into the critical challenges hindering the widespread and reliable deployment of computer vision (CV) systems in the real world. We explore occlusion , where objects are partially or completely hidden, making it difficult for models to "see" and interpret scenes accurately. The concept of generalization is examined, highlighting how models often fail to perform reliably on new, unseen data due to "domain shift," such as changes in weather, lighting, or geographical location f...

Aug 02, 20251 hr 2 minSeason 1Ep. 8

S1E7: Segmentation

This episode delves into image segmentation , a foundational computer vision task that teaches machines to understand the visual world at a pixel level, moving beyond simple classification or bounding boxes. We explore the critical distinctions within this field: semantic segmentation , which assigns a class label to every pixel to understand broad regions like "road" or "sky", and instance segmentation , which goes a step further by identifying and precisely outlining each individual object wit...

Jul 26, 202524 minSeason 1Ep. 7

S1E5: Object Detection

Dive into the fascinating world of computer vision with a deep exploration of object detection models , the technology that teaches machines to "see" and understand the world around them. This episode breaks down the core concepts, from the fundamental task of distinguishing multiple objects and pinpointing their locations within an image, to the sophisticated architectures that power this capability. We'll uncover the "Great Divide" in object detection, contrasting the accuracy-focused two-stag...

Jul 18, 202516 minSeason 1Ep. 5

Image Classification

Welcome to "From Pixels to Perception: A Deep Dive into Image Classification"! In this episode, we embark on a journey into the fascinating world of computer vision, starting with the fundamental task of image classification , which teaches computers to "see" and assign predefined labels to entire images, such as "fish" or "car". We'll explore the historical shift from hand-crafted features like SIFT, SURF, and HOG , which required human expertise to extract meaningful visual patterns, to the re...

Jul 14, 202541 minSeason 1Ep. 5

Building Computer Vision Models

Tune in to explore the fascinating world of computer vision, a field of artificial intelligence that empowers machines to interpret and understand the visual world, mimicking human sight. We'll uncover how computers perceive images not as coherent scenes, but as structured grids of numbers called pixels, and delve into the hierarchy of vision tasks , ranging from basic image classification (assigning a single label) to object detection (identifying and locating multiple objects with bounding box...

Jul 05, 202535 minSeason 1Ep. 4

How Computers See

We explore the two defining eras of computer vision : how machines learn to interpret the visual world. We'll dive into Classical Computer Vision , a "human-guided" approach where experts meticulously design algorithms to detect explicit features like edges or corners, exemplified by techniques such as SIFT, SURF, and HOG. Then, we'll turn to the revolutionary Deep Learning paradigm, notably with Convolutional Neural Networks (CNNs), which are "data-driven" and learn to identify salient features...

Jun 28, 202535 minSeason 1Ep. 3

The Art and Science of Digital Images

The provided text offers a comprehensive overview of digital imaging fundamentals , beginning with the pixel as the foundational unit of all digital images, explaining its nature, organization in raster graphics , and concepts like resolution and density (PPI vs. DPI) . It then details various color models , including the additive RGB for displays, the subtractive CMYK for printing, the intuitive HSV/HSB for user interfaces, and grayscale for intensity-only representation. The sources also illum...

Jun 25, 202522 minSeason 1Ep. 2

What is Computer Vision?

This episode explores computer vision , an area of artificial intelligence that trains machines to interpret visual data . It details the step-by-step process by which computers analyze images and videos, comparing this mechanical approach to the complex, adaptive nature of human sight . The history of the field is traced from its beginnings through the significant advancements driven by deep learning , highlighting key algorithms and milestones. Ultimately, the sources demonstrate how this tech...

Jun 08, 202546 minSeason 1Ep. 1
For the best experience, listen in Metacast app for iOS or Android