A tidal wave of computer vision innovation is quickly having an impact on everyone's lives, but not everyone has the time to sit down and read through a bunch of news articles and learn what it means for them. In Computer Vision Decoded, we sit down with Jared Heinly, the Chief Scientist at EveryPoint, to discuss topics in today’s quickly evolving world of computer vision and decode what they mean for you. If you want to be sure you understand everything happening in the world of computer vision, don't miss an episode!
Last refreshed: ⓘ
Follow this podcast in the Metacast mobile app to refresh it and see new episodes.
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more
Jared Heinly traces the journey of 3D reconstruction from its conceptual beginnings to current advanced methods. He explores the historical development, including early stereoscopes, photogrammetry, and the integration of computers for tasks like bundle adjustment and stereo vision. The discussion progresses to modern techniques such as Structure from Motion, large-scale crowdsourced projects, and the transformative impact of machine learning, exemplified by Neural Radiance Fields and Gaussian splatting. The episode concludes with a look at current applications in areas like autonomous driving and consumer devices.
In this episode of Computer Vision Decoded, our hosts Jonathan Stephens and Jared Heinly are joined by Ruilong Li, a researcher at NVIDIA and key contributor to both Nerfstudio and gsplat, to dive deep into 3D Gaussian Splatting. They explore how this relatively new technology works, from the fundamentals of gaussian representations to the optimization process that creates photorealistic 3D scenes. Ruilong explains the technical details behind gaussian splatting, and discusses the development of...
In this episode of Computer Vision Decoded, hosts Jonathan Stephens and Jared Heinly explore the various types of cameras used in computer vision and 3D reconstruction. They discuss the strengths and weaknesses of smartphone cameras, DSLR and mirrorless cameras, action cameras, drones, and specialized cameras like 360, thermal, and event cameras. The conversation emphasizes the importance of understanding camera specifications, metadata, and the impact of different lenses on image quality. The h...
In this episode, Jonathan Stephens and Jared Heinly delve into the intricacies of COLMAP, a powerful tool for 3D reconstruction from images. They discuss the workflow of COLMAP, including feature extraction, correspondence search, incremental reconstruction, and the importance of camera models. The conversation also covers advanced topics like geometric verification, bundle adjustment, and the newer GLOMAP method, which offers a faster alternative to traditional reconstruction techniques. Listen...
In this episode, we discuss practical tips and challenges in 3D reconstruction from images, focusing on various environments such as urban, indoor, and outdoor settings. We explore issues like repetitive structures, lighting conditions, and the impact of reflections and shadows on reconstruction quality. The conversation also touches on the importance of camera motion, lens distortion, and the role of machine learning in enhancing reconstruction processes. Listeners gain insights into optimizing...
In this episode of Computer Vision Decoded, Jonathan Stephens and Jared Heinly explore the concept of depth maps in computer vision. They discuss the basics of depth and depth maps, their applications in smartphones, and the various types of depth maps. The conversation delves into the role of depth maps in photogrammetry and 3D reconstruction, as well as future trends in depth sensing and machine learning. The episode highlights the importance of depth maps in enhancing photography, gaming, and...
After an 18 month hiatus, we are back! In this episode of Computer Vision Decoded, hosts Jonathan Stephens and Jared Heinly discuss the latest advancements in computer vision technology, personal updates, and insights from the industry. They explore topics such as real-time 3D reconstruction, computer vision research, SLAM, event cameras, and the impact of generative AI on robotics. The conversation highlights the importance of merging traditional techniques with modern machine learning approach...
In this episode of Computer Vision Decoded, we are going to dive into our in-house computer vision expert's reaction to the iPhone 15 and iPhone 15 Pro announcement. We dive into the camera upgrades, decode what a quad sensor means, and even talk about the importance of depth maps. Episode timeline: 00:00 Intro 02:59 iPhone 15 Overview 05:15 iPhone 15 Main Camera 07:20 Quad Pixel Sensor Explained 15:45 Depth Maps Explained 22:57 iPhone 15 Pro Overview 27:01 iPhone 15 Pro Cameras 32:20 Spatial Vi...
In this episode of Computer Vision Decoded, we are going to dive into Pierre Moulon's 10 years experience building OpenMVG. We also cover the impact of open-source software in the computer vision industry and everything involved in building your own project. There is a lot to learn here! Our episode guest, Pierre Moulon, is a computer vision research scientist and creator of OpenMVG - a library for computer-vision scientists and targeted for the Multiple View Geometry community. The episode foll...
In this episode of Computer Vision Decoded, we are going to dive into implicit neural representations. We are joined by Itzik Ben-Shabat, a Visiting Research Fellow at the Australian National Universit (ANU) and Technion – Israel Institute of Technology as well as the host of the Talking Paper Podcast . You will learn a core understanding of implicit neural representations, key concepts and terminology, how it's being used in applications today, and Itzik's research into improving output with li...
In this episode of Computer Vision Decoded, we are going to dive into 4 different ways to 3D reconstruct a scene with images. Our cohost Jared Heinly, a PhD in the computer science specializing in 3D reconstruction from images, will dive into the 4 distinct strategies and discuss the pros and cons of each. Links to content shared in this episode: Live SLAM to measure a stockpile with SR Measure: https://srmeasure.com/professional Jared's notes on the iPhone LiDAR and SLAM: https://everypoint.med...
Join our guest, Keith Ito, founder of Scaniverse as we discuss the challenges of creating a 3D capture app for iPhones. Keith goes into depth on balancing speed with quality of 3D output and how he designed an intuitive user experience for his users. In this episode, we discuss… 01:00 - Keith's Ito's background at Google 09:44 - What is the Scaniverse app 11:43 - What inspired Keith to build Scaniverse 17:37 - The challenges of using LiDAR in the early versions of Scaniverse 25:54 - How to build...
In this episode of Computer Vision Decoded, we are going to dive into one of the hottest topics in the industry: Neural Radiance Fields (NeRFs) We are joined by Matt Tancik, a student pursuing a PhD in the computer science and electrical engineering department at UC Berkeley. He has also contributed research to the original NeRF project in 2020 along with several others since then. Last but not least, he is building NeRFStudio - a collaboration friendly studio for NeRFs. In this episode you will...
In this episode of Computer Vision Decoded, we are going to dive into image capture best practices for 3D reconstruction. At the end of this livestream, you will have learned the basics for capturing scenes and objects. We will also provide a downloadable visual guide for reference on your next 3D reconstruction project. Download the official guide here to follow along: https://tinyurl.com/4n2wspkn 00:00 Intro 04:40 Camera motion overview 07:15 Good camera motions 18:43 Transition camera motions...
In this episode of Computer Vision Decoded, we join Jared Heinly and Jonathan Stephens from EveryPoint for their live reaction to the iPhone 14 series announcement. They go in depth into what all the camera specs mean to the average person. We also explain basics of computational photography and how Apple is able to get great photos from a small camera sensor. 00:00 Intro 02:43 Apple Watch Review 06:58 Airpods Pro Review 09:40 iPhone 14 Initial Reaction 15:05 iPhone 14 Camera Specs Breakdown 37:...
In this episode of Computer Vision Decoded, we sit down with Jared Heinly, Chief Scientist at EveryPoint, to discuss 3D reconstruction in the wild. What does “in the wild” mean? This means 3D reconstructing objects and scenes in non-controlled environments where you may have limitations with lighting, access, reflective surfaces, etc. 00:00 Intro 01:30: What are Duplicate Scene Structures and How to Avoid Them 14:30: How Jared used 100 million crowdsourced photos to 3d reconstruct 12,903 landmar...
In this episode of Computer Vision Decoded we dive into Jared Heinly's recent trip to the CVPR Conference. We cover: what the conference about, who should attend, what are the emerging trends in computer vision, how machine learning is being used in 3D reconstruction, and what NeRFs are for. 00:00 - Introduction 00:36 - What is CVPR? 02:49 - Who should attend CVPR? 08:11 - What are emerging trends in Computer Vision? 14:34 - What is the value of NeRFs? 20:55 - How should you attend as a non-scie...
In this inaugural episode of Computer Vision Decoded we dive into the recent announcements at WWDC 2022 and find out what they mean for the computer vision community. We talk about what Apple is doing with their new RoomPlan API and how computer vision scientists can leverage it for better experiences. We also cover the enhancements to video and photo capture during an active ARKit Session. 00:00 - Introduction 00:25 - Meet Jared Heinly 02:10 - RoomPlan API 06:23 - Higher Resolution Video with A...