
Weaviate Podcast

Weaviate · weaviate.io
Join Connor Shorten as he interviews machine learning experts and explores Weaviate use cases from users and customers.

Episodes

Bob van Luijt on Generative Search with Weaviate - Weaviate Podcast #35

This podcast debuts a huge new release from Weaviate... the generate module! The generate module is a new API in Weaviate that facilitates passing YOUR data from the Weaviate database to ChatGPT. This enables ChatGPT to become knowledgeable about your particular business or interests! Here is a great snippet from Bob around the 43 minute mark that describes how this kind of LLM technology is changing the world of database technology, "Yeah so, what I’m really excited about and this is something ...

Feb 07, 2023 · 44 min

Dmitry Kan on Neural Search Frameworks - Weaviate Podcast #34

I am so excited to host Dmitry Kan on the Weaviate Podcast!! Dmitry is a world class expert on emerging trends in search technology! This podcast reflects on Dmitry's latest characterization of the field, the Neural Search Pyramid. This describes the different components involved with building a Deep Learning-powered Search experience from the Approximate Nearest Neighbor index algorithms, to Database functionality, LLM orchestration, Vectorization optimization, Data preprocessing, User Interfac...

Jan 25, 2023 · 1 hr 48 min

Nils Reimers on Cohere Embedding Models

Weaviate podcast #33. Thank you so much for watching the 33rd Weaviate Podcast! This episode features one of the heroes of Deep Learning for Search, Nils Reimers! Nils' work on SentenceBERT is one of the foundational works for applying Deep Representation Learning to text search. This is the idea that personally inspired me to work in this field. Having seen the successes of Contrastive Representation Learning for Computer Vision, I was mind-blown by the possibility of this for NLP and text sear...

Jan 11, 2023 · 56 min

Sam Bean, Zain Hasan, and John Trengrove on You.com and Spark

Weaviate Podcast #32. Thank you so much for watching the Weaviate podcast! We are super excited to host Sam Bean from You.com! As well as welcome Zain Hasan and John Trengrove to the Weaviate podcast for the first time! Sam begins by describing You.com, and then we dive into the Weaviate Spark Connector that Sam played a massive role in creating. I thought this was such a masterclass in the Spark big data technology; John, Sam, and Zain are all data engineering pros and I've never learned more a...

Jan 09, 2023 · 52 min

Weaviate 1.17 Release with Etienne Dilocker and Parker Duckworth

Weaviate Podcast #31. Weaviate 1.17!! This is a massive release for Weaviate, debuting Replication, Hybrid Search, BM25, Faster Startup and Import Times, as well as other fixes! Replication and Hybrid Search are two massive features for Weaviate, and we really hope you enjoy the description of them from the podcast. Please also check out the Weaviate 1.17 release blog post for more information as well - https://weaviate.io/blog/2022/12/Weaviate-release-1-17.html! This is also a very special podc...

Dec 21, 2022 · 43 min

Bob van Luijt, Chris Dossman and Marco Bianco on the future of search

Weaviate Podcast #30. Chapters 0:00 The future of search! 0:42 Welcome Marco and Chris! 4:28 Solving Hallucination with External Memory LLMs 8:16 Bob van Luijt on Weaviate and LLMs, Collaborations 14:48 What we have is not yet what the technology is capable of 16:45 Everything is Search! 18:55 The Magic of Machine Learning 20:30 Asking follow up questions 22:28 Meaning in LLMs and RLHF 27:10 How ChatGPT is Evangelizing the Technology 29:45 What is the future of search from a user perspective? 34...

Dec 14, 2022 · 1 hr 6 min

Matthijs Douze on Quantization and FAISS

Weaviate Podcast #29. Hey everyone, thank you so much for watching another episode of the Weaviate podcast! This episode features Matthijs Douze, one of the most talented and accomplished scientists we've hosted on the Weaviate podcast! Matthijs has pioneered the use of Product Quantization to compress vector representations and enable even faster and more efficient approximate nearest neighbor vector search. Matthijs told an incredible story about the history of this research, from searching fr...

Nov 30, 2022 · 1 hr 13 min

Maarten Grootendorst on BERTopic

Weaviate Podcast #28. Thank you so much for watching the 28th Weaviate Podcast! This episode features Maarten Grootendorst, developer of the BERTopic Python library and an active evangelist of this exciting cluster analysis technology (Maarten has written some incredible articles here - https://medium.com/@maartengrootendorst)! In this podcast, Maarten did an incredible job explaining how BERTopic works, with particular details such as k-Means clustering vs. HDBSCAN, Semi-Supervised topic mode...

Nov 17, 2022 · 53 min

Michael Goin on Neural Magic

Weaviate Podcast #27. Thank you so much for watching the 27th episode of the Weaviate Podcast! This is truly one of my favorite podcasts we have published so far, I think the way Weaviate and Neural Magic fit together is really exciting! Michael did an amazing job explaining the concepts behind how Neural Magic delivers and tests inference acceleration, as well as the vision for the future of Deep Learning with Sparsity and CPU inference. I really hope you enjoy the podcast, more than happy to a...

Oct 26, 2022 · 44 min

Jonathan Frankle on MosaicML Cloud

Weaviate Podcast #26. Thank you so much for watching the 26th episode of the Weaviate Podcast! This is another really special episode! Jonathan Frankle is one of the world's experts in Deep Learning and is making incredible advances at MosaicML in efficient Deep Learning training. The headline event is the release of MosaicML Cloud and a set of new cost estimates for GPT language models at different scales (linked below). Jonathan explains that these numbers are a baseline and he predicts they c...

Oct 19, 2022 · 45 min

Erik Bernhardsson and Etienne Dilocker on Vector Search in Production

Weaviate Podcast #25. Thank you so much for watching the 25th episode of the Weaviate Podcast! This is a really special episode with Erik Bernhardsson! Erik is one of the early thought leaders on Approximate Nearest Neighbor (ANN) Search, creating the ANNOY library at Spotify. Erik shared incredible insights about vector search at Spotify such as the role of Offline and Online Machine Learning inference and the role of multi-stage re-ranking pipelines. Erik has also done massively impactful work...

Oct 06, 2022 · 45 min

Weaviate v1.15 Release with Etienne Dilocker and Dirk Kulawiak

Weaviate Podcast #24. Weaviate v1.15 Release! Thank you so much for checking out the Weaviate podcast -- here is a summary of what is new in Weaviate 1.15: 1. Cloud-native backups – allows you to configure your environment to create backups – of selected classes or the whole database – straight into AWS S3, GCS or local filesystem 2. Reduced memory usage - we found new ways to optimize memory usage, reducing RAM usage by 10-30%. 3. Better control over Garbage Collector – with the introduction of...

Sep 08, 2022 · 1 hr 7 min

Yaoshiang Ho on Masterful AI

Weaviate Podcast #22. Thank you so much for watching the 22nd Weaviate Podcast with Yaoshiang Ho! Yaoshiang is a Co-Founder of Masterful AI, a company doing incredible work in the Computer Vision model training and deployment space (https://www.masterfulai.com/). I really hope you enjoy this podcast; Yaoshiang and I went deep into some of the cutting-edge Computer Vision algorithms such as Noisy Student, SimCLR, and Barlow Twins -- as well as the broader topic of Semi-Supervised Learning in wh...

Aug 10, 2022 · 58 min

Laura Ham on Weaviate User Experience

Weaviate Podcast #21. Thank you for watching the 21st Weaviate Podcast with Laura Ham! Laura Ham has worked on Weaviate at SeMI Technologies for a little over 5 years. She has had a heavy influence on all things from the GraphQL User Experience design to the Graph data model, and the creation of educational content! I really enjoyed this podcast, please see the list of topics under “chapters”! Here are some examples of recent coding tutorial videos Laura has made on “How to add custom modules to...

Jul 27, 2022 · 53 min

Tuana Celik on Question Answering with Haystack

Weaviate Podcast #20. Tuana Celik, a Developer Advocate at Deepset, presented many exciting ideas around Question Answering! We began with her Game of Thrones Question Answering Demo on HuggingFace Spaces and continued to discuss all topics QA from Extractive to Abstractive, benefits of Retrieve-then-Read, and Zero-Shot Generalization, to give a quick preview. For our Weaviate users, please check out this demo from Laura Ham on how to use Haystack QA in tandem with the Weaviate Vector Search Dat...

Jul 13, 2022 · 47 min

Etienne Dilocker on Weaviate v1.14 Release!

Weaviate Podcast #19. SeMI Technologies Co-Founder and CTO Etienne Dilocker returns to the Weaviate podcast to describe what's new with Weaviate v1.14! Please see the chapter outlines if you would like to skip ahead to the update most relevant to you! Please also see this blog post led by Sebastian Witalec describing the new changes to Weaviate! Weaviate v1.14 Blog Post: https://weaviate.io/blog/2022/07/Weav......

Jul 08, 2022 · 48 min

Vincent D. Warmerdam on Applications of Nearest Neighbor Search

Weaviate Podcast #18. Thank you for watching the 18th Weaviate Podcast with Vincent D. Warmerdam! Vincent is an engineer at spaCy working on exciting tools such as Prodigy! Vincent describes how nearest neighbor search can aid in tasks such as Data De-Duplication and Data Labeling! Vincent shared many interesting ideas, from representations of text and challenges with annotator disagreement to lessons from hosting data labeling workshops to train data scientists, and many more!

Jun 28, 2022 · 58 min

Kyle Lo on Scientific Literature Mining

Weaviate Podcast #17. Thank you for watching the 17th Weaviate Podcast with Kyle Lo! Vector Search enables us to find semantically similar items in large collections. Scientific Literature Mining is an extremely interesting case of this where we search through enormous collections of scientific papers to find evidence and ideas. Kyle has an extremely impressive resume in this application domain, tackling tasks such as Question Answering, Summarization, Fact Verification, and more! We really hope...

May 31, 2022 · 1 hr 7 min

Etienne Dilocker on ANN Benchmarks

Weaviate Podcast #16. ANN Benchmarks are a tool for evaluating the performance of in-memory approximate nearest neighbor algorithms. Etienne Dilocker, the CTO of SeMI Technologies, the company behind Weaviate, shares some inside knowledge about this interesting topic.

May 24, 2022 · 45 min

Maximilian Werk on Jina AI's Neural Search Framework

Weaviate Podcast #15. Weaviate is used as a database for Jina AI's Neural Search Framework. In this podcast, Maximilian Werk, Engineering Director at Jina AI, will talk about all things related to this neural search framework together with Connor Shorten. Also, Maximilian will give a Jina Example Walkthrough... Enjoy!!

May 03, 2022 · 1 hr 11 min

UNC research team on VL Adapter for Efficient CLIP Transfer

Weaviate Podcast #14. Thanks for watching the Weaviate podcast! Our 14th episode welcomes Yi-Lin Sung, Jaemin Cho, and Professor Mohit Bansal, a research team from UNC! Our guests present their work on VL Adapter, a technique to achieve full fine-tuning performance while only updating 4% of original parameters!! This is an incredibly interesting finding for the sake of cost-effective tuning of Vision and Language models based on CLIP. We additionally discussed topics around compression bottlenec...

Apr 26, 2022 · 28 min

Data Science with Rick Lamers from Orchest

Weaviate Podcast #13. Rick Lamers is the CEO and Founder of Orchest.io. Orchest is a tool for data scientists that simplifies building data pipelines.

Apr 05, 2022 · 47 min

Jonathan Frankle, Research Scientist in Deep Learning

Weaviate Podcast #12. Please check out Composer from MosaicML! https://github.com/mosaicml/composer Jonathan Frankle is the Chief Scientist at MosaicML and a PhD student in Machine Learning at MIT. Jonathan is the first author of “The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks” which won an ICLR best paper award. You can learn more about Jonathan Frankle here: http://www.jfrankle.com/ . Here is an explanation of how to use the Python library of Composer: https://www.you...

Mar 29, 2022 · 1 hr

CEO Han Xiao From Jina AI

Weaviate Podcast #11. You can now use Weaviate as the document store for DocumentArray in Jina AI. We had the pleasure of talking with their CEO, Han Xiao. See the timestamps below for what it is all about, or check out the recap from Henry AI!

Mar 15, 2022 · 1 hr 17 min

Karen Beckers about The Role of Vector Search in eCommerce

Weaviate Podcast #9. Karen Beckers, Data Scientist from Squadra Machine Learning Company, gives insightful information about how to use vector search in eCommerce in this podcast with Connor Shorten. Some topics are image-based datasets, vector search for data scientists, the future of eCommerce, and many more! See the timestamps below for more information.

Mar 03, 2022 · 54 min

Brady Neal about Causal Inference in Vector Search

Weaviate Podcast #8. Brady Neal from Oogway talks with Connor Shorten from Henry AI Labs about causal inference and much more. See the timestamps below to check out what this podcast is all about.

Feb 28, 2022 · 1 hr 20 min