Leo Boystov on Information Retrieval Science - Weaviate Podcast #38 - podcast episode cover

Leo Boystov on Information Retrieval Science - Weaviate Podcast #38

Mar 01, 20231 hr 28 min
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

Hey everyone! Thank you so much for watching the 38th episode of the Weaviate podcast! This episode features Leo Boystov, an expert in Information Retrieval technology! We discussed a very wide range of topics from an overview of IR methods such as BM25, Neural Bi-Encoder and Cross-Encoder rankers, and a super exciting new work Leo has co-authored on using Large Language Models to generate training data for Neural Ranking models titled "InPars-Light: Cost-Effective Unsupervised Training of Efficient Rankers." We also discussed Leo's work on Non-Metric Space Search, the challenge of long document ranking, Robustness in Generalization Testing, and ended with some thoughts on Hybrid Rank Fusion. I really hope you enjoy the podcast, more than happy to answer any questions you have or clarify anything!

In-Pars Light: Cost-Effective Unsupervised Training of Efficient Rankers - https://arxiv.org/abs/2301.02998

Google Scholar Leo Boystov - https://scholar.google.com/citations?...

Chapters
0:00 Introduction
1:08 Information Retrieval Research
25:20 Ranker Inference Requirements
40:40 Non Metric Space Search
52:38 Code Libraries for IR Research
59:40 Long Document Ranking
1:07:00 Robustness Generalization
1:15:40 Hybrid Rank Fusion

For the best experience, listen in Metacast app for iOS or Android