Jay Shah Podcast - podcast cover

Jay Shah Podcast

Interviews with scientists and engineers working in Machine Learning and AI, about their journey, insights, and discussion on latest research topics.
Last refreshed:
Follow this podcast in the Metacast mobile app to refresh it and see new episodes.
Download Metacast podcast app
Podcasts are better in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episodes

Why Open-Source AI Is the Future and needs its 'Linux Moment'? | Manos Koukoumidis

Manos is the CEO of Oumi, a platform focused on open sourcing the entire lifecycle of foundation and large models. Prior to that he was at Google leading efforts on developing large language models within Cloud services. He also has experience working at Facebook on AR/VR projects and at Microsoft’s cloud division developing machine learning based services. Manos received his PhD in computer engineering from Princeton University and has extensive hands-on experience building and deploying models...

Apr 15, 20251 hr 20 min

Differential Privacy, Creativity & future of AI research in the LLM era | Niloofar Mireshghallah

Niloofar is a Postdoctoral researcher at University of Washington with research interests in building privacy preserving AI systems and studying the societal implications of machine learning models. She received her PhD in Computer Science from UC San Diego in 2023 and has received multiple awards and honors for research contributions. Time stamps of the conversation 00:00:00 Highlights 00:01:35 Introduction 00:02:56 Entry point in AI 00:06:50 Differential privacy in AI systems 00:11:08 Privacy ...

Feb 04, 20251 hr 29 min

Reasoning in LLMs, role of academia and keeping up with AI research | Dr. Vivek Gupta

Vivek is an Assistant Professor at Arizona State university. Prior to that, he was at the University of Pennsylvania as a postdoctoral researcher and completed his PhD in CS from the University of Utah. His PhD research focused on inference and reasoning for semi structured data and his current research spans reasoning in large language models (LLMs), multimodal learning, and instilling models with common sense for question answering. He has also received multiple awards and fellowships for his ...

Dec 24, 20241 hr 49 min

Time series Forecasting using GPT models | Max Mergenthaler Canseco

Max is the CEO and co-founder of Nixtla, where he is developing highly accurate forecasting models using time series data and deep learning techniques, which developers can use to build their own pipelines. Max is a self-taught programmer and researcher with a lot of prior experience building things from scratch. 00:00:50 Introduction 00:01:26 Entry point in AI 00:04:25 Origins of Nixtla 00:07:30 Idea to product 00:11:21 Behavioral economics & psychology to time series prediction 00:16:00 La...

Sep 19, 20241 hr 10 min

Generative AI and the Art of Product Engineering | Golnaz Abdollahian

Golnaz Abdollahian is currently the senior director of big idea innovation at Dolby Laboratories. She has a lot of experience developing and shaping technological products around augmented and virtual reality, smart homes, and generative AI. Before joining Dolby, she had experience working at Microsoft, Apple, and Sony. She also holds PhD in electrical engineering from Purdue University. Time stamps of the conversation 00:00 Highlights 01:08 Introduction 01:52 Entry point in AI 03:00 Leading Big...

Sep 05, 202435 min

Future of Software Development with LLMs, Advice on Building Tech startups & more | Pritika Mehta

Pritika is the co-founder of Butternut AI, a platform that allows the creation of professional websites without hiring web developers. Before butternut, Pritika had entrepreneurship experience building some other products, which later got acquired. Time stamps of the conversation 00:00 Highlights 01:15 Introduction 01:50 Entry point in AI 03:04 Motivation behind Butternut AI 05:00 Can software engineering be automated? 06:36 Large Language Models in Software Development 08:00 AI as a replacement...

Aug 14, 202438 min

Instruction Tuning, Prompt Engineering and Self Improving Large Language Models | Dr. Swaroop Mishra

Swaroop is a research scientist at Google-Deepmind, working on improving Gemini. His research expertise includes instruction tuning and different prompt engineering techniques to improve reasoning and generalization performance in large language models (LLMs) and tackle induced biases in training. Before joining DeepMind, Swaroop graduated from Arizona State University, where his research focused on developing methods that allow models to learn new tasks from instructions. Swaroop has also inter...

Jul 09, 20241 hr 32 min

Role of Large Language Models in AI-driven medical research | Dr. Imon Banerjee

Dr. Imon Banerjee is an Associate Professor at Mayo Clinic in Arizona, working at the intersection of AI and healthcare research. Her research focuses on multi-modality fusion, mitigating bias in AI models specifically in the context of medical applications & more broadly building predictive models using different data sources. Before joining the Mayo Clinic, she was at Emory University as an Assistant Professor and at Stanford as a Postdoctoral fellow. Time stamps of the conversation 00:00 ...

Apr 23, 202447 min

Algorithmic Reasoning, Graph Neural Nets, AGI and Tips to researchers | Petar Veličković

Dr. Petar Veličković is a Staff Research Scientist at Googe DeepMind and an Affiliated lecturer at the University of Cambridge. He is known for his research contributions in graph representation learning; particularly graph neural networks and graph attention networks. At DeepMind, he has been working on Neural Algorithmic Reasoning which we talk about more in this podcast. Petar’s research has been featured in numerous media articles and has been impactful in many ways including Google Maps’s i...

Oct 27, 20231 hr 12 min

Combining Vision & Language in AI perception and the era of LLMs & LMMs | Dr. Yezhou Yang

Dr. Yezhou Yang is an Associate Professor at Arizona State University and director of the Active Perception Group at ASU. He has research interests in Cognitive Robotics and Computer Vision, and understanding human actions from visual input and grounding them by natural language. Prior to joining ASU, he completed his Ph.D. from the University of Maryland and his postdoctoral at the Computer Vision Lab and Perception and Robotics Lab. Timestamps of the conversation 00:01:02 Introduction 00:01:46...

Oct 10, 20231 hr 54 min

Risks of AI in real-world and towards Building Robust Security measures | Hyrum Anderson

Dr Hyrum Anderson is a Distinguished Machine Learning Engineer at Robust Intelligence. Prior to that, he was Principal Architect of Trustworthy Machine Learning at Microsoft where he also founded Microsoft’s AI Red Team; he also led security research at MIT Lincoln Laboratory, Sandia National Laboratories, and Mendiant, and was Chief Scientist at Endgame (later acquired by Elastic). He’s also the co-author of the book “Not a Bug, But with a Sticker” and his research interests include assessing t...

Jul 12, 202352 min

Being aware of Systematic Biases and Over-trust in AI | Meredith Broussard

Meredith is an associate professor at New York University and research director at the NYU Alliance for Public Interest Technology. Her research interests include using data analysis for good and ethical AI. She is also the author of the book “More Than a Glitch: Confronting Race, Gender, and Ability Bias in Tech” and we will discuss more about this with her in this podcast. Time stamps of the conversation 00:42 Introduction 01:17 Background 02:17 Meaning of “it is not a glitch” in the book titl...

Jul 10, 202337 min

P2 Working at DeepMind, Interview Tips & doing a PhD for a career in AI | Dr. David Stutz

Part-2 of my podcast with David Stutz. (Part-1: https://youtu.be/J7hzMYUcfto) David is a research scientist at DeepMind working on building robust and safe deep learning models. Prior to joining DeepMind, he was a PhD student at the Max Plank Institute of Informatics. He also maintains a fantastic blog on various topics related to machine learning and graduate life which is insightful to young researchers out there. 00:00:00 Working at DeepMind 00:08:20 Importance of Abstraction and Collaboratio...

Jul 10, 20231 hr 42 min

Negotiating Higher Salary for AI & Tech roles after Job Offer | Jordan Sale

Rora helps top AI researchers and professionals negotiate their pay -- often as they transition from academia into industry. Moving into tech is a huge transition for many PhDs and post-docs -- the pay is much more significant and the terms of employment are often quite different. In the past 5 years, the Rora team has helped over 1000 STEM professionals negotiate more than $10M in additional earnings from companies like DeepMind, OpenAI, Google Brain, and Anthropic -- and advocate for better ro...

Jul 09, 202358 min

P1 Adversarial robustness in Neural Networks, Quantization and working at DeepMind | David Stutz

Part-1 of my podcast with David Stutz. (Part-2: https://youtu.be/IumJcB7bE20) David is a research scientist at DeepMind working on building robust and safe deep learning models. Prior to joining DeepMind, he was a Ph.D. student at the Max Plank Institute of Informatics. He also maintains a fantastic blog on various topics related to machine learning and graduate life which is insightful to young researchers out there. Check out Rora: https://teamrora.com/jayshah Guide to STEM Ph.D. AI Researcher...

Jul 09, 20231 hr 32 min

Promises and Lies of ChatGPT - understanding how it works | Subbarao Kambhampati

Dr. Subbarao Kambhampati is a Professor of Computer Science at Arizona State University and the director of the Yochan lab where his research focuses on decision-making and planning, specifically in the context of human-aware AI systems. He has been named a fellow of AAAI, AAAS, and ACM in recognition of his research contributions and also received a distinguished alumnus award from the University of Maryland and IIT Madras. Check out Rora: https://teamrora.com/jayshah Guide to STEM Ph.D. AI Res...

Jun 07, 20232 hr 47 min

Building a company in middle of War, Pandemic and Economic Crisis | Karyna Naminas

Karyna Naminas is the CEO of Label Your Data which provides data annotation services to different organizations interested in developing AI-based solutions. Check out Rora: https://teamrora.com/jayshah Guide to STEM Ph.D. AI Researcher + Research Scientist pay: https://www.teamrora.com/post/ai-researchers-salary-negotiation-report-2023 Rora's negotiation philosophy: https://www.teamrora.com/post/the-biggest-misconception-about-negotiating-salary https://www.teamrora.com/post/job-offer-negotiatio...

Jun 04, 20231 hr 14 min

Video recommendations using Machine Learning at Facebook, News feed & Ads ranking | Amey Dharwadker

Amey Dharwadker works as a Machine Learning Tech Lead Manager at Meta, supporting Facebook's Video Recommendations Ranking team and working on building and deploying personalization models for billions of users. He has also been instrumental in driving a significant increase in user engagement and revenue for the company through his work on News Feed and Ads ranking ML models. As an experienced researcher, he has co-authored publications at various AI/ML conferences and patents in the fields of ...

Jun 04, 20231 hr 16 min

Using AI to improve maternal & child health in underserved communities of India | Aparna Taneja

Dr. Aparna Taneja works at Google Research in India on innovative projects driving real-world social impact. Her team collaborates with an NGO called ARMMAN with the mission to improve maternal and child health outcomes in underserved communities of India. Prior to Google she was a Post-Doc at Disney Research, Zurich, and has a PhD from the Computer Vision and Geometry Group in ETH Zurich and a Bachelor's in Computer Science from the Indian Institute of Technology, Delhi. Time stamps of the conv...

May 11, 20231 hr 15 min

Fixing fake news and misinformation online using Robust AI models | Prof. Srijan Kumar

Dr. Srijan Kumar is an Assistant professor at Georgia Tech with research interests in combating misinformation and harmful content on online platforms, building robust AI models prone to adversarial attacks, and behavior modeling for more accurate recommender systems. Before joining Georgia Tech, he was a postdoctoral fellow at Stanford University and completed his Ph.D. in computer science from the University of Maryland. He has received multiple awards for his research work, including Forbes 3...

May 03, 20231 hr 34 min

Combining knowledge of clinical medicine and Artificial Intelligence | Emma Rocheteau

Emma is a final-year medical student at the University of Cambridge and also pursuing her Ph.D. in Machine Learning. With her knowledge of clinical decision-making, she is working on research projects that leverage machine-learning techniques to improve clinical workflow. She will be taking her role as an academic doctor post her graduation. Time stamps of the conversation 00:00:00 Introduction 00:02:08 From clinical science to learning AI 00:13:15 Learning the basics of Artificial Intelligence ...

Mar 31, 20231 hr 37 min

Why are Transformer so effective in Large Language Models like ChatGPT

Understanding why and how transformers are so efficient in large language models nowadays such as #chatgpt and more. Watch the full podcast with Dr. Surbhi Goel here: https://youtu.be/stB0cY_fffo Find Dr. Goel on social media Website: https://www.surbhigoel.com/ Linkedin: https://www.linkedin.com/in/surbhi-goel-5455b25a Twitter: https://twitter.com/surbhigoel_?lang=en Learning Theory Alliance: https://let-all.com/index.html About the Host: Jay is a Ph.D. student at Arizona State University. Link...

Mar 29, 202310 min

History of Large Language Models, Trustworthy AI, ChatGPT & more | Dr. Anupam Datta

Anupam is the co-founder and President of TruEra and prior to that, he was a Professor at Carnegie Mellon University for 15 years. TruEra provides AI solutions that help enterprises use machine learning, improve and monitor model quality, and build trust. His research and other efforts are focused on privacy, fairness, and building trustworthy machine-learning models. He holds a Ph.D. in computer science from Stanford University and Bachelor’s degree in same from IIT Kharagpur in India. Time sta...

Feb 23, 202346 min

Theory of Machine Learning, Transformer models, ChatGPT & tips for research career | Dr. Surbhi Goel

Surbhi is an Assistant Professor at the University of Pennsylvania. She got her Ph.D. in Computer Science from UT Austin and prior to joining UPenn as an Assistant Professor, she was a postdoctoral researcher at Microsoft Research NYC in the Machine Learning group. She has research expertise in theoretical computer science & machine learning, with a particular focus on developing theoretical foundations for modern deep learning paradigms. She also is a part of building the Learning Theory Al...

Feb 16, 20231 hr 31 min

Making Machine Learning more accessible | Sebastian Raschka

Sebastian Raschka​ is the lead AI educator at GridAI. He is the author of the book "Machine Learning with PyTorch and Scikit Learn" and also a few other books that cover the fundamentals of #machinelearning and #deeplearning techniques and implementing them with Python. He is also an Assistant Professor of Statistics at the University of Wisconsin-Madison and has been actively involved in making ML more accessible to beginners through his blogs, video tutorials, tweets and of course his books. H...

Dec 29, 20221 hr 23 min

Current and future state of Artificial Intelligence in Healthcare | Dr. Matthew Lungren

Dr. Matthew Lungren is currently the Chief Medical Information Officer at Nuance Communications - Microsoft company, and also holds part-time appointments with the University of California San Francisco as an Associate Clinical Professor and also as adjunct faculty at Stanford and Duke University. He is a radiologist by training and has led and contributed to multiple projects that use AI and deep learning for medical imaging and precision medicine. Time stamps from the conversation 00:00:55 Int...

Dec 28, 20221 hr 6 min

AI for improving clinical trials & drug development, entrepreneurship & AI safety | Charles Fisher

Dr. Charles Fisher is the CEO and Founder of Unlearn(dot)AI which helps in faster drug development and efficient clinical trials. This year they also raised a series B funding of 50 million dollars. Charles holds a Ph.D. in biophysics from Harvard University and prior to founding Unlearn, he did his Postdoctorate at Boston University, followed by being a principal scientist at Pfizer and a machine learning engineer at a virtual reality company in silicon valley. Time stamps of the conversation 0...

Oct 31, 20221 hr 12 min

Recommendation systems, being an Applied Scientist & Building a good research career | Mina Ghashami

Mina Ghashami is an Applied Scientist in the Alexa Video team at Amazon Science alongside being a lecturer at Stanford University. Prior to joining Amazon, she was a Research Scientist at Visa Research working on recommendation systems built on transactions from users and a few other projects. She completed her Ph.D. in Computer Science from the University of Utah followed by a PostDoctoral position at Rutgers University. At Amazon, she is mainly focused on Video-based ranking recommendation sys...

Sep 14, 20221 hr 15 min

Role of a Principal Scientist do & AI in medicine | Alberto Santamaria-Pang, Microsoft

Alberto Santamaria-Pang is a Principal Applied Data Scientist at Microsoft. He did his Ph.D. in computer science from the University of Houston and has a long experience in research and development on various AI projects including but not limited to medical imaging and deep learning. Prior to Microsoft, he was a principal scientist at GE research. He has led many research projects in industry and also government-funded projects, a few of which we will be discussing today. Time stamps of conversa...

Sep 12, 20221 hr 34 min

Explainability, Human Aware AI & sentience in large language models | Dr. Subbarao Kambhampati

Are large language models really sentient or conscious? What is explainability (XAI) and how can we create human-aware AI systems for collaborative tasks? Dr. Subbarao Kambhampati sheds some light on these topics, generating explanations for human-in-loop AI systems and understanding 'intelligence' in context to AI systems. He is a Prof of Computer Science at Arizona State University and director of the Yochan lab at ASU where his research focuses on decision-making and planning specifically in ...

Jun 27, 20222 hr 25 min
For the best experience, listen in Metacast app for iOS or Android