Optimizing Qwen3 CPU ONLY inference on Tanzu Platform: Cloud Foundry Weekly: Ep 55 - podcast episode cover

Optimizing Qwen3 CPU ONLY inference on Tanzu Platform: Cloud Foundry Weekly: Ep 55

May 21, 20251 hr 5 minEp. 55
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

Hot off the presses in model releases - we will explore the Qwen3-30b-a3b MoE model running on the Tanzu Platform. Early testing shows it performs exceptionally well on somewhat older enterprise-grade server CPUs (aka Cascade Lake). This show will provide some insights on how enterprises can use their existing server infrastructure to start their intelligent application modernization efforts.

For the best experience, listen in Metacast app for iOS or Android