How Fal.ai Went From Inference Optimization to Hosting Image and Video Models - podcast episode cover

How Fal.ai Went From Inference Optimization to Hosting Image and Video Models

Jul 25, 202553 minEp. 1541
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

Fal.ai, once focused on machine learning infrastructure, has evolved into a major player in generative media. In this episode of The New Stack Agents, hosts speak with Fal.ai CEO Burkay Gur and investor Glenn Solomon of Notable Capital. Originally aiming to optimize Python runtimes, Fal.ai shifted direction as generative AI exploded, driven by tools like DALL·E and ChatGPT. Today, Fal.ai hosts hundreds of models—from image to audio and video—and emphasizes fast, optimized inference to meet growing demand.

Speed became Fal.ai’s competitive edge, especially as newer generative models require GPU power not just for training but also for inference. Solomon noted that while optimization alone isn't a sustainable business model, Fal’s value lies in speed and developer experience. Fal.ai offers both an easy-to-use web interface and developer-focused APIs, appealing to both technical and non-technical users.

Gur also addressed generative AI’s impact on creatives, arguing that while the cost of creation has plummeted, the cost of creativity remains—and may even increase as content becomes easier to produce.

Learn more from The New Stack about AI’s impact on creatives:

AI Will Steal Developer Jobs (But Not How You Think) 

How AI Agents Will Change the Web for Users and Developers 

Join our community of newsletter subscribers to stay on top of the news and at the top of your game. 

For the best experience, listen in Metacast app for iOS or Android