Scaling Large ML Models to Small Devices with Atila Orhon - podcast episode cover

Scaling Large ML Models to Small Devices with Atila Orhon

May 07, 2024Transcript available on Metacast
--:--
--:--
Listen in podcast apps:

Episode description

The size of ML models is growing into the many billions of parameters. This poses a challenge for running inference on non-dedicated hardware like phones and laptops. Argmax is a startup focused on developing methods to run large models on commodity hardware. A key observation behind their strategy is that the largest models are getting

The post Scaling Large ML Models to Small Devices with Atila Orhon appeared first on Software Engineering Daily.