Programmers Quickie - podcast cover

Programmers Quickie

Software Engineeringblog.code-code-code.com
Software Engineering Best Practices, System Design, High Scale, Algorithms, Math, Programming Languages, Statistics, Machine Learning, Databases, Front Ends, Frameworks, Low Level Machine Structure, Papers and Computing, Computer Science Book Reviews - Everything!

Episodes

🤖 DeepSeek-V3: A 671B Parameter Mixture-of-Experts Language Model

A 671B parameter Mixture-of-Experts language model. It highlights the model's architecture, including its innovative load balancing and multi-token prediction strategies, and its efficient training process using FP8 precision. Benchmark results demonstrate DeepSeek-V3's strong performance compared to other open-source and some closed-source models, particularly in math and code tasks. The document also provides instructions for running DeepSeek-V3 locally using various frameworks and hardware, i...

Dec 27, 202430 min

LLM Agents

To LLM agents or not that is the question.

Dec 20, 202426 min

Awk

awk

Dec 15, 202415 min
For the best experience, listen in Metacast app for iOS or Android
Open in Metacast