BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs - podcast episode cover

BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs

Apr 29, 2025•21 min•Ep. 718
--:--
--:--
Download Metacast podcast app
Listen to this episode in Metacast mobile app
Don't just listen to podcasts. Learn from them with transcripts, summaries, and chapters for every episode. Skim, search, and bookmark insights. Learn more

Episode description

🤗 Upvotes: 25 | cs.CL, cs.LG

Authors:
Hongyu Wang, Shuming Ma, Furu Wei

Title:
BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs

Arxiv:
http://arxiv.org/abs/2504.18415v1

Abstract:
Efficient deployment of 1-bit Large Language Models (LLMs) is hindered by activation outliers, which complicate quantization to low bit-widths. We introduce BitNet v2, a novel framework enabling native 4-bit activation quantization for 1-bit LLMs. To tackle outliers in attention and feed-forward network activations, we propose H-BitLinear, a module applying an online Hadamard transformation prior to activation quantization. This transformation smooths sharp activation distributions into more Gaussian-like forms, suitable for low-bit representation. Experiments show BitNet v2 trained from scratch with 8-bit activations matches BitNet b1.58 performance. Crucially, BitNet v2 achieves minimal performance degradation when trained with native 4-bit activations, significantly reducing memory footprint and computational cost for batched inference.

For the best experience, listen in Metacast app for iOS or Android