M. Konstantinov
Reverse engineering the Gumbel Max Trick
Intro
I recently learned about the so-called Gumbel-Max and Gumbel-Softmax tricks. Essentially, the Gumbel-Max trick says that if we have a categorical distribution $\vec{\pi} = \pi_1, \ldots, \pi_K$ and i.i.d. $\text{Gumbel}(0, 1)$-distributed random variables $G_i$, $1 \le i \le K$, then
$$ \forall k \quad \mathbb{P}(G_k + \log(\pi_k) = \max\{G_i + \log(\pi_i) \colon 1 \le i \le K\}) = \pi_k. $$
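The identity can be checked empirically. The following is a minimal sketch, assuming NumPy is available; the function name `gumbel_max_sample`, the example probabilities, and the sample size are illustrative choices and not part of the original post. It adds Gumbel(0, 1) noise to the log-probabilities, takes the argmax, and compares the resulting category frequencies to the target distribution.

```python
import numpy as np

def gumbel_max_sample(pi, n_samples, rng):
    """Draw category indices as argmax_i (G_i + log(pi_i))."""
    pi = np.asarray(pi, dtype=float)
    # i.i.d. Gumbel(0, 1) noise: one row per sample, one column per category
    g = rng.gumbel(loc=0.0, scale=1.0, size=(n_samples, pi.size))
    # The index of the largest perturbed log-probability is the sampled category
    return np.argmax(g + np.log(pi), axis=1)

rng = np.random.default_rng(0)           # fixed seed for reproducibility
pi = np.array([0.1, 0.2, 0.3, 0.4])      # hypothetical categorical distribution
samples = gumbel_max_sample(pi, 200_000, rng)

# Empirical frequency of each category; should be close to pi
empirical = np.bincount(samples, minlength=pi.size) / samples.size
print("target:   ", pi)
print("empirical:", empirical)
```

With a few hundred thousand samples, the empirical frequencies should agree with the target probabilities to roughly two decimal places, which is what the statement above predicts.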
Last updated on Jul 24, 2023