INDEX
Explanations
Words related to alcohol, specifically types of liquor and their derivatives
Negative Logits
awai      -0.18
ropol     -0.17
izu       -0.16
èī¯       -0.16
ÑģÑĤан    -0.16
.slim     -0.14
WND       -0.14
agn       -0.14
ÑĭÑģ      -0.14
esp       -0.14
Positive Logits
utenant   0.20
ardy      0.19
RARY      0.18
berman    0.18
ège       0.17
entious   0.17
ensor     0.16
infeld    0.16
ecycle    0.16
ARGER     0.15
Activations Density: 0.033%