INDEX
Explanations
phrases related to gambling and betting
New Auto-Interp
Head Attr Weights
0:0.02
1:0.03
2:0.11
3:0.29
4:0.02
5:0.02
6:0.07
7:0.07
8:0.05
9:0.14
10:0.06
11:0.07
Negative Logits
ギ
-1.20
Reviewer
-1.18
Anonymous
-1.15
Far
-1.14
Advertisement
-1.09
ODY
-1.07
ん
-1.07
�
-1.06
Items
-1.03
Patch
-1.01
POSITIVE LOGITS
hedon
2.01
sake
1.42
llah
1.36
¯
1.30
schild
1.21
agate
1.19
asio
1.17
lished
1.15
mberg
1.14
hots
1.13
Activations Density 0.004%