INDEX
Explanations
geographical names of countries
New Auto-Interp
Head Attr Weights
0:0.06
1:0.11
2:0.08
3:0.07
4:0.07
5:0.07
6:0.09
7:0.08
8:0.10
9:0.05
10:0.10
11:0.08
Negative Logits
Gamble
-0.98
SPONSORED
-0.95
veter
-0.95
Robbins
-0.93
beforehand
-0.92
Fargo
-0.92
Glass
-0.91
riddled
-0.91
Reeves
-0.90
Speech
-0.90
POSITIVE LOGITS
三
1.19
ロ
1.19
kk
1.14
ibur
1.08
asma
1.04
rador
1.03
inner
1.03
�
1.02
�
1.02
�
1.00
Activations Density 0.009%