INDEX
Explanations
the imperative form of verbs that suggest action or movement
Head Attr Weights
0:0.07
1:0.07
2:0.07
3:0.08
4:0.09
5:0.07
6:0.07
7:0.07
8:0.09
9:0.08
10:0.08
11:0.09
Negative Logits
phans:-2.10
ramid:-2.07
Memories:-2.00
Pandora:-1.98
��:-1.94
Lost:-1.93
sbm:-1.88
amia:-1.87
�:-1.85
Pension:-1.80
Positive Logits
bys:2.02
industrialized:1.97
nort:1.88
Collider:1.86
hunters:1.86
empir:1.85
idiots:1.84
Australians:1.81
Europeans:1.77
hunter:1.77
Activations Density 0.000%