INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.09
1:0.08
2:0.08
3:0.08
4:0.09
5:0.09
6:0.06
7:0.08
8:0.09
9:0.06
10:0.08
11:0.07
Negative Logits
eatures
-2.13
gaard
-2.10
withd
-1.89
�
-1.88
properties
-1.84
compl
-1.83
stood
-1.81
ohl
-1.77
iband
-1.72
ancies
-1.71
POSITIVE LOGITS
enslaved
1.83
Negro
1.76
circus
1.75
emic
1.74
Invasion
1.74
rob
1.73
�
1.68
threatening
1.68
�
1.66
Mafia
1.65
Activations Density 0.000%
No Known Activations
This feature has no known activations.