INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.07
1:0.07
2:0.09
3:0.07
4:0.07
5:0.08
6:0.09
7:0.08
8:0.09
9:0.08
10:0.07
11:0.08
Negative Logits
adelphia
-2.77
gren
-2.74
destro
-2.60
felon
-2.50
rimp
-2.46
greed
-2.43
scorp
-2.39
usterity
-2.39
uncle
-2.37
blade
-2.30
POSITIVE LOGITS
Communities
2.85
Cy
2.84
Ev
2.75
Cong
2.74
Mit
2.71
Diet
2.63
Ples
2.57
Met
2.57
Latvia
2.56
Athe
2.56
Activations Density 0.000%
No Known Activations
This feature has no known activations.