INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.11
2:0.08
3:0.07
4:0.08
5:0.07
6:0.08
7:0.08
8:0.07
9:0.06
10:0.08
11:0.07
Negative Logits
packing
-1.73
Donation
-1.70
pregnancies
-1.69
APTER
-1.68
reb
-1.67
ovan
-1.65
incorpor
-1.65
wra
-1.65
awa
-1.64
encl
-1.63
POSITIVE LOGITS
atoon
1.84
agus
1.73
Latvia
1.56
enemy
1.56
UTE
1.55
aunt
1.55
undo
1.51
Scarborough
1.51
ense
1.48
netflix
1.48
Activations Density 0.000%
No Known Activations
This feature has no known activations.