INDEX
Explanations
No Explanations Found
Head Attr Weights
0:0.07
1:0.05
2:0.08
3:0.09
4:0.09
5:0.08
6:0.07
7:0.08
8:0.09
9:0.07
10:0.09
11:0.08
Negative Logits
esides      -1.65
Dickinson   -1.56
Ct          -1.55
Footnote    -1.51
Yok         -1.45
STD         -1.45
obyl        -1.44
(),         -1.43
Poles       -1.43
Freak       -1.42
Positive Logits
quad        1.80
cry         1.61
rade        1.56
scribe      1.53
unity       1.50
rim         1.47
orate       1.44
panic       1.44
ray         1.43
gob         1.43
Activations Density: 0.000%
No Known Activations
This feature has no known activations.