INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.08
2:0.08
3:0.07
4:0.09
5:0.07
6:0.08
7:0.08
8:0.07
9:0.08
10:0.09
11:0.07
Negative Logits
ortunate
-2.97
anth
-2.89
ipes
-2.50
ulus
-2.49
anth
-2.36
Anth
-2.26
Proto
-2.22
substances
-2.19
Orig
-2.19
pse
-2.18
POSITIVE LOGITS
Reno
2.79
tu
2.63
scaled
2.55
2.47
condem
2.38
Metatron
2.38
dominating
2.30
Kendall
2.29
forearm
2.26
shutter
2.23
Activations Density 0.000%
No Known Activations
This feature has no known activations.