INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.09
1:0.07
2:0.08
3:0.09
4:0.09
5:0.07
6:0.09
7:0.07
8:0.06
9:0.08
10:0.07
11:0.08
Negative Logits
lear
-1.60
nig
-1.47
orth
-1.44
glutamate
-1.35
nitrogen
-1.34
1929
-1.31
srf
-1.31
detox
-1.31
erential
-1.28
Utt
-1.25
POSITIVE LOGITS
%"
1.55
invocation
1.42
solicitation
1.35
Patreon
1.35
/,
1.34
succeed
1.33
Forge
1.32
Stard
1.32
sponsoring
1.31
soDeliveryDate
1.31
Activations Density 0.000%
No Known Activations
This feature has no known activations.