INDEX
Explanations
No Explanations Found
Head Attr Weights
Head  Weight
0     0.07
1     0.05
2     0.09
3     0.09
4     0.08
5     0.10
6     0.10
7     0.08
8     0.07
9     0.08
10    0.08
11    0.07
Negative Logits
almonds      -1.68
differing    -1.59
foregoing    -1.52
spew         -1.51
mul          -1.50
Neo          -1.49
contrad      -1.46
continuing   -1.46
prose        -1.45
Chick        -1.44
Positive Logits
ateur        1.93
anan         1.88
ueller       1.76
ector        1.76
arden        1.73
inator       1.72
iris         1.72
schild       1.71
heid         1.70
ancy         1.70
Activations Density: 0.000%
No Known Activations
This feature has no known activations.