INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.08
2:0.09
3:0.07
4:0.10
5:0.07
6:0.06
7:0.08
8:0.09
9:0.07
10:0.08
11:0.08
Negative Logits
merce
-2.19
ebted
-2.03
entimes
-1.87
ebin
-1.83
uilt
-1.79
challeng
-1.76
glomer
-1.74
untarily
-1.71
iosyncr
-1.65
quartered
-1.65
POSITIVE LOGITS
wolf
1.71
��
1.60
🙂
1.60
understatement
1.57
Orion
1.53
Availability
1.47
Reloaded
1.47
Samson
1.46
Sparks
1.45
straw
1.45
Activations Density 0.000%
No Known Activations
This feature has no known activations.