INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.07
1:0.07
2:0.08
3:0.09
4:0.06
5:0.07
6:0.08
7:0.07
8:0.09
9:0.09
10:0.08
11:0.08
Negative Logits
��
-1.88
��
-1.86
merce
-1.74
��
-1.68
ulton
-1.63
umenthal
-1.62
pora
-1.60
phrine
-1.59
Invalid
-1.59
blush
-1.56
POSITIVE LOGITS
Zone
1.55
Member
1.54
Details
1.53
sits
1.52
otes
1.50
aron
1.50
fol
1.50
AMI
1.49
itely
1.49
speaker
1.49
Activations Density 0.000%
No Known Activations
This feature has no known activations.