INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.07
2:0.09
3:0.08
4:0.09
5:0.08
6:0.06
7:0.08
8:0.07
9:0.07
10:0.08
11:0.07
Negative Logits
Dixon
-3.12
Transgender
-2.82
Vaughan
-2.76
Revision
-2.75
Sparks
-2.65
Shade
-2.63
Vision
-2.62
JC
-2.55
Caller
-2.49
Toll
-2.43
POSITIVE LOGITS
五
2.94
)].
2.89
amiya
2.80
urion
2.69
�
2.68
rene
2.52
三
2.52
[|
2.49
�
2.48
�
2.44
Activations Density 0.000%
No Known Activations
This feature has no known activations.