INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.09
2:0.08
3:0.08
4:0.07
5:0.08
6:0.07
7:0.09
8:0.08
9:0.06
10:0.08
11:0.09
Negative Logits
ashtra
-3.08
sylvania
-2.98
lvl
-2.92
tarian
-2.83
akespe
-2.80
enfranch
-2.69
将
-2.66
uthor
-2.64
aida
-2.57
vot
-2.57
POSITIVE LOGITS
stro
2.98
KC
2.78
stretched
2.70
Fischer
2.64
Hicks
2.64
ESC
2.57
AJ
2.53
Simpson
2.44
Carlos
2.44
Nick
2.42
Activations Density 0.000%
No Known Activations
This feature has no known activations.