INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.08
2:0.08
3:0.08
4:0.08
5:0.07
6:0.09
7:0.08
8:0.07
9:0.08
10:0.09
11:0.08
Negative Logits
SAT
-3.58
onge
-3.24
Wald
-3.07
jong
-2.96
SUN
-2.88
NK
-2.83
Nord
-2.81
Niet
-2.78
Cong
-2.74
�
-2.73
POSITIVE LOGITS
Harris
2.71
Amir
2.69
Cyborg
2.56
Harris
2.52
/>
2.46
DC
2.45
Os
2.38
ush
2.38
Floyd
2.31
Hogan
2.30
Activations Density 0.000%
No Known Activations
This feature has no known activations.