INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.09
1:0.07
2:0.08
3:0.08
4:0.08
5:0.09
6:0.07
7:0.08
8:0.08
9:0.08
10:0.07
11:0.08
Negative Logits
Rounds
-1.72
Duration
-1.71
Bench
-1.65
tab
-1.64
weekly
-1.57
Materials
-1.57
プ
-1.56
plex
-1.55
�
-1.54
sem
-1.53
POSITIVE LOGITS
civilian
1.60
.?
1.56
oples
1.55
eous
1.55
citiz
1.51
ggle
1.46
Daesh
1.46
Katrina
1.44
allegiance
1.43
JFK
1.42
Activations Density 0.000%
No Known Activations
This feature has no known activations.