INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.09
1:0.06
2:0.08
3:0.08
4:0.07
5:0.09
6:0.07
7:0.08
8:0.09
9:0.08
10:0.08
11:0.07
Negative Logits
Leilan
-1.88
GOODMAN
-1.83
uther
-1.75
orers
-1.60
2020
-1.57
pha
-1.57
Glob
-1.55
acion
-1.54
ivity
-1.52
anguages
-1.51
POSITIVE LOGITS
ofi
1.76
helm
1.73
Bos
1.67
sbm
1.66
Rus
1.66
lopp
1.62
withd
1.60
Wars
1.59
sic
1.55
bo
1.53
Activations Density 0.000%
No Known Activations
This feature has no known activations.