INDEX
Explanations
No Explanations Found
New Auto-Interp
Head Attr Weights
0:0.08
1:0.07
2:0.08
3:0.09
4:0.08
5:0.07
6:0.08
7:0.08
8:0.08
9:0.07
10:0.08
11:0.09
Negative Logits
contrace
-2.94
Amph
-2.65
psychiat
-2.63
nep
-2.57
.","
-2.56
Lima
-2.52
lished
-2.50
Directive
-2.46
�
-2.44
Galile
-2.44
POSITIVE LOGITS
tick
2.54
anke
2.47
ingers
2.45
Chuck
2.44
glass
2.36
elligence
2.23
ocket
2.23
McDonnell
2.20
obby
2.18
Schr
2.17
Activations Density 0.000%
No Known Activations
This feature has no known activations.