INDEX
Explanations
No Explanations Found
Head Attr Weights
0:0.07
1:0.07
2:0.10
3:0.07
4:0.07
5:0.08
6:0.08
7:0.09
8:0.09
9:0.07
10:0.07
11:0.09
Negative Logits
raviolet     -2.06
Mouth        -1.91
lvl          -1.82
Squadron     -1.82
Painter      -1.80
ツ           -1.79
itars        -1.74
ackle        -1.70
Pig          -1.69
penetrate    -1.67
Positive Logits
fortunes     2.11
conom        1.94
gnu          1.80
most         1.72
scheme       1.63
destiny      1.62
mone         1.60
constitu     1.60
omo          1.60
phony        1.60
Activations Density: 0.000%
No Known Activations
This feature has no known activations.