Explanations
phrases indicating an increase or emphasis on quantity
Negative Logits
Token     Logit
enet      -0.17
arna      -0.16
chner     -0.16
аниц      -0.14
indi      -0.14
erna      -0.14
296       -0.14
.accel    -0.14
ohn       -0.14
vant      -0.14
Positive Logits
Token     Logit
-than     0.17
uger      0.16
nown      0.15
oplast    0.15
ument     0.14
ledged    0.14
CJK       0.14
Shepherd  0.14
gency     0.14
Lens      0.14
Activations Density: 0.019%