INDEX
Explanations
phrases indicating abundance or quantity
Negative Logits
linger     -0.15
jem        -0.15
asthan     -0.14
biz        -0.14
ibe        -0.14
ARAM       -0.14
.obtain    -0.14
.habbo     -0.14
536        -0.13
arpa       -0.13
Positive Logits
arn        0.15
ways       0.15
forme      0.15
äll        0.15
Garner     0.15
eye        0.15
enough     0.14
other      0.14
arya       0.14
Rover      0.14
Activations Density 0.017%