INDEX
Explanations
phrases indicating strong opinions or emotions
Negative Logits
URI           -0.15
ÑĤол          -0.14
_argv         -0.14
Stable        -0.14
pong          -0.14
lsen          -0.14
conscient     -0.13
Convers       -0.13
Disposable    -0.13
mutable       -0.13
Positive Logits
strong        0.28
sharp         0.25
ac            0.25
vir           0.24
forth         0.23
force         0.23
ca            0.23
measured      0.23
pointed       0.23
vit           0.23
Activations Density 0.239%