INDEX
Negative Logits
妮
-0.09
.)↵↵
-0.08
panic
-0.08
ciment
-0.08
Runner
-0.08
){↵↵-0.08
alara
-0.08
eerie
-0.08
acara
-0.07
Artifact
-0.07
POSITIVE LOGITS
responsibly
0.10
verantwort
0.09
alku
0.09
рот
0.09
Rot
0.08
kontroll
0.08
replacement
0.08
roten
0.08
thy
0.07
Rot
0.07
Activations Density 0.011%