INDEX
Negative Logits
invention
-0.08
transmission
-0.08
inventions
-0.08
confidentiality
-0.08
safegu
-0.08
repetition
-0.08
Transmission
-0.08
invented
-0.08
Exceptions
-0.08
Invent
-0.08
POSITIVE LOGITS
demeanor
0.10
Orientierung
0.09
macroph
0.09
Moe
0.09
colère
0.09
beige
0.08
induc
0.08
riv
0.08
Zombie
0.08
polarized
0.08
Activations Density 0.003%