INDEX
Negative Logits
ikler
-0.08
striking
-0.08
deploying
-0.08
TRT
-0.08
-0.08
upgrades
-0.07
trolling
-0.07
Kelly
-0.07
Fortuna
-0.07
Homemade
-0.07
POSITIVE LOGITS
usst
0.08
substantive
0.08
posebej
0.08
hz
0.07
について
0.07
todo
0.07
substant
0.07
之外
0.07
(...)
0.07
Saw
0.07
Activations Density 0.006%