INDEX
Negative Logits
rema
-0.09
uggest
-0.08
delight
-0.08
recated
-0.08
enga
-0.08
.myapplication
-0.08
ậu
-0.08
“…
-0.08
ukh
-0.08
은
-0.07
POSITIVE LOGITS
(remove
0.11
掉
0.10
entfernen
0.10
(Delete
0.10
khỏi
0.09
Removal
0.09
removing
0.09
elimin
0.09
(delete
0.09
removal
0.09
Activations Density 0.060%