INDEX
Negative Logits
paas
-0.08
anim
-0.08
igur
-0.08
herken
-0.08
Dank
-0.07
zo
-0.07
_collection
-0.07
determin
-0.07
haf
-0.07
collection
-0.07
POSITIVE LOGITS
inappropriate
0.13
unnecessarily
0.12
unexpected
0.12
Unexpected
0.11
unnecessary
0.11
unexpected
0.11
prematurely
0.11
Unexpected
0.11
unsolicited
0.11
incorrect
0.10
Activations Density 0.187%