INDEX
Negative Logits
人为
-0.09
gevallen
-0.08
मामला
-0.08
cases
-0.08
েক্ষ
-0.08
تیا
-0.07
Martin
-0.07
deps
-0.07
violation
-0.07
-Holland
-0.07
POSITIVE LOGITS
બદ
0.09
પસંદ
0.08
wheelchair
0.08
redesigned
0.08
ADHD
0.08
સ
0.08
_plain
0.08
simplifying
0.08
Plain
0.08
dementia
0.08
Activations Density 0.028%