INDEX
Negative Logits
ben
-0.10
prefers
-0.08
طوال
-0.08
uzun
-0.08
(undefined
-0.08
promin
-0.08
Stayed
-0.08
Rang
-0.08
�
-0.08
ngon
-0.08
POSITIVE LOGITS
enius
0.08
±
0.07
dp
0.07
±
0.07
Naval
0.07
�
0.07
Addison
0.07
直
0.07
�
0.07
roads
0.07
Activations Density 0.001%