INDEX
Negative Logits
having
-0.08
%i
-0.08
-loved
-0.08
pass
-0.08
الدولة
-0.08
owed
-0.07
icias
-0.07
ऱ
-0.07
я
-0.07
냐
-0.07
POSITIVE LOGITS
Prevent
0.09
Zimmer
0.09
见
0.09
privind
0.08
Scheduler
0.08
presentan
0.08
detall
0.08
spokesperson
0.07
Weston
0.07
Stil
0.07
Activations Density 0.004%