INDEX
Negative Logits
answer
-0.08
answers
-0.07
69
-0.07
Rap
-0.06
limburg
-0.06
xbb
-0.06
intro
-0.06
řik
-0.06
called
-0.06
bombs
-0.06
POSITIVE LOGITS
physician
0.14
physicians
0.12
Physician
0.11
Physicians
0.10
priest
0.08
ian
0.08
IN
0.07
ién
0.07
pistol
0.07
گردید
0.07
Activations Density 0.004%